Block-sparse GPU kernels
Weβre releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run orders of magnitude faster than cuBLAS or cuSPARSE. Weβve used them to attain state-of-the-art results...
Log in to bookmark articles and create collections
                        Isabella News