Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python

1 / 6

Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python

NVIDIA Technical Blog·Aart J.C. Bik·about 1 month ago

#3wXqY7dD

#x5b #x2d #developertoolstechniques #hpcscientificcomputing #cuda #tensor

Reading 0:00

15s threshold

In a previous post , we introduced the Universal Sparse Tensor (UST) , enabling developers to decouple a tensor’s sparsity from its memory layout for greater flexibility and performance. We’re excited to announce the integration of the UST into nvmath-python v0.9.0 to accelerate sparse scientific and deep learning applications. This post provides a walkthrough of key UST features, implementation details, and performance overview, including: Zero-cost interoperability: Data-movement-free conversion with PyTorch, SciPy, and CuPy. Custom formats: Define novel sparsity schemes. Polymorphic operations: Sparsity-agnostic functions automatically use optimized kernels or generate custom sparse code—eliminating the need for manual coding of new formats. PyTorch injection: Easily inject UST performance benefits into existing PyTorch models. Transparent caching: Avoid JIT/LTO recompilation and replanning—amortizing overhead over subsequent repeated execution of the same operation.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Create free account Log in

Menu

Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python