Menu

Post image 1
Post image 2
Post image 3
Post image 4
Post image 5
Post image 6
1 / 6
0

Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python

NVIDIA Technical Blog·Aart J.C. Bik·about 1 month ago
#3wXqY7dD
Reading 0:00
15s threshold

In a previous post , we introduced the Universal Sparse Tensor (UST) , enabling developers to decouple a tensor’s sparsity from its memory layout for greater flexibility and performance. We’re excited to announce the integration of the UST into nvmath-python v0.9.0 to accelerate sparse scientific and deep learning applications. This post provides a walkthrough of key UST features, implementation details, and performance overview, including: Zero-cost interoperability: Data-movement-free conversion with PyTorch, SciPy, and CuPy. Custom formats: Define novel sparsity schemes. Polymorphic operations: Sparsity-agnostic functions automatically use optimized kernels or generate custom sparse code—eliminating the need for manual coding of new formats. PyTorch injection: Easily inject UST performance benefits into existing PyTorch models. Transparent caching: Avoid JIT/LTO recompilation and replanning—amortizing overhead over subsequent repeated execution of the same operation.…

Continue reading — create a free account

Join HashtagPLUS to read full articles, follow hashtags, vote, and join the conversation.

Read More