Pre-built wheels for FlexGEMM - Efficient sparse convolution based on Triton.
Choose the configuration matching your CUDA and PyTorch versions:
pip install flexgemm --find-links https://pozzettiandrea.github.io/flexgemm-wheels/cu128-torch291/