-
Shows how declare, configure, compile, and run a CUTLASS GEMM using the Python interface
-
Shows how to fuse elementwise activation functions to GEMMs via the Python interface
-
02_pytorch_extension_grouped_gemm
Shows how to declare, compile, and run a grouped GEMM operation via the Python interface, along with how the emitted kernel can be easily exported to a PyTorch CUDA extension.
-
Shows how to declare, configure, compile, and run a CUTLASS Conv2d using the Python interface
-
Shows how to fuse elementwise activation functions to GEMMs via the Python Epilogue Visitor interface
python
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||