This repository has been archived by the owner on Dec 18, 2024. It is now read-only.
XeTLA v0.3.5
v0.3.5
- Enhanced limitation checking.
- Refined GEMM APIs’ name.
- Supported GEMM APIs load B from SLM.
- Supported GEMM of any (odd) shapes.
- Supported Streaming-K.
- Enhanced L3 K-slicing support.
- Improved GEMM's performance for large-N (M, N, K) shapes.
- Fixed tile load/store bugs.
- Enhanced examples, tests, and updated documents.