This repository has been archived by the owner on Dec 18, 2024. It is now read-only.

XeTLA v0.3.5

@taozha2 taozha2 released this 28 Sep 06:08
· 37 commits to main since this release

v0.3.5

  • Enhanced limitation checking.
  • Refined the GEMM API names.
  • Added support for GEMM APIs loading matrix B from shared local memory (SLM).
  • Added support for GEMM with arbitrary (odd) shapes.
  • Added Streaming-K support.
  • Enhanced L3 K-slicing support.
  • Improved GEMM performance for (M, N, K) shapes with large N.
  • Fixed tile load/store bugs.
  • Enhanced examples and tests, and updated documentation.