Skip to content

v0.1.4 July release

Latest
Compare
Choose a tag to compare
@valarLip valarLip released this 13 Jul 09:00
· 132 commits to main since this release
980c240
  1. mxfp4 enable for gfx950, including GEMM, MoE, and per1x32 Quant
  2. multi-gpu tuning enable for most kind of GEMMs
  3. fp8 all reduce
  4. numbers of triton kernels

What's Changed

New Contributors

Full Changelog: v0.1.3...v0.1.4