v0.6.3.post1+rocm
Pre-release
github-actions released this 29 Oct 21:12
What's Changed
- Upstream merge 24 10 21 by @gshtras in #240
- Using the correct datatype on prefix prefill for fp8 kv cache by @gshtras in #242
- Update CMakeLists.txt by @gshtras in #244
- update block_manager usage in setup_cython by @saienduri in #243
- [Bugfix][Kernel][Misc] Basic support for SmoothQuant, symmetric case by @rasmith in #237
- Add fp8 support for llama model family on Navi4x by @qli88 in #245
- Custom all reduce fix mi250 by @omirosh in #247
- Upstream merge 24 10 28 by @gshtras in #248
New Contributors
- @saienduri made their first contribution in #243
- @qli88 made their first contribution in #245
- @omirosh made their first contribution in #247
Full Changelog: v0.6.2.post1+rocm...v0.6.3.post1+rocm