This repository has been archived by the owner on Sep 12, 2024. It is now read-only.
v0.1.0
Pre-release
Pre-release
What's Changed
- feature: impl naive gpt2 by @hlhr202 in #51
- feature: implemented parallel inference for llama-rs, implemented naive sequential async inference for llama-cpp and rwkv-cpp by @hlhr202 in #52
Full Changelog: v0.0.37...v0.1.0