`drama_llama`

drama_llama is yet another Rust wrapper for llama.cpp. It is a work in progress and not intended for production use. The API will change.

For examples, see the bin folder. There are two example binaries.

Dittomancer - Chat with well represented personalities in the training.
Regurgitater - Test local language models for memorized content.

Supported Features

LLaMA 3 Support.
Iterators yielding candidates, tokens and pieces.
Stop criteria at regex, token sequence, and/or string sequence.
Metal support. CUDA may be enabled with the cuda and cuda_f16 features.
Rust-native sampling code. All sampling methods from llama.cpp have been translated.
N-gram based repetition penalties with custom exclusions for n-grams that should not be penalized.
Support for N-gram blocking with a default, hardcoded blocklist.

Contributing

Code is poetry. Make it pretty.
Respect is universal.
Use rustfmt.

Roadmap

Known issues

With LLaMA 3, safe vocabulary is not working yet so --vocab unsafe must be passed as a command line argument or VocabKind::Unsafe used for an Engine constructor.
The model doesn't load until genration starts, so there can be a long pause on first generation. However because mmap is used, on subsequent process launches, the model should already be cached by the OS.
Documentation is broken on docs.rs because llama.cpp's CMakeLists.txt generates code, and writing to the filesystem is not supported. For the moment use cargo doc --open instead. Others have fixed this by patching llama.cpp in their bindings, but I'm not sure I want to do that for now.

Generative AI Disclosure

Generative, AI, specifically Microsoft's Bing Copilot, GitHub Copilot, and Dall-E 3 were used for portions of this project. See inline comments for sections where generative AI was used. Completion was also used for getters, setters, and some tests. Logos were generated with Dall-E and post processed in Inkscape.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
assets		assets
bin		bin
models		models
src		src
tests/data		tests/data
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE.md		LICENSE.md
README.md		README.md
TERMS_OF_USE.md		TERMS_OF_USE.md
logo.svg		logo.svg
logo_inkscape.svg		logo_inkscape.svg
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`drama_llama`

Supported Features

Contributing

Roadmap

Known issues

Generative AI Disclosure

About

Releases

Packages

Languages

License

mdegans/drama_llama

Folders and files

Latest commit

History

Repository files navigation

drama_llama

Supported Features

Contributing

Roadmap

Known issues

Generative AI Disclosure

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`drama_llama`

Packages