# moerl-zoo
Custom patches and utilities for MoERL — an efficient RL fine-tuning framework for Mixture of Experts (MoE) LLMs.
moerl-zoo is a toolkit inspired by unsloth-zoo, designed to serve the needs of MoERL. It includes lightweight patches, wrappers, and utilities specifically adapted for MoE reinforcement learning fine-tuning pipelines.
🦥 This project is based on [unsloth-zoo](https://github.com/unslothai/unsloth-zoo), created by Daniel Han-Chen and the Unsloth team. All modifications are made under the terms of the LGPL-3.0-or-later license.
## Features

- 🧩 Custom monkey patches and related utilities designed for MoERL compatibility (see the sketch after this list)
- 🚀 Supports efficient MoE-based RL fine-tuning
- ⚡ Fully compatible with `vllm`, `bitsandbytes`, `transformers`, `trl`, and more
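
To make the patching bullet concrete, here is a minimal, self-contained sketch of the idempotent monkey-patching pattern a toolkit like this relies on. Every name in it (`patch_function`, `moe_forward`, `add_logging`, the `_moerl_patched` marker) is an illustrative assumption, not the actual moerl-zoo API:

```python
import functools
import types

def patch_function(module, name, make_wrapper):
    """Replace `module.<name>` with a wrapped version, applying at most once."""
    original = getattr(module, name)
    if getattr(original, "_moerl_patched", False):
        return  # already patched; calling this twice is a no-op
    wrapped = functools.wraps(original)(make_wrapper(original))
    wrapped._moerl_patched = True
    setattr(module, name, wrapped)

# Demo on a stand-in "module"; a real patch would target e.g. a transformers
# or vllm module object instead of this SimpleNamespace.
fake_module = types.SimpleNamespace(moe_forward=lambda x: x * 2)

def add_logging(original):
    def inner(*args, **kwargs):
        print("moe_forward called")  # side effect added by the patch
        return original(*args, **kwargs)
    return inner

patch_function(fake_module, "moe_forward", add_logging)
print(fake_module.moe_forward(3))  # prints "moe_forward called", then 6
```

Keeping the patch idempotent matters because several entry points (trainer setup, model loading) may each try to apply the same patch.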
## Installation

```bash
pip install "moerl_zoo @ git+https://github.com/slowfastai/moerl-zoo.git"
```
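
Because the library works by patching other packages, import order usually matters: as with unsloth-zoo, the patches must be in place before the patched libraries construct a model. The snippet below is a hypothetical quick-start; the assumption that importing `moerl_zoo` applies patches as a side effect is ours, not documented behavior:

```python
# Hypothetical quick-start. The import-time side effect is an assumption
# about how the package is wired, not a documented moerl-zoo contract.
import moerl_zoo  # noqa: F401  (assumed to install MoE/RL patches on import)

from transformers import AutoModelForCausalLM

# Load an MoE model only AFTER the patches are in place.
model = AutoModelForCausalLM.from_pretrained("mistralai/Mixtral-8x7B-v0.1")
```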
## Acknowledgements

Huge thanks to the Unsloth project for their foundational work in efficient LLM optimization ❤️. MoERL builds upon their vision while tailoring functionality toward MoE RL.
## License

This project is licensed under the GNU Lesser General Public License v3.0 or later (LGPL-3.0-or-later).
The original base project, 🦥 unsloth-zoo, is licensed under the GNU Lesser General Public License v3.0 or later (LGPL-3.0-or-later).
All original LICENSE and NOTICE files have been preserved where applicable.