diff --git a/docs/source/index.md b/docs/source/index.md index 7088d42f3f..7d88ebf591 100644 --- a/docs/source/index.md +++ b/docs/source/index.md @@ -7,6 +7,25 @@ TRL is a full stack library where we provide a set of tools to train transformer language models with methods like Supervised Fine-Tuning (SFT), Group Relative Policy Optimization (GRPO), Direct Preference Optimization (DPO), Reward Modeling, and more. The library is integrated with 🤗 [transformers](https://github.com/huggingface/transformers). +
+ + thumbnail +

Supervised Fine-Tuning (SFT) Ministral-3 with QLoRA

+
+ + thumbnail +

GRPO Ministral-3 with QLoRA

+
+ + thumbnail +

OpenEnv Integration Guide

+
+ + thumbnail +

OpenEnv Wordle with GRPO

+
+
+ ## 🎉 What's New **OpenEnv Integration:** TRL now supports **[OpenEnv](https://huggingface.co/blog/openenv)**, the open-source framework from Meta for defining, deploying, and interacting with environments in reinforcement learning and agentic workflows.