Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove liger loss in favor of liger kernel
#4364 opened Oct 29, 2025 by sergiopaniego Loading…
5 tasks
Add support for Trackio completions logging in GRPOTrainer
#4359 opened Oct 28, 2025 by taha-yassine Loading…
2 of 5 tasks
Openenv wordle example
#4357 opened Oct 28, 2025 by burtenshaw Loading…
Support chat_template_kwargs
#4350 opened Oct 27, 2025 by pramodith Loading…
3 of 5 tasks
[experimental] GOLD Trainer
#4349 opened Oct 27, 2025 by kashif Draft
5 tasks
[OpenENV] Openenv rollout_func signature proposal
#4344 opened Oct 27, 2025 by kashif Loading…
5 tasks
docs: Add RapidFire AI integration guide
#4340 opened Oct 26, 2025 by kamran-rapidfireAI Loading…
5 tasks done
Update SFT QLoRA notebook with **14B** model on free Colab
#4336 opened Oct 24, 2025 by sergiopaniego Loading…
5 tasks
feat(trainer): add PAPOTrainer for preference-based optimization
#4334 opened Oct 24, 2025 by SolarWindRider Loading…
4 tasks done
wip - env
#4320 opened Oct 22, 2025 by qgallouedec Loading…
5 tasks
refactor: simplify parameter freezing in modeling_base.py
#4305 opened Oct 20, 2025 by Ki-Seki Loading…
2 of 5 tasks
GRPO: ScaleRL -> Support casting LM Head to FP32
#4303 opened Oct 18, 2025 by pramodith Loading…
4 of 5 tasks
[SFT] Log mean token accuracy from Liger kernel
#4302 opened Oct 18, 2025 by kashif Loading…
5 tasks
Tool call
#4300 opened Oct 18, 2025 by qgallouedec Draft
5 tasks
Add CISPO loss option and documentation
#4298 opened Oct 16, 2025 by gustavorubim Loading…
Fix DPO Trainer Bug For Qwen2-VL (Issue 2660)
#4257 opened Oct 11, 2025 by FabianSchuetze Loading…
1 of 3 tasks
Online-dpo-ben
#4252 opened Oct 10, 2025 by burtenshaw Draft
5 tasks
[Utils] fix DataCollatorForChatML
#4231 opened Oct 8, 2025 by kashif Draft
ProTip! no:milestone will show everything without a milestone.