Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Expose generation index to tool callables in GRPOTrainer
#4894 opened Jan 25, 2026 by lukehinds Loading…
4 of 5 tasks
Upgrade GitHub Actions to latest versions
#4893 opened Jan 24, 2026 by salmanmkc Loading…
docs: add DoRA (2402.09353) to Paper Index
#4892 opened Jan 24, 2026 by billycrapediem Loading…
3 of 5 tasks
[GRPO] feat: Geometric Sequence Masking
#4891 opened Jan 24, 2026 by LeonEricsson Draft
5 tasks
Fix grpo tool calling
#4890 opened Jan 23, 2026 by akshayballal95 Loading…
2 tasks done
fix(vLLM): Add tool calling support to VLLMClient.chat()
#4889 opened Jan 23, 2026 by kansalaman Loading…
1 of 2 tasks
GOLD training speed up
#4888 opened Jan 22, 2026 by 141forever Loading…
NeMo-Gym Integration
#4848 opened Jan 17, 2026 by cmunley1 Loading…
make dpo compatible with fsdp2
#4838 opened Jan 16, 2026 by flutist Loading…
4 of 5 tasks
feat: Support log_completion for swanlab backend
#4826 opened Jan 14, 2026 by ZiyiTsang Loading…
2 of 5 tasks
forward_masked_logits in SFTTrainer
#4794 opened Jan 8, 2026 by qgallouedec Draft
5 tasks
make dpo compatible with qwen3vl
#4773 opened Jan 4, 2026 by flutist Loading…
Extend CLI to orpo trainer
#4757 opened Dec 27, 2025 by murilo-cunha Loading…
3 of 5 tasks
fix: handle None eval_dataset in example code
#4756 opened Dec 27, 2025 by ciaoyizhen Loading…
1 of 4 tasks
perf: avoid output_hidden_states when only last_hidden_state is used
#4755 opened Dec 27, 2025 by ciaoyizhen Loading…
2 of 5 tasks
Clarify Accelerate usage in SFTTrainer documentation
#4744 opened Dec 23, 2025 by Likhita-17 Loading…
1 task done
fix minillm trainer
#4743 opened Dec 23, 2025 by t1101675 Loading…
5 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.