generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Expose generation index to tool callables in GRPOTrainer
#4894
opened Jan 25, 2026 by
lukehinds
Loading…
4 of 5 tasks
docs: add DoRA (2402.09353) to Paper Index
#4892
opened Jan 24, 2026 by
billycrapediem
Loading…
3 of 5 tasks
fix(vLLM): Add tool calling support to VLLMClient.chat()
#4889
opened Jan 23, 2026 by
kansalaman
Loading…
1 of 2 tasks
Add History-Aware Adaptive Difficulty Weighting (HA-DW) to GRPO
#4872
opened Jan 20, 2026 by
anonx3247
Loading…
feat: Support log_completion for swanlab backend
#4826
opened Jan 14, 2026 by
ZiyiTsang
Loading…
2 of 5 tasks
[GRPO] Add parquet logging for completions with individual rewards
#4818
opened Jan 13, 2026 by
qgallouedec
Loading…
Refactor KTO [3/N]: Extract dataset processing to _prepare_dataset method
#4788
opened Jan 8, 2026 by
albertvillanova
Loading…
Refactor KTO [2/N]: Improve config validation in KTOConfig
#4787
opened Jan 8, 2026 by
albertvillanova
Loading…
feat(sft): add generation-based evaluation support to SFTTrainer
#4768
opened Jan 2, 2026 by
CodersAcademy006
Loading…
fix: handle None eval_dataset in example code
#4756
opened Dec 27, 2025 by
ciaoyizhen
Loading…
1 of 4 tasks
perf: avoid output_hidden_states when only last_hidden_state is used
#4755
opened Dec 27, 2025 by
ciaoyizhen
Loading…
2 of 5 tasks
Clarify Accelerate usage in SFTTrainer documentation
#4744
opened Dec 23, 2025 by
Likhita-17
Loading…
1 task done
feat: Bidirectional masked importance sampling ratio (MIS) for IcePop
#4732
opened Dec 20, 2025 by
casinca
Loading…
4 of 5 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.