Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Adding video llm fine-tuning example
#2336 opened Nov 7, 2024 by mfarre Loading…
5 tasks
🫴 Better guide users in error reporting
#2327 opened Nov 5, 2024 by qgallouedec Loading…
10 tasks
🪡 Various RLOO fixes
#2325 opened Nov 4, 2024 by qgallouedec Loading…
5 tasks
Implementation DiscoPOP Loss
#2323 opened Nov 4, 2024 by fanconic Loading…
3 of 5 tasks
[Draft] Add eval_data_collator arg
#2311 opened Nov 3, 2024 by pdufour Draft
1 of 5 tasks
[Draft] Add autocast to prediction_step for SFTTrainer
#2310 opened Nov 3, 2024 by pdufour Draft
2 of 5 tasks
Added GitHub Star api and added back to top button in the file.
#2297 opened Oct 30, 2024 by hackit-coder Loading…
3 of 5 tasks
Update log_reports.py
#2289 opened Oct 28, 2024 by Yash-2707 Draft
🤏 New models for tests
#2287 opened Oct 27, 2024 by qgallouedec Draft
5 tasks
🔀 Add MergeModelCallBack
#2282 opened Oct 25, 2024 by August-murr Loading…
3 of 5 tasks
Asynchronous RLHF: Faster and More Efficient Online DPO
#2278 opened Oct 24, 2024 by mnoukhov Loading…
1 of 3 tasks
[GKD] add ULD type loss to GKD Trainer
#2263 opened Oct 22, 2024 by kashif Loading…
Add Error Handling for Stale Issue Script in GitHub Action
#2258 opened Oct 21, 2024 by Ananya54321 Loading…
2 of 5 tasks
Data mixer Integration
#2240 opened Oct 16, 2024 by August-murr Draft
3 of 5 tasks
[online-DPO] evaluaiton step error 🐛 bug Something isn't working
#2231 opened Oct 15, 2024 by kashif Draft
Add VAS to TRL ✨ enhancement New feature or request
#2195 opened Oct 7, 2024 by idanshen Loading…
[CGPO] CGPO Trainer (single task single objective) ✨ enhancement New feature or request
#2190 opened Oct 6, 2024 by gaetanlop Draft
9 of 10 tasks
Change KTO tokenization to use DPO's 🏋 KTO Related to KTO
#2187 opened Oct 6, 2024 by kawine Loading…
[CGPO] Mixture of judges 👨‍⚖️ judge Related to judges
#2159 opened Oct 3, 2024 by gaetanlop Loading…
4 tasks done
ProTip! Exclude everything labeled bug with -label:bug.