generated from fastai/nbdev_template
Pull requests: huggingface/trl
🧞 Add output_layer to the list of lm_head_namings in AutoModelForCausalLMWithValueHead
#2328
opened Nov 5, 2024 by
qgallouedec
5 tasks
👩🏫 Add SFT notebook for chatbot development
#2321
opened Nov 4, 2024 by
qgallouedec
Draft
5 tasks
Added GitHub Star api and added back to top button in the file.
#2297
opened Oct 30, 2024 by
hackit-coder
3 of 5 tasks
🖨️ Fix error text in BCO and KTO tokenizing function
#2286
opened Oct 26, 2024 by
PhilipMay
Asynchronous RLHF: Faster and More Efficient Online DPO
#2278
opened Oct 24, 2024 by
mnoukhov
1 of 3 tasks
Add Error Handling for Stale Issue Script in GitHub Action
#2258
opened Oct 21, 2024 by
Ananya54321
2 of 5 tasks
[SFT VLM] Added support for Molmo models via standalone script sft_vlm_molmo
#2236
opened Oct 15, 2024 by
sergiopaniego
2 of 5 tasks
Remove ds_config scheuduler params to prevent deepseed from creating scheduler for ref_model
#2224
opened Oct 11, 2024 by
Ben-Schneider-code
2 of 5 tasks
fixed: OverflowError: out of range integral type conversion attempted
#2206
opened Oct 9, 2024 by
himanshushukla12
1 of 5 tasks
Change KTO tokenization to use DPO's
🏋 KTO
#2187
opened Oct 6, 2024 by
kawine
[CGPO] Mixture of judges
👨⚖️ judge
#2159
opened Oct 3, 2024 by
gaetanlop
4 tasks done