Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

107 Open 1,185 Closed

🐛 bug 🏋 DPPO 🙋 help from community wanted ⏳ needs more info

#2505 opened Dec 20, 2024 by nguyenhoa-uit

5 of 9 tasks

✨ enhancement 🏋 SFT

#2504 opened Dec 19, 2024 by ggbetz

🙋 help from community wanted 🏋 PPO ❓ question 🏋 RLOO

#2496 opened Dec 17, 2024 by macheng6

✨ enhancement 🧒 good second issue

#2495 opened Dec 17, 2024 by qgallouedec

5 tasks

🐛 bug 🚀 deepspeed 🏋 DPO ⏳ needs more info

#2490 opened Dec 16, 2024 by sagie-dekel

7 of 9 tasks

🏋 GKD ❓ question

#2481 opened Dec 14, 2024 by hteague-qti

🐛 bug 🏋 DPO

#2473 opened Dec 13, 2024 by qingjianbuyi

7 of 9 tasks

🙋 help from community wanted ❓ question 🏋 RLOO

#2472 opened Dec 13, 2024 by macheng6

1 of 3 tasks

Provide Descriptions (READMEs) for trl-lib/dataset 🗃️ data 📚 documentation ✨ enhancement 👶 good first issue 🙋 help from community wanted

#2470 opened Dec 13, 2024 by Kallinteris-Andreas

🏋 DPO ✨ enhancement

#2469 opened Dec 13, 2024 by zhc7

🐛 bug 🏋 DPO

#2468 opened Dec 13, 2024 by zhc7

Probably a more reasonable method of packing ✨ enhancement 🧒 good second issue 🙋 help from community wanted 🏋 SFT

#2466 opened Dec 12, 2024 by AIR-hl

❓ question

#2465 opened Dec 11, 2024 by AMindToThink

3 tasks

Evaluation with OnlineDPO 🐛 bug 🏋 Online DPO

#2464 opened Dec 11, 2024 by MohamedAliRashad

7 of 9 tasks

Probaly mistake in DPOTrainer when compute/log grad_norm 🏋 DPO ❓ question

#2456 opened Dec 10, 2024 by AIR-hl

7 of 9 tasks

🏋 DPO ❓ question

#2452 opened Dec 9, 2024 by gp-1108

7 of 9 tasks

🙋 help from community wanted 🏋 PPO ❓ question

#2451 opened Dec 8, 2024 by hwhyyds

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list