princeton-nlp / SimPO Public

Issues: princeton-nlp/SimPO

20 Open 56 Closed

Issues list

Arena-Hard Gemma Template?

#78 opened Dec 9, 2024 by cinjon

can't reproduce AE-LC numbers in hf ckpt (Llama-3-8b-SFT-DPO, Llama-3-8b-SFT-SimPO)

#77 opened Dec 2, 2024 by schrieffer-z

Confused about number of steps

#76 opened Nov 25, 2024 by cinjon

Question regarding alpaca_eval sampling parameter

#75 opened Nov 21, 2024 by lancerts

Question about tuning set

#74 opened Nov 13, 2024 by yakazimir

Query about GSM8K evaluation

#73 opened Nov 12, 2024 by HCY123902

Request for SFT Training scripts and implementation details

#71 opened Oct 19, 2024 by HCY123902

About the update of environment.yml

#70 opened Oct 17, 2024 by lucasliunju

Hyper-parameter tuning for other models

#69 opened Sep 28, 2024 by Tejaswgupta

about evaluating Simpo-v0.2 by arena-hard

#68 opened Sep 21, 2024 by jimmy19991222

Where is the sequence parameter in SimPo_loss function

#67 opened Sep 20, 2024 by yunyiliu

decoding parameters (e.g., temperature) for Gemma-2?

#64 opened Sep 5, 2024 by iseesaw

Difference with changing the gradient accumulation - ZeroEval and AlpacaEval 2

#61 opened Aug 12, 2024 by sahsaeedi

Question about the annotators_config and reference_outputs in alpaca_eval

#55 opened Aug 1, 2024 by AIR-hl

bug using accelerate

#54 opened Jul 29, 2024 by cjakfskvnad

Experimental results on ARC-C subset for challenging reasoning?

#47 opened Jul 18, 2024 by tongyx361

reward/chosen is decreasing

#42 opened Jul 15, 2024 by zhangguoxin1

How to use local dataset

#41 opened Jul 15, 2024 by mazhengyufreedom

AttributeError: 'SimPOConfig' object has no attribute 'ref_model_init_kwargs'. Did you mean 'model_init_kwargs'?

#34 opened Jul 4, 2024 by Saumajit

Upstream SimPOTrainer to TRL

#3 opened May 25, 2024 by philschmid
