Issues: huggingface/open-r1
Issues list
[ROCm] GRPO Training with vLLM is hanging on MI300X system, w/o vLLM it works properly
#482 opened Mar 5, 2025 by nikhil-tensorwave
Why don't rewards increase instead of staying at a certain value in GRPO?
#474 opened Mar 5, 2025 by AXy1527
how to set the max_model_length, max_new_tokens and generation_size when evaluate ?
#472 opened Mar 5, 2025 by ItGirls
Is it normal for a 1.5B model on an H100 80G to require several hundred hours for LiveCodeBench?
#466 opened Mar 4, 2025 by wccccp
Questions Regarding Completion Length Change in Reproducing SimpleRL-Reason
#465 opened Mar 4, 2025 by nonstopfor