[feat] Add math eval to CI nightly run #2663
I took a look at the recent flaky nightly tests. There is some flakiness around the gsm8k threshold for certain models, but the main contributor is human eval timing out from
As we discussed, use Thanks, great job! |
Why disable human eval rather than increase the timeout?
@XiaotongJiang LGTM
Co-authored-by: Chayenne <[email protected]>
Fix forward for #2652
How did I test: