Pull requests: mlfoundations/evalchemy
Support for Big Bench Extra Hard (General-purpose reasoning eval)
#92 opened Mar 8, 2025 by Hritikbansal
Add the ability to do inference externally to Evalchemy
#91 opened Mar 6, 2025 by RyanMarten
OpenAI API key handling and task loading in evaluation framework
#88 opened Mar 4, 2025 by jmercat