Skip to content

Release BigCodeBench v0.1.7

Latest
Compare
Choose a tag to compare
@terryyz terryyz released this 27 Jun 23:52
· 21 commits to main since this release

Fix some identified issues:

  • The ground truth pass rate was not previously computed in the correct way.
  • Passed RAM limits would raise errors, as they were set as float type.
  • User permission is not correctly set up in the Evaluate Docker.

Features:
-- check-gt-only will print out the pass rate when finishing.