FYI, HumanEval 95 check dict case canonical solution is wrong #52

PootieT · 2023-04-19T15:44:55Z

arjunguha · 2023-04-19T20:33:08Z

Oh, there is more than just this one... look up the degrees/radians problem in MBPP (sorry, I forget the number). We don't actually use the canonical solutions at all in MultiPL-E, so we should be okay on this.

The principle we've been following: we want to fix bugs in MultiPL-E itself, but preserve bugs in the underlying benchmarks. Hopefully, that will make comparisons easier. Let me know if you have other ideas.

PootieT closed this as completed Apr 20, 2023
