We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
The robust European language model benchmark.
Python 99 25
Collection of all evaluation results from the EuroEval framework.