Skip to content

Integrate alyah benchmark#1117

Open
amztheorytii wants to merge 2 commits intohuggingface:mainfrom
amztheorytii:main
Open

Integrate alyah benchmark#1117
amztheorytii wants to merge 2 commits intohuggingface:mainfrom
amztheorytii:main

Conversation

@amztheorytii
Copy link

@amztheorytii amztheorytii commented Jan 12, 2026

This PR is intended to integrate a benchmark called Alyah, which evaluates LLMs in the Emirati Culture. Benchmark has been made public at HF.
Considering it multilingual related benchmark, we have added a dedicated file under src/lighteval/tasks/multilingual/tasks/eval_emirati.py.

Considering it is a simple PR, we will appreciate a fast review process.

Credit to Contributers @Omar-Alkaabi, @amztheorytii

@amztheorytii amztheorytii changed the title add alyah benchmark Integrate alyah benchmark Jan 12, 2026
@NathanHB
Copy link
Member

hey @amztheorytii ! Is this ready for review ? :)

@amztheorytii
Copy link
Author

amztheorytii commented Jan 15, 2026

@NathanHB yeah we can go ahead with the reviewing process!
benchmark dataset is not yet public and you guys need the dataset to be public to start reviewing
I'll ping you once made public (should be in the coming days)

@basma-boussahaa
Copy link

hi @NathanHB
the dataset is public please review the PR and let us know if anything is needed

@basma-boussahaa
Copy link

hi @NathanHB
do you have any idea when this PR can be reviewed please?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants