Explore MS promptbench #931

Open
dcecchini opened this issue Dec 18, 2023 · 0 comments
Labels
⏭️ Next Release Issues or Request for the next release

dcecchini commented Dec 18, 2023

Explore the new tool released by Microsoft for evaluation of LLMs.

Brief description:

It covers a wide range of LLMs and evaluation datasets, spanning diverse tasks, evaluation protocols, adversarial prompt attacks, and prompt engineering techniques. As a holistic library, it also provides several analysis tools for interpreting the results. It is designed in a modular fashion, so users can build evaluation pipelines for custom projects.

So, I think we should check which techniques they use to evaluate the models, as well as the datasets, tasks, and analysis tools for interpreting results that they support.
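To make the "modular pipeline" idea concrete while we compare approaches, here is a rough, library-agnostic sketch of the pieces such a pipeline usually has: a dataset, a set of prompt templates, a model, an output parser, and a metric. This is not promptbench's API. Every name below (`toy_model`, `parse_label`, `evaluate`) is hypothetical, and the "model" is a keyword stub rather than a real LLM.

```python
from typing import Callable, Dict, List

# Hypothetical stand-ins for illustration only; none of these names come from promptbench.
Example = Dict[str, str]


def toy_model(prompt: str) -> str:
    """Stub 'LLM': predicts 'positive' if the word 'good' appears anywhere in the prompt."""
    return "positive" if "good" in prompt.lower() else "negative"


def parse_label(raw: str) -> str:
    """Output-processing step: normalize the raw model text into a label."""
    return raw.strip().lower()


def evaluate(
    model: Callable[[str], str],
    prompts: List[str],
    dataset: List[Example],
) -> Dict[str, float]:
    """Return the accuracy of `model` on `dataset` for each prompt template."""
    scores: Dict[str, float] = {}
    for template in prompts:
        correct = 0
        for example in dataset:
            text = template.format(content=example["content"])
            prediction = parse_label(model(text))
            correct += prediction == example["label"]
        scores[template] = correct / len(dataset)
    return scores


dataset = [
    {"content": "A good movie", "label": "positive"},
    {"content": "Dull and slow", "label": "negative"},
    {"content": "Not bad at all", "label": "positive"},
    {"content": "Terrible plot", "label": "negative"},
]
prompts = [
    "Classify the sentiment: {content}",
    "Is this a good or bad review? {content}",
]

scores = evaluate(toy_model, prompts, dataset)
for template, accuracy in scores.items():
    print(f"{accuracy:.2f}  {template}")
```

With this stub the second template scores lower only because its own wording contains "good". That is a toy version of the prompt sensitivity that per-prompt evaluation and adversarial prompt attacks are meant to surface, so it may be a useful lens when we look at how they structure things.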

GitHub link: promptbench

@ArshaanNazir ArshaanNazir added ⏭️ Next Release Issues or Request for the next release v2.1.0 Issue or request to be done in v2.1.0 release and removed ⏭️ Next Release Issues or Request for the next release labels Dec 20, 2023
@ArshaanNazir ArshaanNazir added ⏭️ Next Release Issues or Request for the next release and removed v2.1.0 Issue or request to be done in v2.1.0 release labels Feb 2, 2024
2 participants