
Extracting prompts based on Risk Category #4

Open
reinbugnot opened this issue Oct 29, 2024 · 1 comment
@reinbugnot
Hi, I'm from NUS-NCS Cybersecurity Laboratory in SG.

I am interested in using the S-Eval dataset in our LLM risk evaluations. From the README.md file, there's a breakdown of how many prompts are available per Risk Category (i.e. Access Control, Hacker Attack, Malicious Code, etc. under Cybersecurity).

But the risk category information is currently not included in the .jsonl files inside s_eval/.

Can I ask if there's a way to group the prompts according to their risk categories? This will greatly help us in our use case. Thanks!
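Once per-prompt risk labels are available in the `.jsonl` files, grouping is straightforward. Below is a minimal sketch; the field names `"prompt"` and `"risk_type"` (and the sample records) are assumptions for illustration, not the actual S-Eval schema:

```python
import json
from collections import defaultdict

# Hypothetical records in JSONL form; the real S-Eval field names may differ.
jsonl_lines = [
    '{"prompt": "Example prompt A", "risk_type": "Cybersecurity"}',
    '{"prompt": "Example prompt B", "risk_type": "Cybersecurity"}',
    '{"prompt": "Example prompt C", "risk_type": "Data Privacy"}',
]

def group_by_risk(lines, key="risk_type"):
    """Group parsed JSONL records by a risk-label field."""
    groups = defaultdict(list)
    for line in lines:
        record = json.loads(line)
        # Fall back to "unknown" for records missing the label field.
        groups[record.get(key, "unknown")].append(record["prompt"])
    return dict(groups)

groups = group_by_risk(jsonl_lines)
print(sorted(groups))  # risk categories present in the data
```

Reading an actual file would just replace `jsonl_lines` with the lines of `s_eval/<file>.jsonl`.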

@zggg1p
Collaborator

zggg1p commented Oct 29, 2024

Thanks for your support of our work. S-Eval keeps a detailed record of the risk types each prompt belongs to (102 fine-grained risk subcategories organized in four levels). Currently, we have only released the labels for the first-level risk dimensions. We will release more fine-grained risk labels in the near future. Please stay tuned; we will notify you as soon as they are available.

If our work is useful for your research, please star ⭐ our project.
