Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to do the automatic safety evaluations #3

Open
tivon-x opened this issue Jun 19, 2024 · 1 comment
Open

How to do the automatic safety evaluations #3

tivon-x opened this issue Jun 19, 2024 · 1 comment

Comments

@tivon-x
Copy link

tivon-x commented Jun 19, 2024

Hello!

This is very nice work!

I kindly want to know how to do the automatic safety evaluations. According to the paper, you use a safety critique llm for the evaluations. Will you release the safety critique llm in the future? Or are there any other methods for the automatic safety evaluations?

@IS2Lab
Copy link
Owner

IS2Lab commented Jun 20, 2024

Thanks for your recognition of our work. At present, we have no plans to release our safety critique llm in the short term. However, we are happy to provide support to help you with safety evaluations. If you require automatic safety evaluations, please provide a jsonl file including traceid, prompt, and the corresponding response and send it to [email protected] (if you have any questions about the required submission files, please contact this email first.) We will evaluate and return the results according to the order of requests.

In addition, for other automatic safety evaluations, we introduced and analyzed various methods in our paper. You can refer to the paper.

Thanks again for your understanding and support!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants