-
Notifications
You must be signed in to change notification settings - Fork 32
About critic-guide search in InternLM2.5-StepProver #42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
|
Following the question, may I ask what is the template of the prompt for the critic model? I wrote a version according to what the paper mentions, but not sure if it is proper:
|
Yes, I think it is right. |
The search algorithm stops immediately when "no goals" is met. Regarding the previous proof issue, here are some related discussions: Lean Zulip Chat. It is designed to alleviate the "swinging back and forth" issue. However, we haven't conducted experiments on other base models, so whether this phenomenon is model/approach-agnostic is still unknown. |
Thank you for your reply. I will conduct more experiments to verify this conclusion. |
Thank you for your work on InternLM2.5-StepProver. I have two questions:
The text was updated successfully, but these errors were encountered: