-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
critic speedup #219
critic speedup #219
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, couldn't finish the review today. Monday is a holiday in Canada so I'll probably pick it up on Tuesday (unless I can work on it tonight, but not sure yet). Submitting partial review so you can already answer some of my questions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few small follow-up comments to my previous review.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Still not fully done but a few more comments
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All done, finally!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please just fix the typo below and good to go, thanks for this refactor!
Signed-off-by: Gerald Shen <[email protected]>
6c8d698
to
606f690
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking good, thanks!
Signed-off-by: Gerald Shen <[email protected]>
thanks for the review! I checked the numerics locally against the previous main and ran a nemo generate only test. They both look good to my eye so I'm merging now. |
* critic speedup Signed-off-by: Gerald Shen <[email protected]> * middle of PP should be broadcasted as well Signed-off-by: Gerald Shen <[email protected]> --------- Signed-off-by: Gerald Shen <[email protected]>
add some much needed cleanup to the critic and reward model inference servers.
rest of the changes are in the changelog