-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rollout Scheduling #145
Comments
What does static mean? Constant at, say 2? This is already supported. |
@mchantry Updated the description |
I like the idea of dynamic selection of increments and I was also wondering if this could be done by steps as well as by epochs? For example at step 1000, do roll 2, at step 10000, do roll 10. |
I agree with @mc4117. Some models show a better performance when trained for longer on 2-steps and only some iterations on longer rollout steps. | |
I like @mc4117 suggestion regarding supporting rollout by steps. I think this probably would make things easier if, in the future, we want to automate the training so that the 6-hour and the rollout steps are executed one after the other. |
Moving to a discussion (to try it out) |
Our current rollout implementation is very focused on sequential epoch increments, it would be good to generalise this to provide schedulers to control rollout.
Work was done in
aifs-mono
to enable this. hereI think this can be generalised and provide more general applicability.
Features
Below is a list of features and requirements as I see them
Improvements
Setup config at begin of training with rollout increment be
Questions
What other features may be needed?
The text was updated successfully, but these errors were encountered: