Continuing Training from a Previous Checkpoint #177

Answered by halx
SebastianPaucar asked this question in Q&A
REINVENT uses probabilistic models and as such is non-deterministic. Unless you set the seed (which helps only to some extent) or use beam search with the transformer models, you cannot expect repeated runs to produce the same output. And that is perfectly fine. In fact, runs decorrelate fairly quickly, and what you should really do in practice, though probably few people do, is combine the results from multiple independent runs.

Set up a realistic example with realistic settings and check that for yourself.
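
To make the point about seeds and combining runs concrete, here is a minimal toy sketch. It does not use the REINVENT API at all: `FRAGMENTS`, `sample_run`, and `combine_runs` are hypothetical names made up for illustration, and the generated strings are arbitrary token sequences, not guaranteed valid SMILES. It only shows that a fixed seed repeats a run, different seeds diverge, and independent runs can be merged afterwards.

```python
import random

# Hypothetical toy vocabulary standing in for a generative model's output space.
FRAGMENTS = ["C", "N", "O", "c1ccccc1", "C(=O)", "Cl"]


def sample_run(seed, n_samples=50, length=4):
    """Draw n_samples token strings from a private RNG seeded with `seed`."""
    rng = random.Random(seed)
    return [
        "".join(rng.choice(FRAGMENTS) for _ in range(length))
        for _ in range(n_samples)
    ]


def combine_runs(runs):
    """Merge several runs into one de-duplicated list, keeping first-seen order."""
    seen = {}
    for run in runs:
        for item in run:
            seen.setdefault(item, None)
    return list(seen)


run_a = sample_run(seed=1)
run_a_again = sample_run(seed=1)  # same seed: the toy sampler repeats exactly
run_b = sample_run(seed=2)        # different seed: an independent run

combined = combine_runs([run_a, run_b])

print("same seed identical:     ", run_a == run_a_again)
print("different seed identical:", run_a == run_b)
print("unique in run_a / run_b / combined:",
      len(set(run_a)), len(set(run_b)), len(combined))
```

In this toy case the seed gives exact repeatability; with a real model the seed helps only to some extent, as noted above, so treat each run as one independent sample and pool the results.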

Answer selected by halx