Run notebook end-to-end with GTX 3090/4090 #5
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The main purpose of this PR is to enable users with RTX 3090/4090 cards to run the
lm-hackers.ipynb
notebook end-to-end. To achieve this, we need to free both the host and GPU memory before loading a new model. The free_memory function below was borrowed from miniai:Before loading a new model, the current model is deleted, and the
free_memory
function is called:NOTE: It's essential to use
del model
beforefree_memory
to ensure that the memory is released before reassignment. Failing to do so might result in a working memory exceeding 24GB.