You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi all, I'm working on a change to OLMo to support use of additional Pytorch Dataset Classes in our fork of OLMO, and I'm getting some OOM errors due to the use of process spawning rather than forming. I'm considering making process start method configurable, but wanted to understand more about the reasons for forcing all multiprocessing to be done with spawn before I went ahead with the change.
The text was updated successfully, but these errors were encountered:
The OOM errors you’re encountering might stem from increased memory usage associated with spawn due to the duplication of resources when processes are initialized(not 100% sure). We used Memmap implementation to minimize memory storage. Maybe you can reduce the workers/batch-size to ease up.
❓ The question
Hi all, I'm working on a change to OLMo to support use of additional Pytorch Dataset Classes in our fork of OLMO, and I'm getting some OOM errors due to the use of process spawning rather than forming. I'm considering making process start method configurable, but wanted to understand more about the reasons for forcing all multiprocessing to be done with spawn before I went ahead with the change.
The text was updated successfully, but these errors were encountered: