DDP with in-CPU-memory dataset #19543
Unanswered
DucoG asked this question in DDP / multi-GPU / multi-node
Replies: 0 comments
Hi! I'm using Lightning to train a model on multiple GPUs with a large dataset. Loading the data from disk is very slow, and the dataset is too big for each GPU process to hold a full copy in CPU memory, so I've opted for the following approach, but I'm not sure whether it will work.
1. In `DataModule.prepare_data`, I divide the dataset's shards over the number of GPUs, making sure that every GPU gets the same number of rows. The assignment is saved to a configuration file on disk.
2. In `DataModule.setup`, each rank loads its own shards according to that configuration file.
3. I set `use_distributed_sampler=False` on the Trainer and `shuffle=True` on the DataLoader (see the sketch below).
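To make the question concrete, here is a minimal sketch of what I mean. It assumes equal-sized, torch-saved tensor shards on a filesystem visible to all ranks; `ShardedDataModule`, `shard_paths`, and the JSON assignment file are just illustrative names, not anything from Lightning itself:

```python
import glob
import json

import torch
from torch.utils.data import DataLoader, TensorDataset
import lightning as L


class ShardedDataModule(L.LightningDataModule):
    """Pre-assigns shards to ranks so each process only loads its own slice into CPU memory."""

    def __init__(self, shard_paths, config_path="shard_assignment.json", batch_size=32):
        super().__init__()
        self.shard_paths = shard_paths
        self.config_path = config_path
        self.batch_size = batch_size
        self.dataset = None

    def prepare_data(self):
        # Runs on one process only: assign shards round-robin over the world size
        # and persist the plan so every rank later reads the same assignment.
        # (Assumes equal-sized shards, so each rank ends up with the same row count.)
        world_size = self.trainer.world_size if self.trainer else 1
        assignment = {str(rank): self.shard_paths[rank::world_size] for rank in range(world_size)}
        with open(self.config_path, "w") as f:
            json.dump(assignment, f)

    def setup(self, stage=None):
        # Runs on every rank: load only the shards assigned to this rank.
        with open(self.config_path) as f:
            assignment = json.load(f)
        my_shards = assignment[str(self.trainer.global_rank)]
        tensors = [torch.load(p) for p in my_shards]  # placeholder loader for my real format
        self.dataset = TensorDataset(torch.cat(tensors))

    def train_dataloader(self):
        # shuffle=True only shuffles within this rank's shards; Lightning must not
        # wrap this in a DistributedSampler, hence use_distributed_sampler=False.
        return DataLoader(self.dataset, batch_size=self.batch_size, shuffle=True)


# Hypothetical usage (model would be my LightningModule):
trainer = L.Trainer(
    accelerator="gpu",
    devices=4,
    strategy="ddp",
    use_distributed_sampler=False,  # keep Lightning from injecting a DistributedSampler
)
trainer.fit(model, datamodule=ShardedDataModule(sorted(glob.glob("shards/*.pt"))))
```

One consequence of this design is that each rank only ever shuffles within its own fixed subset of the data, rather than seeing a global shuffle across epochs, and that's the part I'm least sure about.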
Will this approach work? I was also wondering whether there are better approaches, as I haven't seen any examples like this online and it feels like I'm trying to reinvent the wheel but failing 😦