
Introduce PerProcessContext to adjust DDP per process behavior (#913) #955

Merged 1 commit on Aug 3, 2021

Conversation

@breakds (Contributor) commented Aug 2, 2021

Motivation

  • To make DDP as transparent to the user as possible, we should not expect the user to manually update a configuration to achieve parity between DDP mode and non-DDP mode. More specifically, if a single-process run uses 64 parallel environments, a dual-process run should reduce the per-process number of parallel environments accordingly (to 32).
  • Similar to having only process 0 (the master process) write checkpoints, we want only process 0 to write the summary. This is done by enabling summary only when the process rank is 0 (or -1) when setting up PolicyTrainer.

Therefore, this PR addresses the combined problem of #953 and #954.

Solution

Introduce a global singleton called PerProcessContext. As its name suggests, this context maintains properties that are private to each process. In the current implementation, PerProcessContext allows accessing the rank (process ID) and the total number of processes from anywhere.
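A minimal sketch of what such a process-private singleton might look like (the names and details below are illustrative assumptions, not the actual ALF implementation):

```python
import threading


class PerProcessContext:
    """Process-private context exposing the DDP rank and world size.

    Illustrative sketch only; ALF's real PerProcessContext may differ.
    A rank of -1 means "not running under DDP".
    """

    _instance = None
    _lock = threading.Lock()

    def __new__(cls):
        # Double-checked locking so every call returns the same instance.
        if cls._instance is None:
            with cls._lock:
                if cls._instance is None:
                    inst = super().__new__(cls)
                    inst._rank = -1
                    inst._num_processes = 1
                    cls._instance = inst
        return cls._instance

    def set_distributed(self, rank, num_processes):
        """Record this process's rank and the total process count."""
        self._rank = rank
        self._num_processes = num_processes

    @property
    def rank(self):
        return self._rank

    @property
    def num_processes(self):
        return self._num_processes
```

Because `PerProcessContext()` always returns the same object, any code deep in the call stack can read the rank and process count without the trainer threading a context argument through every layer.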

Design Choices:

  1. Decided to use a global singleton to avoid having to pass the context down to the very bottom of the call stack, as discussed with @emailweixu in #953 (Adjust number of environments in DDP mode based on number of processes).
  2. Decided to disable summary by hijacking run_under_record_context as a whole, instead of just hijacking _cond, because run_under_record_context initializes the writer and writes the global step before the first call to _cond, and we would like to avoid that as well.
  3. Decided to transparently update the number of environments per process in the config, instead of providing a new configuration item, unlike what was suggested by @le-horizon in #953. Both approaches have their benefits, but having to update the config to run distributed training is extra work for the user, and a new configuration item may further hurt the readability of the config file. As long as the training/optimization is the same, DDP and non-DDP runs shouldn't differ in their training results.
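The combined effect of these choices can be sketched as a small helper (a hypothetical illustration assuming the semantics described above, not the actual ALF code):

```python
def adjust_for_ddp(num_parallel_environments, rank, num_processes):
    """Return (per-process env count, whether to enable summary).

    Hypothetical helper illustrating the PR's behavior: the configured
    environment count is split evenly across DDP processes (e.g. 64
    environments over 2 processes -> 32 each), and summary writing is
    enabled only for rank 0, or rank -1 (non-DDP mode).
    """
    per_process_envs = num_parallel_environments // num_processes
    enable_summary = rank in (0, -1)
    return per_process_envs, enable_summary
```

For example, with 64 configured environments and two processes, rank 0 gets 32 environments with summary enabled, while rank 1 gets 32 environments with summary disabled.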

Testing

  1. Ran training without DDP to confirm that the summary and the number of environments are the same as before.
  2. Ran training with DDP in dual-process mode to confirm that
  • the number of environments is halved per process, and
  • only one events file is generated for the summary.

@breakds force-pushed the PR_breakds_per_process_context branch from 79939e1 to 97e2e78 on August 2, 2021 21:16
@emailweixu (Contributor) commented:

Should we add some unittest to ensure DDP can run correctly? (perhaps in a different PR). A possible place for this is alf/bin/train_play_test.py.

@breakds (Contributor, Author) commented Aug 2, 2021:

> Should we add some unittest to ensure DDP can run correctly? (perhaps in a different PR). A possible place for this is alf/bin/train_play_test.py.

Agree. Created #956 for this.

@le-horizon (Contributor) left a comment:

LG

@breakds breakds merged commit 774f51f into pytorch Aug 3, 2021
@breakds breakds deleted the PR_breakds_per_process_context branch August 3, 2021 00:14