You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, mcr-dl's new_group() implementation for mpi uses torch.distributed to create new_group instead of having it's own implementation.
Refer following :
Problem :
Currently, mcr-dl's new_group() implementation for mpi uses torch.distributed to create new_group instead of having it's own implementation.
Refer following :
MCR-DL/mcr_dl/mpi.py
Line 74 in 870751f
We would like to have direct implementation instead of using torch.distributed similar to existing implementation of 'nccl'
Refer following :
MCR-DL/mcr_dl/nccl.py
Line 73 in 870751f
Testing :
The text was updated successfully, but these errors were encountered: