You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
More improvements to FSDP, benchmark against DDP (#13)
* leave root params in mem
* fix logic
* don't show mem usage all the time
* fix
* make configurable
* add alloc/free for unsharded data
* add alloc/free for unsharded grad
* fix
* record for
* revert 69d74c4 - alloc/free for unsharded grad
* revert alloc/free trick for unsharded params data
* add support for DDP in benchmark
* set device ids explicitly
* fix
* change up how weights are initialized
* fix test
* Add back alloc/free hack for unsharded data
* Revert "Add back alloc/free hack for unsharded data"
This reverts commit 0386841.
* Handle frozen layers with reshard-only post-backward hook
* Revert "Handle frozen layers with reshard-only post-backward hook"
This reverts commit 0f408d2.
* add to test
* add to test
* Fixes for frozen modules
* Divide grad before and after reducing for stability
* Add support for hybrid sharding
* make grad clipping optional
* clean up
* calculate grad norm more efficiently
* Revert "calculate grad norm more efficiently"
This reverts commit d66a683.
* fix
0 commit comments