This repository has been archived by the owner on Aug 7, 2024. It is now read-only.

[DISCUSSION] fix float8 all-gather in FSDP2 + TP: DTensor(WeightWithDynamicFloat8CastTensor) #326

Draft
weifengpy wants to merge 10 commits into pytorch-labs:main from weifengpy:fsdp2

Commits

Commits on Jul 17, 2024

Commits on Jul 18, 2024

Commits on Jul 24, 2024

Commits on Aug 1, 2024