Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Hierarchical CP implementation (Ulysses + Ring) (#1209)
* change API for hierarchical CP Signed-off-by: Xiaowei Ren <[email protected]> * move fp8 code before qkv reshape Signed-off-by: Xiaowei Ren <[email protected]> * try to insert A2A for hierarchical CP Signed-off-by: Xiaowei Ren <[email protected]> * make fwd work Signed-off-by: Xiaowei Ren <[email protected]> * remove a redundant sync Signed-off-by: Xiaowei Ren <[email protected]> * make bwd of hierarchical CP work Signed-off-by: Xiaowei Ren <[email protected]> * fix dout a2a in bwd Signed-off-by: Xiaowei Ren <[email protected]> * fix q_f16 with fp8 Signed-off-by: Xiaowei Ren <[email protected]> * assert hierarchical CP implementation does not support THD format Signed-off-by: Xiaowei Ren <[email protected]> * bug fix Signed-off-by: Xiaowei Ren <[email protected]> * assert hierarchical CP does not support attn bias Signed-off-by: Xiaowei Ren <[email protected]> * add unit test for hierarchical CP Signed-off-by: Xiaowei Ren <[email protected]> * fix cp_comm_type in unit test Signed-off-by: Xiaowei Ren <[email protected]> * bug fix and code cleaning Signed-off-by: Xiaowei Ren <[email protected]> * minor change Signed-off-by: Xiaowei Ren <[email protected]> * an assert info change Signed-off-by: Xiaowei Ren <[email protected]> * dout shape fix Signed-off-by: Xiaowei Ren <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * move function definitions to the front of the first call Signed-off-by: Xiaowei Ren <[email protected]> * fix tensor view comments Signed-off-by: Xiaowei Ren <[email protected]> * refine CP unit test Signed-off-by: Xiaowei Ren <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * typo fix Signed-off-by: Xiaowei Ren <[email protected]> * typo fix Signed-off-by: Xiaowei Ren <[email protected]> * save cp_size_a2a and rank_a2a in fwd Signed-off-by: Xiaowei Ren <[email protected]> * add more explainations of cp_group in doc_string Signed-off-by: Xiaowei Ren <[email protected]> --------- Signed-off-by: Xiaowei Ren <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
- Loading branch information