Skip to content

Commit

Permalink
ADLR/megatron-lm!2310 - tests: Re-enable CP tests
Browse files Browse the repository at this point in the history
  • Loading branch information
ko3n1g committed Nov 6, 2024
1 parent f39c48d commit 358fbcf
Show file tree
Hide file tree
Showing 5 changed files with 12 additions and 4 deletions.
8 changes: 4 additions & 4 deletions tests/functional_tests/jet_recipes/gpt.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -101,11 +101,11 @@ products:
- gpt3_mr_tp2_pp2_dgx_a100_1N8G
- gpt3_mr_mcore_te_tp2_pp2_defer_embedding_wgrad_compute_dgx_a100_1N8G
- gpt3_mr_mcore_te_tp1_pp4_vp1_dist_optimizer_overlap_grad_reduce_param_gather_dgx_a100_1N8G
# - gpt3_mr_mcore_te_tp2_pp1_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
# - gpt3_mr_mcore_te_tp2_pp2_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
# - gpt3_mr_mcore_te_tp2_pp1_resume_torch_dist_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
# - gpt3_mr_mcore_te_tp2_pp2_resume_torch_dist_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
- gpt3_mr_mcore_te_tp1_pp4_vp1_dist_optimizer_overlap_grad_reduce_dgx_a100_1N8G
- gpt3_mr_mcore_te_tp2_pp1_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
- gpt3_mr_mcore_te_tp2_pp2_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
- gpt3_mr_mcore_te_tp2_pp1_resume_torch_dist_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
- gpt3_mr_mcore_te_tp2_pp2_resume_torch_dist_cp2_nondeterministic_dgx_a100_1N8G # cp and attention
- environment: [lts, dev]
scope: [nightly]
platforms: [dgx_a100]
Expand Down
Original file line number Diff line number Diff line change
@@ -1,6 +1,8 @@
ENV_VARS:
CUDA_DEVICE_MAX_CONNECTIONS: 1
NVTE_ALLOW_NONDETERMINISTIC_ALGO: 1
NVTE_FUSED_ATTN: 0
NVTE_FLASH_ATTN: 1
MODEL_ARGS:
--num-layers: 12
--hidden-size: 512
Expand Down
Original file line number Diff line number Diff line change
@@ -1,6 +1,8 @@
ENV_VARS:
CUDA_DEVICE_MAX_CONNECTIONS: 1
NVTE_ALLOW_NONDETERMINISTIC_ALGO: 1
NVTE_FUSED_ATTN: 0
NVTE_FLASH_ATTN: 1
MODEL_ARGS:
--num-layers: 12
--hidden-size: 512
Expand Down
Original file line number Diff line number Diff line change
@@ -1,6 +1,8 @@
ENV_VARS:
CUDA_DEVICE_MAX_CONNECTIONS: 1
NVTE_ALLOW_NONDETERMINISTIC_ALGO: 1
NVTE_FUSED_ATTN: 0
NVTE_FLASH_ATTN: 1
MODEL_ARGS:
--num-layers: 12
--hidden-size: 512
Expand Down
Original file line number Diff line number Diff line change
@@ -1,6 +1,8 @@
ENV_VARS:
CUDA_DEVICE_MAX_CONNECTIONS: 1
NVTE_ALLOW_NONDETERMINISTIC_ALGO: 1
NVTE_FUSED_ATTN: 0
NVTE_FLASH_ATTN: 1
MODEL_ARGS:
--num-layers: 12
--hidden-size: 512
Expand Down

0 comments on commit 358fbcf

Please sign in to comment.