Skip to content

Actions: NVIDIA/TransformerEngine

Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,534 workflow runs
3,534 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[MoE][PyTorch] Add mask-based MoE permutation
Documentation #5839: Pull request #1373 synchronize by hxbai
December 13, 2024 05:51 54s hxbai:permute_fusion
December 13, 2024 05:51 54s
[PyTorch] Fix autocast deprecation warnings
Documentation #5838: Pull request #1277 synchronize by yaox12
December 13, 2024 05:26 56s yaox12:xiny/fix_autocast_warning
December 13, 2024 05:26 56s
[PyTorch] Add weights_only=False for torch.load
Documentation #5837: Pull request #1374 synchronize by cyanguwa
December 13, 2024 04:59 1m 3s cyanguwa:fix_load
December 13, 2024 04:59 1m 3s
[PyTorch] Add weights_only=False for torch.load
Documentation #5836: Pull request #1374 opened by cyanguwa
December 13, 2024 04:59 1m 9s cyanguwa:fix_load
December 13, 2024 04:59 1m 9s
[MoE][PyTorch] Add mask-based MoE permutation
Documentation #5835: Pull request #1373 synchronize by pre-commit-ci bot
December 13, 2024 04:49 Action required hxbai:permute_fusion
December 13, 2024 04:49 Action required
[MoE][PyTorch] Add mask-based MoE permutation
Documentation #5834: Pull request #1373 opened by hxbai
December 13, 2024 04:49 1m 5s hxbai:permute_fusion
December 13, 2024 04:49 1m 5s
Add user to CI
Documentation #5833: Pull request #1371 opened by ksivaman
December 12, 2024 21:55 52s ksivaman:te_ci_add_user
December 12, 2024 21:55 52s
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5831: Pull request #1356 synchronize by pre-commit-ci bot
December 12, 2024 16:13 Action required phu0ngng:jax_multi_test
December 12, 2024 16:13 Action required
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5830: Pull request #1356 synchronize by phu0ngng
December 12, 2024 16:12 1m 10s phu0ngng:jax_multi_test
December 12, 2024 16:12 1m 10s
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5829: Pull request #1356 synchronize by phu0ngng
December 12, 2024 15:39 1m 5s phu0ngng:jax_multi_test
December 12, 2024 15:39 1m 5s
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5828: Pull request #1356 synchronize by phu0ngng
December 12, 2024 14:25 56s phu0ngng:jax_multi_test
December 12, 2024 14:25 56s
[common] Add max_t support for KV in THD
Documentation #5827: Pull request #1370 opened by cyanguwa
December 12, 2024 11:43 1m 16s cyanguwa:max_t_kv
December 12, 2024 11:43 1m 16s
[JAX] Fused attention unit tests fixes and refinements
Documentation #5826: Pull request #1352 synchronize by zlsh80826
December 12, 2024 09:25 1m 0s zlsh80826:rewang/fa-refactor
December 12, 2024 09:25 1m 0s
[common/PyTorch] Add FusedAttention support for SWA (left, right)
Documentation #5825: Pull request #1369 synchronize by pre-commit-ci bot
December 12, 2024 05:52 Action required cyanguwa:swa_padding_brcm
December 12, 2024 05:52 Action required
[common/PyTorch] Add FusedAttention support for SWA (left, right)
Documentation #5824: Pull request #1369 synchronize by cyanguwa
December 12, 2024 05:52 1m 1s cyanguwa:swa_padding_brcm
December 12, 2024 05:52 1m 1s
[common/PyTorch] Add FusedAttention support for SWA (left, right)
Documentation #5823: Pull request #1369 synchronize by pre-commit-ci bot
December 12, 2024 05:44 Action required cyanguwa:swa_padding_brcm
December 12, 2024 05:44 Action required
fused out correction in CP
Documentation #5821: Pull request #1248 synchronize by xiaoyao0115
December 12, 2024 05:01 Action required xiaoyao0115:fused_out_correction
December 12, 2024 05:01 Action required
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5820: Pull request #1356 synchronize by phu0ngng
December 11, 2024 23:13 56s phu0ngng:jax_multi_test
December 11, 2024 23:13 56s
[JAX] Bug fix for distributed normalization
Documentation #5819: Pull request #1366 opened by phu0ngng
December 11, 2024 22:54 58s phu0ngng:distributed_norm_fixes
December 11, 2024 22:54 58s
[JAX] Fused attention unit tests fixes and refinements
Documentation #5818: Pull request #1352 synchronize by zlsh80826
December 11, 2024 03:34 1m 20s zlsh80826:rewang/fa-refactor
December 11, 2024 03:34 1m 20s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5817: Pull request #1358 synchronize by pre-commit-ci bot
December 11, 2024 03:28 1m 30s youngeunkwon0405:fsdp2
December 11, 2024 03:28 1m 30s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5816: Pull request #1358 synchronize by youngeunkwon0405
December 11, 2024 03:28 1m 52s youngeunkwon0405:fsdp2
December 11, 2024 03:28 1m 52s
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5815: Pull request #1356 synchronize by phu0ngng
December 10, 2024 18:43 1m 4s phu0ngng:jax_multi_test
December 10, 2024 18:43 1m 4s