Skip to content

Actions: NVIDIA/TransformerEngine

Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,534 workflow runs
3,534 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[C/JAX] Comm+GEMM Overlap API for TE/JAX
Documentation #5788: Pull request #1337 synchronize by denera
December 5, 2024 19:54 1m 4s denera:jax-collective-gemm-with-overlap
December 5, 2024 19:54 1m 4s
[PyTorch] Bugfix for wgrad bulk overlap conflict when dgrad overlap is reduce-scatter
Documentation #5787: Pull request #1341 synchronize by pre-commit-ci bot
December 5, 2024 15:47 Action required denera:rs-dgrad-overlap-bugfix
December 5, 2024 15:47 Action required
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5785: Pull request #1356 synchronize by phu0ngng
December 5, 2024 14:31 58s phu0ngng:jax_multi_test
December 5, 2024 14:31 58s
[PyTorch] Bugfix for wgrad bulk overlap conflict when dgrad overlap is reduce-scatter
Documentation #5784: Pull request #1341 synchronize by pre-commit-ci bot
December 5, 2024 14:28 Action required denera:rs-dgrad-overlap-bugfix
December 5, 2024 14:28 Action required
[JAX] Fused attention unit tests fixes and refinements
Documentation #5781: Pull request #1352 synchronize by zlsh80826
December 5, 2024 06:05 1m 2s zlsh80826:rewang/fa-refactor
December 5, 2024 06:05 1m 2s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5780: Pull request #1358 synchronize by youngeunkwon0405
December 5, 2024 01:49 1m 17s youngeunkwon0405:fsdp2
December 5, 2024 01:49 1m 17s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5778: Pull request #1358 synchronize by youngeunkwon0405
December 5, 2024 01:39 1m 14s youngeunkwon0405:fsdp2
December 5, 2024 01:39 1m 14s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5777: Pull request #1358 synchronize by pre-commit-ci bot
December 5, 2024 01:30 58s youngeunkwon0405:fsdp2
December 5, 2024 01:30 58s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5776: Pull request #1358 synchronize by youngeunkwon0405
December 5, 2024 01:30 1m 37s youngeunkwon0405:fsdp2
December 5, 2024 01:30 1m 37s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5775: Pull request #1358 synchronize by pre-commit-ci bot
December 5, 2024 01:21 1m 25s youngeunkwon0405:fsdp2
December 5, 2024 01:21 1m 25s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2
Documentation #5770: Pull request #1358 synchronize by youngeunkwon0405
December 5, 2024 01:08 1m 16s youngeunkwon0405:fsdp2
December 5, 2024 01:08 1m 16s
Disable FP8 in Mcore integration test on older GPUs
Documentation #5767: Pull request #1357 synchronize by timmoon10
December 5, 2024 00:06 57s timmoon10:debug-mcore-test
December 5, 2024 00:06 57s
Disable FP8 in Mcore integration test on older GPUs
Documentation #5766: Pull request #1357 opened by timmoon10
December 5, 2024 00:05 1m 5s timmoon10:debug-mcore-test
December 5, 2024 00:05 1m 5s
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5765: Pull request #1356 synchronize by phu0ngng
December 4, 2024 19:41 59s phu0ngng:jax_multi_test
December 4, 2024 19:41 59s
[JAX] Move parallel encoder tests to L0 distributed test set.
Documentation #5764: Pull request #1356 opened by phu0ngng
December 4, 2024 19:41 1m 11s phu0ngng:jax_multi_test
December 4, 2024 19:41 1m 11s
Add paged attention support
Documentation #5763: Pull request #1355 synchronize by pre-commit-ci bot
December 4, 2024 05:45 1m 1s cyanguwa:paged_attention
December 4, 2024 05:45 1m 1s
Add paged attention support
Documentation #5762: Pull request #1355 synchronize by cyanguwa
December 4, 2024 05:44 1m 9s cyanguwa:paged_attention
December 4, 2024 05:44 1m 9s