Actions: NVIDIA/TransformerEngine

Deploy nightly docs

510 workflow run results

[JAX] Consolidate FFI and old descriptor implementation for fused att…
Deploy nightly docs #701: Commit c036765 pushed by phu0ngng
October 30, 2024 01:05 1m 10s main
Add missed arguments of apply_rotary_pos_emb in MHA (#1296)
Deploy nightly docs #700: Commit ed1e85c pushed by xrennvidia
October 30, 2024 00:21 1m 23s main
Add check for GPU availability in attention (#1287)
Deploy nightly docs #699: Commit 8bdb54f pushed by cyanguwa
October 29, 2024 20:30 1m 9s main
[PyTorch] Skip t3hd/th3d for MQA/GQA tests (#1293)
Deploy nightly docs #698: Commit d710c24 pushed by cyanguwa
October 29, 2024 20:26 1m 17s main
[C/PyTorch] Userbuffers and comm+GEMM overlap algorithms refactored a…
Deploy nightly docs #697: Commit 933294d pushed by denera
October 29, 2024 15:06 1m 12s main
[PyTorch] Remove fast param getter from modules (#1291)
Deploy nightly docs #696: Commit 35bbe74 pushed by timmoon10
October 28, 2024 23:47 1m 5s main
[C/PyTorch] Add max_t support for THD (#1244)
Deploy nightly docs #695: Commit 7fb22c3 pushed by cyanguwa
October 25, 2024 20:29 1m 9s main
[C/PyTorch] Add THD MQA/GQA (#1266)
Deploy nightly docs #694: Commit 83f9cc0 pushed by cyanguwa
October 25, 2024 20:25 1m 10s main
Support building documentation in Python 3.12 (#1274)
Deploy nightly docs #693: Commit 8e4ee12 pushed by timmoon10
October 25, 2024 19:10 1m 7s main
[TE/JAX] Update required JAX version for FFI custom calls with cudaGr…
Deploy nightly docs #692: Commit 7cef756 pushed by phu0ngng
October 25, 2024 17:48 1m 3s main
[Pytorch] Check gradient in test numerics (#1229)
Deploy nightly docs #691: Commit 7b284fe pushed by pggPL
October 24, 2024 18:13 1m 4s main
[Paddle] Update type names for Paddle 3.0 (#1286)
Deploy nightly docs #690: Commit 7a5fd0c pushed by timmoon10
October 24, 2024 17:52 1m 6s main
[JAX] XLA Custom Calls with FFI for FusedAttnFwd, Quantize, Transpose…
Deploy nightly docs #689: Commit 18c2234 pushed by huanghua1994
October 24, 2024 15:27 1m 43s main
[JAX] Fix correctness of JAX fused attention with CP and improve nume…
Deploy nightly docs #688: Commit 20c7529 pushed by mgoldfarb-nvidia
October 24, 2024 13:51 1m 7s main
Add THD + GQA supports (#1260)
Deploy nightly docs #687: Commit d9b4bfb pushed by cyanguwa
October 22, 2024 22:37 58s main
[JAX] Skip V100 encoder tests (#1262)
Deploy nightly docs #686: Commit 35f7d26 pushed by phu0ngng
October 22, 2024 19:44 1m 14s main
Fused Attention Support 64-bit Ragged Offsets for Large THD Tensors (…
Deploy nightly docs #685: Commit 7b18f23 pushed by mgoldfarb-nvidia
October 22, 2024 14:24 1m 11s main
[PyTorch] Reduce the number of FA versions in L3 tests (#1280)
Deploy nightly docs #684: Commit 29e3a09 pushed by timmoon10
October 21, 2024 21:43 1m 0s main
[PyTorch] Remove PyTorch L0 distributed test (#1273)
Deploy nightly docs #683: Commit 3ea7dd3 pushed by timmoon10
October 18, 2024 18:04 1m 9s main
[Paddle] Debug wheel test (#1265)
Deploy nightly docs #682: Commit 927bca7 pushed by timmoon10
October 18, 2024 17:22 1m 4s main
[PyTorch] Reorganize L1 tests (#1255)
Deploy nightly docs #681: Commit 41fe1e5 pushed by timmoon10
October 18, 2024 01:57 1m 4s main
Fix seq_dim in CP implementation (#1264)
Deploy nightly docs #680: Commit a488b8b pushed by xrennvidia
October 17, 2024 18:21 1m 33s main
[TE/JAX] Enabling CudaGraph for custom calls with FFI (#1228)
Deploy nightly docs #679: Commit 12f30ea pushed by phu0ngng
October 17, 2024 15:30 1m 5s main
[Bugfix] Fix bias for 0-dim tensors in gemm (#1246)
Deploy nightly docs #678: Commit 8e97c8d pushed by yaox12
October 17, 2024 14:48 1m 15s main
[PyTorch] Fix wgrads for GroupedLinear when weights don't require gra…
Deploy nightly docs #677: Commit 2d7020e pushed by yaox12
October 17, 2024 13:20 1m 7s main