Deploy nightly docs

Actions

Deploy nightly docs

Actions

Loading...
Loading

510 workflow run results

[JAX] Consolidate FFI and old descriptor implementation for fused att… Deploy nightly docs #701: Commit c036765 pushed by phu0ngng

October 30, 2024 01:05

1m 10s main

main

October 30, 2024 01:05

1m 10s

Add missed arguments of apply_rotary_pos_emb in MHA (#1296) Deploy nightly docs #700: Commit ed1e85c pushed by xrennvidia

October 30, 2024 00:21

1m 23s main

main

October 30, 2024 00:21

1m 23s

Add check for GPU availability in attention (#1287) Deploy nightly docs #699: Commit 8bdb54f pushed by cyanguwa

October 29, 2024 20:30

1m 9s main

main

October 29, 2024 20:30

1m 9s

[PyTorch] Skip t3hd/th3d for MQA/GQA tests (#1293) Deploy nightly docs #698: Commit d710c24 pushed by cyanguwa

October 29, 2024 20:26

1m 17s main

main

October 29, 2024 20:26

1m 17s

[C/PyTorch] Userbuffers and comm+GEMM overlap algorithms refactored a… Deploy nightly docs #697: Commit 933294d pushed by denera

October 29, 2024 15:06

1m 12s main

main

October 29, 2024 15:06

1m 12s

[PyTorch] Remove fast param getter from modules (#1291) Deploy nightly docs #696: Commit 35bbe74 pushed by timmoon10

October 28, 2024 23:47

1m 5s main

main

October 28, 2024 23:47

1m 5s

[C/PyTorch] Add max_t support for THD (#1244) Deploy nightly docs #695: Commit 7fb22c3 pushed by cyanguwa

October 25, 2024 20:29

1m 9s main

main

October 25, 2024 20:29

1m 9s

[C/PyTorch] Add THD MQA/GQA (#1266) Deploy nightly docs #694: Commit 83f9cc0 pushed by cyanguwa

October 25, 2024 20:25

1m 10s main

main

October 25, 2024 20:25

1m 10s

Support building documentation in Python 3.12 (#1274) Deploy nightly docs #693: Commit 8e4ee12 pushed by timmoon10

October 25, 2024 19:10

1m 7s main

main

October 25, 2024 19:10

1m 7s

[TE/JAX] Update required JAX version for FFI custom calls with cudaGr… Deploy nightly docs #692: Commit 7cef756 pushed by phu0ngng

October 25, 2024 17:48

1m 3s main

main

October 25, 2024 17:48

1m 3s

[Pytorch] Check gradient in test numerics (#1229) Deploy nightly docs #691: Commit 7b284fe pushed by pggPL

October 24, 2024 18:13

1m 4s main

main

October 24, 2024 18:13

1m 4s

[Paddle] Update type names for Paddle 3.0 (#1286) Deploy nightly docs #690: Commit 7a5fd0c pushed by timmoon10

October 24, 2024 17:52

1m 6s main

main

October 24, 2024 17:52

1m 6s

[JAX] XLA Custom Calls with FFI for FusedAttnFwd, Quantize, Transpose… Deploy nightly docs #689: Commit 18c2234 pushed by huanghua1994

October 24, 2024 15:27

1m 43s main

main

October 24, 2024 15:27

1m 43s

[JAX] Fix correctness of JAX fused attention with CP and improve nume… Deploy nightly docs #688: Commit 20c7529 pushed by mgoldfarb-nvidia

October 24, 2024 13:51

1m 7s main

main

October 24, 2024 13:51

1m 7s

Add THD + GQA supports (#1260) Deploy nightly docs #687: Commit d9b4bfb pushed by cyanguwa

October 22, 2024 22:37

58s main

main

October 22, 2024 22:37

58s

[JAX] Skip V100 encoder tests (#1262) Deploy nightly docs #686: Commit 35f7d26 pushed by phu0ngng

October 22, 2024 19:44

1m 14s main

main

October 22, 2024 19:44

1m 14s

Fused Attention Support 64-bit Ragged Offsets for Large THD Tensors (… Deploy nightly docs #685: Commit 7b18f23 pushed by mgoldfarb-nvidia

October 22, 2024 14:24

1m 11s main

main

October 22, 2024 14:24

1m 11s

[PyTorch] Reduce the number of FA versions in L3 tests (#1280) Deploy nightly docs #684: Commit 29e3a09 pushed by timmoon10

October 21, 2024 21:43

1m 0s main

main

October 21, 2024 21:43

1m 0s

[PyTorch] Remove PyTorch L0 distributed test (#1273) Deploy nightly docs #683: Commit 3ea7dd3 pushed by timmoon10

October 18, 2024 18:04

1m 9s main

main

October 18, 2024 18:04

1m 9s

[Paddle] Debug wheel test (#1265) Deploy nightly docs #682: Commit 927bca7 pushed by timmoon10

October 18, 2024 17:22

1m 4s main

main

October 18, 2024 17:22

1m 4s

[PyTorch] Reorganize L1 tests (#1255) Deploy nightly docs #681: Commit 41fe1e5 pushed by timmoon10

October 18, 2024 01:57

1m 4s main

main

October 18, 2024 01:57

1m 4s

Fix seq_dim in CP implementation (#1264) Deploy nightly docs #680: Commit a488b8b pushed by xrennvidia

October 17, 2024 18:21

1m 33s main

main

October 17, 2024 18:21

1m 33s

[TE/JAX] Enabling CudaGraph for custom calls with FFI (#1228) Deploy nightly docs #679: Commit 12f30ea pushed by phu0ngng

October 17, 2024 15:30

1m 5s main

main

October 17, 2024 15:30

1m 5s

[Bugfix] Fix bias for 0-dim tensors in gemm (#1246) Deploy nightly docs #678: Commit 8e97c8d pushed by yaox12

October 17, 2024 14:48

1m 15s main

main

October 17, 2024 14:48

1m 15s

[PyTorch] Fix wgrads for GroupedLinear when weights don't require gra… Deploy nightly docs #677: Commit 2d7020e pushed by yaox12

October 17, 2024 13:20

1m 7s main

main

October 17, 2024 13:20

1m 7s

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actions

Workflows

Management