Skip to content

Actions: NVIDIA/TransformerEngine

Deploy nightly docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
51 workflow run results
51 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update doc to match with latest code. (#511)
Deploy nightly docs #273: Commit 5debfdb pushed by ptrendx
December 4, 2023 23:26 1m 33s main
December 4, 2023 23:26 1m 33s
Bump GoogleTest to 1.14.0 (#543)
Deploy nightly docs #272: Commit 178b64a pushed by ptrendx
December 4, 2023 22:55 1m 13s main
December 4, 2023 22:55 1m 13s
[PyTorch] TransformerLayer: add support for Falcon architecture (#513)
Deploy nightly docs #271: Commit 4e33a69 pushed by ptrendx
December 4, 2023 22:53 1m 50s main
December 4, 2023 22:53 1m 50s
[JAX] Add checkpoint_name for the recompute granularity control (#542)
Deploy nightly docs #270: Commit c898ab1 pushed by denera
December 4, 2023 17:13 1m 43s main
December 4, 2023 17:13 1m 43s
[PyTorch] Fix incorrect variable name in LayerNormMLP backward (#548)
Deploy nightly docs #269: Commit 92c1e50 pushed by timmoon10
December 1, 2023 23:26 1m 42s main
December 1, 2023 23:26 1m 42s
fix amax -> abs max in fp8_calibration (#534)
Deploy nightly docs #268: Commit 4f1d70f pushed by timmoon10
December 1, 2023 19:10 1m 28s main
December 1, 2023 19:10 1m 28s
[JAX] Prepare cross flash attention (#525)
Deploy nightly docs #267: Commit 4d444db pushed by cyanguwa
December 1, 2023 17:53 1m 34s main
December 1, 2023 17:53 1m 34s
wgrad should be zero'ed out if a weight parameter is shared among m…
Deploy nightly docs #266: Commit 387397a pushed by timmoon10
November 30, 2023 19:41 1m 34s main
November 30, 2023 19:41 1m 34s
[JAX] Support layernorm/rmsnorm sm_margin control through environment…
Deploy nightly docs #265: Commit 753eed3 pushed by denera
November 30, 2023 03:17 1m 18s main
November 30, 2023 03:17 1m 18s
[JAX] Use relative idx to ScaledUpperTriangMaskedSoftmaxFwdPrimitive …
Deploy nightly docs #264: Commit 0fc402f pushed by denera
November 30, 2023 03:02 1m 23s main
November 30, 2023 03:02 1m 23s
[PyTorch] Linear: fix computation for wgrad if sequence_parallel=True…
Deploy nightly docs #263: Commit d76118d pushed by timmoon10
November 28, 2023 22:40 1m 41s main
November 28, 2023 22:40 1m 41s
Use non-deprecated PyTorch methods to silence warnings (#541)
Deploy nightly docs #262: Commit 54e46e2 pushed by timmoon10
November 28, 2023 18:23 1m 50s main
November 28, 2023 18:23 1m 50s
Use unsigned char when instantiating DType::kByte (#540)
Deploy nightly docs #261: Commit cbcac3f pushed by timmoon10
November 28, 2023 18:20 1m 28s main
November 28, 2023 18:20 1m 28s
[Paddle] Add TP overlap (#443)
Deploy nightly docs #260: Commit 666539f pushed by timmoon10
November 23, 2023 02:52 1m 21s main
November 23, 2023 02:52 1m 21s
[Paddle] Fix issues (#515)
Deploy nightly docs #259: Commit 8864983 pushed by cyanguwa
November 21, 2023 19:55 1m 24s main
November 21, 2023 19:55 1m 24s
[JAX] Fix JAX distributed unit tests (#521)
Deploy nightly docs #258: Commit ea43b18 pushed by denera
November 20, 2023 18:41 1m 36s main
November 20, 2023 18:41 1m 36s
Changed VERSION to 1.2.0dev
Deploy nightly docs #257: Commit 6159af4 pushed by ptrendx
November 17, 2023 17:19 1m 36s main
November 17, 2023 17:19 1m 36s
Disable FAv2.1+ for causal mask in cross attention (#522)
Deploy nightly docs #256: Commit da55d24 pushed by ptrendx
November 17, 2023 17:15 1m 22s main
November 17, 2023 17:15 1m 22s
[PyTorch] FP8 Tensor improvements (#500)
Deploy nightly docs #255: Commit 1508821 pushed by ksivaman
November 17, 2023 17:13 1m 39s main
November 17, 2023 17:13 1m 39s
feat(code quality): Add comments for parallel welford variance calcul…
Deploy nightly docs #254: Commit e6676c5 pushed by ptrendx
November 16, 2023 23:33 1m 46s main
November 16, 2023 23:33 1m 46s
Fix flash-attn checks and RoPE DPA (#506)
Deploy nightly docs #253: Commit 7f2f7dd pushed by ksivaman
November 15, 2023 02:45 1m 48s main
November 15, 2023 02:45 1m 48s
[JAX] Migrating from Xmap to Custom Partitioning for All Custom Calls…
Deploy nightly docs #252: Commit 71e51ea pushed by denera
November 14, 2023 18:33 1m 33s main
November 14, 2023 18:33 1m 33s
Update README.rst - Installation section (#502)
Deploy nightly docs #251: Commit 7976bd0 pushed by ksivaman
November 13, 2023 23:28 1m 38s main
November 13, 2023 23:28 1m 38s
[PyTorch] Improve memory usage in backward of LayerNormLinear and Lay…
Deploy nightly docs #250: Commit a9cfbfd pushed by ksivaman
November 13, 2023 23:13 1m 47s main
November 13, 2023 23:13 1m 47s
[C/JAX] Support more mask types for the arbitrary seqlen kernels and …
Deploy nightly docs #249: Commit bfaec64 pushed by cyanguwa
November 13, 2023 21:00 1m 59s main
November 13, 2023 21:00 1m 59s