Skip to content

Actions: NVIDIA/TransformerEngine

Deploy nightly docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
533 workflow runs
533 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Support store_param_remainders feature from Apex in TE Fused Adam (…
Deploy nightly docs #775: Commit e536954 pushed by timmoon10
January 31, 2025 01:45 1m 26s main
January 31, 2025 01:45 1m 26s
Update neox to completed (#1439)
Deploy nightly docs #774: Commit 96534aa pushed by ptrendx
January 30, 2025 16:32 3m 1s main
January 30, 2025 16:32 3m 1s
Use log1p(x) instead of log(1+x) (#1401)
Deploy nightly docs #773: Commit 199e612 pushed by xrennvidia
January 28, 2025 00:51 1m 27s main
January 28, 2025 00:51 1m 27s
[MoE][PyTorch] Add mask-based MoE permutation (#1373)
Deploy nightly docs #772: Commit 2fce82b pushed by phu0ngng
January 27, 2025 16:01 1m 30s main
January 27, 2025 16:01 1m 30s
[JAX] Support segment_ids/pos as FA inputs (#1406)
Deploy nightly docs #771: Commit c2c3d54 pushed by zlsh80826
January 24, 2025 05:50 1m 20s main
January 24, 2025 05:50 1m 20s
[PyTorch] Avoid parameters function in op backward pass (#1403)
Deploy nightly docs #770: Commit 3d7ff1c pushed by timmoon10
January 22, 2025 21:06 1m 31s main
January 22, 2025 21:06 1m 31s
[PyTorch] Fix AttentionParams comparison logic (#1397)
Deploy nightly docs #769: Commit 7aa8118 pushed by cyanguwa
January 21, 2025 18:21 1m 35s main
January 21, 2025 18:21 1m 35s
[JAX] Consolidate the distributed fused attention test code (#1405)
Deploy nightly docs #768: Commit 6e84892 pushed by mgoldfarb-nvidia
January 17, 2025 04:08 1m 30s main
January 17, 2025 04:08 1m 30s
[PyTorch] te.Linear FP8 DGRAD+RS output bugfix (#1412)
Deploy nightly docs #767: Commit c2937c5 pushed by denera
January 16, 2025 20:32 1m 34s main
January 16, 2025 20:32 1m 34s
Make it an option to compile activation functions with fast math (#1410)
Deploy nightly docs #766: Commit 3d63cbb pushed by ksivaman
January 15, 2025 18:12 1m 33s main
January 15, 2025 18:12 1m 33s
[PyTorch] Adding TP overlap support for te.Linear with `parallel_mo…
Deploy nightly docs #765: Commit 2402406 pushed by denera
January 13, 2025 20:24 1m 31s main
January 13, 2025 20:24 1m 31s
Fix "refractor" typo in the PR template (#1402)
Deploy nightly docs #764: Commit cbc4653 pushed by timmoon10
January 13, 2025 19:28 1m 33s main
January 13, 2025 19:28 1m 33s
[JAX] Test_multiprocessing_encoder with process spawn in bash (#1394)
Deploy nightly docs #763: Commit a65ad37 pushed by phu0ngng
January 11, 2025 00:53 2m 30s main
January 11, 2025 00:53 2m 30s
Take token count quantization of fused attention into consideration f…
Deploy nightly docs #762: Commit 7b861e7 pushed by xrennvidia
January 10, 2025 09:47 1m 24s main
January 10, 2025 09:47 1m 24s
clean CP implementation for flash attention and cuDNN 9.6 (#1387)
Deploy nightly docs #761: Commit 560bccf pushed by xrennvidia
January 8, 2025 18:09 1m 32s main
January 8, 2025 18:09 1m 32s
[JAX] Correct fused attention output after each step of ring attentio…
Deploy nightly docs #760: Commit a4cb1d1 pushed by mgoldfarb-nvidia
January 8, 2025 16:09 1m 38s main
January 8, 2025 16:09 1m 38s
bug fix for using return_layernorm_output=True (#1382)
Deploy nightly docs #759: Commit 61cf102 pushed by timmoon10
January 8, 2025 02:07 1m 23s main
January 8, 2025 02:07 1m 23s
[JAX] Add THD + SWA unit tests (#1390)
Deploy nightly docs #758: Commit b898cbe pushed by zlsh80826
January 8, 2025 00:31 1m 20s main
January 8, 2025 00:31 1m 20s
Update copyright to include 2025 (#1388)
Deploy nightly docs #757: Commit c9ea6be pushed by ksivaman
January 2, 2025 22:21 1m 18s main
January 2, 2025 22:21 1m 18s
[common/PyTorch] Add cuDNN SWA (left, 0) + padding + bottom right cau…
Deploy nightly docs #756: Commit 838345e pushed by cyanguwa
December 20, 2024 05:32 1m 15s main
December 20, 2024 05:32 1m 15s
[JAX] Move parallel encoder tests to L0 distributed test set. (#1356)
Deploy nightly docs #755: Commit a3b32ec pushed by phu0ngng
December 18, 2024 15:47 1m 38s main
December 18, 2024 15:47 1m 38s
[PyTorch] Fix get_swa_mask() for padding masks (#1281)
Deploy nightly docs #754: Commit f033498 pushed by cyanguwa
December 18, 2024 02:15 1m 39s main
December 18, 2024 02:15 1m 39s
[PyTorch] Add weights_only=False for torch.load (#1374)
Deploy nightly docs #753: Commit 83dac8c pushed by cyanguwa
December 18, 2024 02:15 1m 40s main
December 18, 2024 02:15 1m 40s
[JAX] Fused attention unit tests fixes and refinements (#1352)
Deploy nightly docs #752: Commit 7f5c784 pushed by zlsh80826
December 17, 2024 07:41 1m 23s main
December 17, 2024 07:41 1m 23s
[common] Add max_t support for KV in THD (#1370)
Deploy nightly docs #751: Commit f4f35c2 pushed by cyanguwa
December 17, 2024 03:57 1m 20s main
December 17, 2024 03:57 1m 20s