-
Notifications
You must be signed in to change notification settings - Fork 537
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Common] Added an optimized gated rowwise MXFP8 SwiGLU kernel
#2328
opened Oct 31, 2025 by
Oleg-Goncharov
Loading…
5 of 13 tasks
[Pytorch] change fused cross entropy backward grad to fp32 and reduce one read/…
#2325
opened Oct 31, 2025 by
RandMist
Loading…
8 of 13 tasks
[JAX] L1_jax_distributed_test suit with individual executions
#2321
opened Oct 30, 2025 by
phu0ngng
Loading…
7 of 13 tasks
[PyTorch] Implement Selective Activation Checkpointing for LayerNormMLP with checkpoint flag
#2311
opened Oct 28, 2025 by
jaimec00
Loading…
7 of 13 tasks
[JAX] Make test tolerances stricter
#2306
opened Oct 27, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[common] Remove kvpacked and qkvpacked attention functions for every kernel type.
#2287
opened Oct 20, 2025 by
pggPL
Loading…
8 of 13 tasks
[common] Misc improvements for attention
2.10.0
#2272
opened Oct 15, 2025 by
cyanguwa
Loading…
8 of 13 tasks
[Draft][JAX] E2E encoder sanity test with synthetic data
#2269
opened Oct 13, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[PyTorch debug] Fixes to debug tests failures
#2268
opened Oct 13, 2025 by
pggPL
Loading…
7 tasks done
[Draft][JAX] Add "initialize" XLA stage to remaining TE/JAX primitives
#2260
opened Oct 10, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[JAX] xla_home logging during JAX build
#2232
opened Oct 3, 2025 by
jberchtold-nvidia
Loading…
13 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-10-03.