Pull requests: ROCm/TransformerEngine

Pull requests list

PyTorch FA test fix
#370 opened Nov 12, 2025 by Micky774
13 tasks
Current scaling: two-stage amax kernel
#369 opened Nov 12, 2025 by matthiasdiener Draft
1 of 13 tasks
Userbuffer epic
#367 opened Nov 11, 2025 by alextmagro Draft
Enable AITER ASM distributed FA testing in jax/torch
#363 opened Nov 5, 2025 by Micky774
13 tasks
Experimental rocSHMEM support
#356 opened Oct 29, 2025 by alextmagro
JAX FA Benchmarking Script
#351 opened Oct 24, 2025 by Micky774
13 tasks
[NO MERGE] Release v2.4 rocm
#334 opened Oct 8, 2025 by alextmagro
[Fix] Added dbias and dgelu kernels for ROCm
#333 opened Oct 6, 2025 by AllenFarcas
6 of 13 tasks
CI: GitHub Action migration from Jenkins CI
#322 opened Sep 26, 2025 by leo-amd
Triton norms dispatch refactor
#305 opened Sep 5, 2025 by Micky774
13 tasks
heyi's layernorm optimization
#225 opened Jul 3, 2025 by eliotwang
8 of 13 tasks
Added Dockerfile for CI images
#195 opened May 28, 2025 by VeeraRajasekhar
7 of 13 tasks
[ROCm] support triton-based flash-attn in TE
#177 opened May 1, 2025 by wangye805
8 of 13 tasks
Update attention example attention.ipynb
#152 opened Mar 19, 2025 by anhminhnguyenhoang
5 of 13 tasks
Honor the NVTE_FUSED_ATTN_<backend> in test_fused_attn.py
#123 opened Feb 11, 2025 by wangye805
13 tasks