forked from pytorch/pytorch
-
Notifications
You must be signed in to change notification settings - Fork 58
Pull requests: ROCm/pytorch
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable load-compute-store interleaving for unrolled elementwise kernel.
#1886
opened Feb 6, 2025 by
carlobertolli
•
Draft
[rocm6.4_internal_testing] [NAVI32] Skipped sdpa_2 test in test_aot_inductor for Navi32
#1882
opened Feb 5, 2025 by
iupaikov-amd
Loading…
[release/2.6] [ROCm] Improvements for vectorized elementwise kernels (#143269)
#1878
opened Feb 3, 2025 by
jerrymannil
Loading…
[AUTOGENERATED] [release/2.6] [ROCM] Enable *_load_dwordx4 ISA for BFloat16 and Half.
#1877
opened Feb 3, 2025 by
rocm-mici
Loading…
Revert "[release/2.4] fix test_pointwise_op_fusion_post_grad (#1763)"
#1865
opened Jan 30, 2025 by
dnikolaev-amd
Loading…
[Do NOT MERGE] [release/2.5] Enable tf32 testing on test_nn
#1859
opened Jan 27, 2025 by
jagadish-amd
Loading…
[ROCm] Eliminate the need for divisions in layernorm for default vector size.
#1850
opened Jan 22, 2025 by
doru1004
Loading…
[release/2.4] Update numpy versions to fix PyTorch wheel build issues
#1822
opened Jan 8, 2025 by
jithunnair-amd
•
Draft
[ROCm][WIP] Improve performance of casted elementwise add operations
#1805
opened Dec 20, 2024 by
doru1004
Loading…
[WIP][release/2.5] refactor condition to use miopen for batchnorm
#1787
opened Dec 13, 2024 by
dnikolaev-amd
•
Draft
[release/2.5] Fixed string comparison in test_cpp_wrapper_hipify
#1760
opened Nov 29, 2024 by
iupaikov-amd
Loading…
[release/2.5] Enabled force_shape_pad for test_pad_mm and test_slice_mm_bandwidth_computation
#1755
opened Nov 28, 2024 by
iupaikov-amd
Loading…
[release/2.5] Skipped some inductor tests for no hipcc rocm environments
#1697
opened Nov 13, 2024 by
iupaikov-amd
•
Draft
Previous Next
ProTip!
Follow long discussions with comments:>50.