🐛 Describe the bug
The Inductor test case "python test/inductor/test_torchinductor.py GPUTests.test_argmax_argmin2_xpu" fails when the driver is updated from LTS to LTS2. If the ops argmax and argmin fall back to CPU, the case passes again, so the XPU results of argmax and argmin appear to have an accuracy regression between LTS and LTS2.
Reproduce:
LTS:
python test/inductor/test_torchinductor.py GPUTests.test_argmax_argmin2_xpu: pass
LTS2:
python test/inductor/test_torchinductor.py GPUTests.test_argmax_argmin2_xpu: fail
PYTORCH_DEBUG_XPU_FALLBACK=1 PYTORCH_XPU_FALLBACK_OP=argmax,argmin python test/inductor/test_torchinductor.py GPUTests.test_argmax_argmin2_xpu: pass
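
To make the comparison on a given driver easier to repeat, the two runs above (without and with the CPU fallback) could be scripted roughly as follows. This is only a sketch: the helper names `build_env` and `run_case` are hypothetical, it assumes it is launched from the root of a PyTorch checkout, and it treats a zero exit code from the test script as "pass". The test path and the two environment variables are taken verbatim from the commands above.

```python
import os
import subprocess
import sys

# Test invocation copied from the reproduce steps above.
TEST_CMD = [
    sys.executable,
    "test/inductor/test_torchinductor.py",
    "GPUTests.test_argmax_argmin2_xpu",
]

# Environment variables from the fallback command above.
FALLBACK_ENV = {
    "PYTORCH_DEBUG_XPU_FALLBACK": "1",
    "PYTORCH_XPU_FALLBACK_OP": "argmax,argmin",
}


def build_env(use_fallback, base_env=None):
    """Return a copy of the environment with the fallback variables set or cleared."""
    env = dict(os.environ if base_env is None else base_env)
    if use_fallback:
        env.update(FALLBACK_ENV)
    else:
        # Make sure a stale setting in the shell does not hide the failure.
        for key in FALLBACK_ENV:
            env.pop(key, None)
    return env


def run_case(use_fallback):
    """Run the test once; assume exit code 0 means the test passed."""
    result = subprocess.run(TEST_CMD, env=build_env(use_fallback))
    return result.returncode == 0


if __name__ == "__main__":
    for use_fallback in (False, True):
        label = "argmax/argmin fallback to CPU" if use_fallback else "no fallback"
        status = "pass" if run_case(use_fallback) else "fail"
        print(f"{label}: {status}")
```

On LTS2, the observations reported above correspond to "no fallback: fail" and "argmax/argmin fallback to CPU: pass".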
Versions
PyTorch: main or nightly.