Add support for NHWC GridSample in the CUDA EP and enable grid_sample_test for all EPs #19562

mtavenrath · 2024-02-19T14:45:49Z

Description

I've added NHWC GridSample support to the CUDA EP to reduce the number of layout transforms. I've also enabled the full set of GridSampleTests for all EPs and added GridSample opset 16 to the registered kernels.
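To illustrate why layout matters here, the following is a minimal sketch (not code from this PR) of how the same logical element `(n, c, h, w)` maps to different flat offsets in NCHW and NHWC buffers. The helper names `OffsetNCHW` and `OffsetNHWC` are hypothetical and exist only for this example. A kernel that consumes NHWC directly can skip the transpose that would otherwise be inserted around an NCHW-only kernel.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: flat offset of element (n, c, h, w) in a contiguous
// NCHW tensor with C channels, height H and width W.
inline int64_t OffsetNCHW(int64_t n, int64_t c, int64_t h, int64_t w,
                          int64_t C, int64_t H, int64_t W) {
  assert(c >= 0 && c < C && h >= 0 && h < H && w >= 0 && w < W);
  return ((n * C + c) * H + h) * W + w;
}

// Hypothetical helper: flat offset of the same logical element in a
// contiguous NHWC tensor. Channels are innermost, so all channels of one
// pixel are adjacent in memory.
inline int64_t OffsetNHWC(int64_t n, int64_t c, int64_t h, int64_t w,
                          int64_t C, int64_t H, int64_t W) {
  assert(c >= 0 && c < C && h >= 0 && h < H && w >= 0 && w < W);
  return ((n * H + h) * W + w) * C + c;
}
```

For a tensor with C = 2, H = 3, W = 4, moving from channel 0 to channel 1 at the same pixel is a stride of 12 elements in NCHW but a stride of 1 in NHWC.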

Motivation and Context

This is the first PR in a series of enhancements to the CUDA EP that improve NHWC support, avoiding costly layout transforms between NHWC and NCHW nodes, which are layout sensitive. Testing was also quite rudimentary for the CUDA EP, while it was thorough for the CPU path. I've regenerated grid_sample_test.cc, enabling the tests for other platforms as well. Those tests resurfaced #10607, which is fixed here as well.
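For context on the border padding case mentioned in #10607, here is a hedged sketch of the coordinate math as I understand it from the GridSample operator definition, which follows PyTorch's `grid_sample`. It is not the CUDA kernel from this PR, and the function names `Denormalize` and `ClampToBorder` are hypothetical. A normalized grid coordinate in [-1, 1] is first mapped to pixel space, with the mapping depending on `align_corners`, and in border mode the result is then clamped to the valid index range.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical helper: maps a normalized coordinate n in [-1, 1] to a
// pixel-space coordinate along an axis of the given length.
// With align_corners, -1 and 1 refer to the centers of the corner pixels;
// without it, they refer to the outer edges of the corner pixels.
inline float Denormalize(float n, int length, bool align_corners) {
  assert(length > 0);
  if (align_corners) {
    return (n + 1.0f) / 2.0f * static_cast<float>(length - 1);
  }
  return ((n + 1.0f) * static_cast<float>(length) - 1.0f) / 2.0f;
}

// Hypothetical helper: border padding clamps the pixel-space coordinate to
// the valid index range [0, length - 1].
inline float ClampToBorder(float x, int length) {
  assert(length > 0);
  return std::min(std::max(x, 0.0f), static_cast<float>(length - 1));
}
```

With `length = 4`, a grid value of 1 maps to 3 when `align_corners` is set and to 3.5 otherwise; in border mode the latter is clamped back to 3.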

onnxruntime/core/providers/cuda/cuda_provider_factory.cc

onnxruntime/contrib_ops/cuda/grid_sample.cc

onnxruntime/test/providers/cpu/tensor/grid_sample_test.cc

tianleiwu · 2024-02-21T00:01:03Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-02-21T00:01:04Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Android CI Pipeline

tianleiwu · 2024-02-21T00:01:04Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-02-21T00:02:35Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-02-21T00:02:59Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-02-21T00:03:02Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-02-21T00:49:27Z

/azp run Big Models

azure-pipelines · 2024-02-21T00:49:36Z

Azure Pipelines successfully started running 1 pipeline(s).

onnxruntime/test/providers/cpu/tensor/grid_sample_test.cc

tianleiwu · 2024-02-21T20:42:34Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-02-21T20:42:34Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models

tianleiwu · 2024-02-21T20:42:35Z

/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-02-21T20:42:53Z

Azure Pipelines successfully started running 3 pipeline(s).

azure-pipelines · 2024-02-21T20:43:10Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-02-21T20:43:13Z

Azure Pipelines successfully started running 10 pipeline(s).

onnxruntime/test/util/default_providers.cc

tianleiwu · 2024-02-22T17:26:31Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-02-22T17:26:32Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Android CI Pipeline

tianleiwu · 2024-02-22T17:26:33Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-02-22T17:26:49Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-02-22T17:27:09Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-02-22T17:27:11Z

Azure Pipelines successfully started running 10 pipeline(s).

onnxruntime/contrib_ops/cuda/grid_sample_impl.cu

tianleiwu · 2024-02-22T19:39:28Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-02-22T19:39:29Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Android CI Pipeline

tianleiwu · 2024-02-22T19:39:30Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-02-22T19:39:46Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-02-22T19:40:08Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-02-22T19:40:13Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-02-22T21:32:47Z

The kernelDocumentation pipeline failed. Please download the documents from
https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1301115&view=artifacts&pathAsName=false&type=publishedArtifacts
and update the files with the same names under docs/.

tianleiwu · 2024-02-22T22:10:56Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-02-22T22:10:56Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-python-checks-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Android CI Pipeline

tianleiwu · 2024-02-22T22:10:57Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-02-22T22:11:13Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-02-22T22:11:31Z

Azure Pipelines successfully started running 9 pipeline(s).

azure-pipelines · 2024-02-22T22:11:33Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-02-23T01:06:56Z

/azp run Big Models

azure-pipelines · 2024-02-23T01:07:08Z

Azure Pipelines successfully started running 1 pipeline(s).

mtavenrath changed the title ~~Add support for HWC GridSample in the CUDA EP and enable grid_sample_test for all EPs~~ Add support for NHWC GridSample in the CUDA EP and enable grid_sample_test for all EPs Feb 19, 2024

tianleiwu reviewed Feb 19, 2024

onnxruntime/core/providers/cuda/cuda_provider_factory.cc (outdated)

tianleiwu reviewed Feb 19, 2024

onnxruntime/contrib_ops/cuda/grid_sample.cc (outdated)

tianleiwu reviewed Feb 19, 2024

onnxruntime/test/providers/cpu/tensor/grid_sample_test.cc (outdated)

mtavenrath mentioned this pull request Feb 20, 2024

Test execute only the first EP when being passed a positive EP list instead of an excluded list #19573

Open

mtavenrath added 5 commits February 21, 2024 16:52

Implement support for NHWC GridSample in the CUDA EP.

89d697a

Fix issue microsoft#10607, grid_sample CUDA kernel clamping is incorrect for border padding mode with align = 1.

e88f1b2

Update grid_sample_test.cc to run on all execution providers & change GridSample NHWC version to 16.

dac6e76

Run lintrunner & remove nhwc hack.

16bc365

Add functionality for inclusive list to run grid_sample tests

7431b20

mtavenrath force-pushed the gridsample_nhwc branch from 68fc83e to 7431b20 February 21, 2024 15:55

tianleiwu reviewed Feb 21, 2024

onnxruntime/test/providers/cpu/tensor/grid_sample_test.cc (outdated)

tianleiwu reviewed Feb 21, 2024

onnxruntime/test/util/default_providers.cc

Fix lvalue issue and protect DefaultCudaNHWCExecutionProvider() by NHWC ifdef

79b8827

Fix lintrunner issues & unused variable without CUDA build.

2b5a5db

tianleiwu reviewed Feb 22, 2024

onnxruntime/contrib_ops/cuda/grid_sample_impl.cu (outdated)

Add missing (

58a81b9

Updating docs & fix lambda capture

c91413c

tianleiwu approved these changes Feb 22, 2024

tianleiwu merged commit 5e432a3 into microsoft:main Feb 23, 2024
79 of 81 checks passed

WolframRhodium added a commit to AmusementClub/vs-mlrt that referenced this pull request Apr 19, 2024

vsort/vs_onnxruntime.cpp: remove gridsample hack for cuda ep

2a0b7bc

microsoft/onnxruntime#19562
