[CUDA] QuantizeLinear and DequantizeLinear opset 25 by tianleiwu · Pull Request #28046 · microsoft/onnxruntime

tianleiwu · 2026-04-13T05:41:54Z

No description provided.

github-actions

You can commit the suggested changes from lintrunner.

Copilot

Pull request overview

This PR extends the CUDA Execution Provider’s support for ONNX QuantizeLinear/DequantizeLinear opset 25 by adding the appropriate kernel registrations for opset 25 (and aligning registrations across opsets 21–24), plus targeted CUDA-only tests and regenerated operator kernel documentation.

Changes:

Register CUDA kernels for QuantizeLinear/DequantizeLinear across opsets 21–22, 23–24, and 25+ with the correct type-constraint sets.
Add CUDA-only opset 25 unit tests for QuantizeLinear and DequantizeLinear (including per-axis and blocked int4/uint4 cases).
Update docs/OperatorKernels.md to reflect opset 25+ signatures/type constraints (auto-generated output update).

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File	Description
onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc	Adds CUDA-only opset 25 Q/DQ tests and a helper to run without CPU fallback.
onnxruntime/core/providers/cuda/tensor/quantize_linear.cc	Adds/adjusts CUDA kernel registrations for opsets 21–22, 23–24, and 25+.
onnxruntime/core/providers/cuda/cuda_execution_provider.cc	Updates CUDA kernel class declarations/registry entries to include opset 25 Q/DQ and align versioned registrations.
docs/OperatorKernels.md	Regenerated kernel doc entries reflecting opset 25+ for Q/DQ.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

QuantizeLinear and DequantizeLinear CUDA ops up to opset 25

b51657b

github-actions Bot reviewed Apr 13, 2026

View reviewed changes

Comment thread onnxruntime/core/providers/cuda/tensor/quantize_linear.cc Outdated

Comment thread onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc Outdated

Comment thread onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc Outdated

tianleiwu added 2 commits April 13, 2026 00:05

lintrunner

a007014

Update doc

54f2d47

tianleiwu requested a review from Copilot April 14, 2026 03:08

Copilot started reviewing on behalf of tianleiwu April 14, 2026 03:11 View session

Copilot AI reviewed Apr 14, 2026

View reviewed changes

Comment thread onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc

tianleiwu requested review from kunal-vaishnavi and titaiwangms April 14, 2026 03:49

kunal-vaishnavi approved these changes Apr 14, 2026

View reviewed changes

kunal-vaishnavi merged commit 97e0a00 into main Apr 14, 2026
105 of 106 checks passed

kunal-vaishnavi deleted the tlwu/20260411/qdq_opset_25 branch April 14, 2026 18:23

BrewTestBot mentioned this pull request May 8, 2026

onnxruntime 1.26.0 Homebrew/homebrew-core#281672

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA] QuantizeLinear and DequantizeLinear opset 25#28046

[CUDA] QuantizeLinear and DequantizeLinear opset 25#28046
kunal-vaishnavi merged 3 commits into
mainfrom
tlwu/20260411/qdq_opset_25

tianleiwu commented Apr 13, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tianleiwu commented Apr 13, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants