[CUDA] QuantizeLinear and DequantizeLinear opset 25 #28046

Merged
kunal-vaishnavi merged 3 commits into main from tlwu/20260411/qdq_opset_25 on Apr 14, 2026

@tianleiwu
Contributor

No description provided.

Contributor

@github-actions github-actions Bot left a comment

You can commit the suggested changes from lintrunner.

Comment thread onnxruntime/core/providers/cuda/tensor/quantize_linear.cc Outdated
Comment thread onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc Outdated
Comment thread onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc Outdated
Contributor

Copilot AI left a comment

Pull request overview

This PR adds CUDA Execution Provider support for ONNX QuantizeLinear/DequantizeLinear at opset 25. It registers kernels for opset 25, aligns the existing registrations for opsets 21–24, adds targeted CUDA-only tests, and regenerates the operator kernel documentation.

Changes:

  • Register CUDA kernels for QuantizeLinear/DequantizeLinear across opsets 21–22, 23–24, and 25+ with the correct type-constraint sets.
  • Add CUDA-only opset 25 unit tests for QuantizeLinear and DequantizeLinear (including per-axis and blocked int4/uint4 cases).
  • Update docs/OperatorKernels.md to reflect opset 25+ signatures/type constraints (auto-generated output update).
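
For readers unfamiliar with the per-axis case mentioned above, here is a minimal host-side sketch of the ONNX operator definitions: QuantizeLinear computes `saturate(round(x / scale) + zero_point)` with round-half-to-even, and DequantizeLinear computes `(y - zero_point) * scale`. This is not the CUDA kernel from this PR. The function names `QuantizePerAxis` and `DequantizePerAxis`, the int8 output type, and the fixed 2-D row-major layout with axis 0 are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical host-side reference; not the ONNX Runtime CUDA kernel.
// Input is a row-major [rows x cols] tensor quantized along axis 0,
// so scale[r] and zero_point[r] apply to every element of row r.
std::vector<int8_t> QuantizePerAxis(const std::vector<float>& x,
                                    std::size_t rows, std::size_t cols,
                                    const std::vector<float>& scale,
                                    const std::vector<int8_t>& zero_point) {
  assert(x.size() == rows * cols);
  assert(scale.size() == rows && zero_point.size() == rows);
  std::vector<int8_t> y(x.size());
  for (std::size_t r = 0; r < rows; ++r) {
    for (std::size_t c = 0; c < cols; ++c) {
      // std::nearbyint rounds half to even under the default rounding mode.
      float q = std::nearbyint(x[r * cols + c] / scale[r]) +
                static_cast<float>(zero_point[r]);
      // Saturate to the int8 range before the narrowing cast.
      q = std::min(127.0f, std::max(-128.0f, q));
      y[r * cols + c] = static_cast<int8_t>(q);
    }
  }
  return y;
}

// Inverse mapping: x = (y - zero_point) * scale, again per row.
std::vector<float> DequantizePerAxis(const std::vector<int8_t>& y,
                                     std::size_t rows, std::size_t cols,
                                     const std::vector<float>& scale,
                                     const std::vector<int8_t>& zero_point) {
  assert(y.size() == rows * cols);
  assert(scale.size() == rows && zero_point.size() == rows);
  std::vector<float> x(y.size());
  for (std::size_t r = 0; r < rows; ++r) {
    for (std::size_t c = 0; c < cols; ++c) {
      const int diff = static_cast<int>(y[r * cols + c]) -
                       static_cast<int>(zero_point[r]);
      x[r * cols + c] = static_cast<float>(diff) * scale[r];
    }
  }
  return x;
}
```

Quantizing and then dequantizing is lossy: values that saturate or fall between quantization steps do not round-trip exactly. The blocked int4/uint4 cases in the tests follow the same formulas with a scale per block rather than per row and a narrower saturation range.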

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc | Adds CUDA-only opset 25 Q/DQ tests and a helper to run without CPU fallback. |
| onnxruntime/core/providers/cuda/tensor/quantize_linear.cc | Adds/adjusts CUDA kernel registrations for opsets 21–22, 23–24, and 25+. |
| onnxruntime/core/providers/cuda/cuda_execution_provider.cc | Updates CUDA kernel class declarations/registry entries to include opset 25 Q/DQ and align versioned registrations. |
| docs/OperatorKernels.md | Regenerated kernel doc entries reflecting opset 25+ for Q/DQ. |
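
The split into opsets 21–22, 23–24, and 25+ can be read as a set of inclusive version ranges, the last of which is open-ended. The sketch below models only that lookup. It is not ONNX Runtime's kernel registry or its registration macros; `KernelRange`, `kOpenEnded`, `kQuantizeLinearRanges`, and `FindRange` are hypothetical names, and it covers only the three ranges named in this review summary.

```cpp
#include <array>
#include <cassert>
#include <limits>
#include <string>

// Hypothetical model of versioned kernel registration; not ONNX Runtime's registry.
constexpr int kOpenEnded = std::numeric_limits<int>::max();

struct KernelRange {
  int since_version;  // first opset covered (inclusive)
  int end_version;    // last opset covered (inclusive); kOpenEnded means "N+"
  const char* label;  // human-readable name of the range
};

// The three ranges named in this PR's review summary.
constexpr std::array<KernelRange, 3> kQuantizeLinearRanges = {{
    {21, 22, "21-22"},
    {23, 24, "23-24"},
    {25, kOpenEnded, "25+"},
}};

// Returns the label of the range covering `opset`, or an empty string if none does.
std::string FindRange(int opset) {
  for (const KernelRange& r : kQuantizeLinearRanges) {
    if (opset >= r.since_version && opset <= r.end_version) {
      return r.label;
    }
  }
  return "";
}
```

Under this model, a model declaring opset 25 or later resolves to the open-ended entry, while opsets 21 through 24 resolve to one of the two closed ranges, each of which can carry its own type-constraint set.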

Comment thread onnxruntime/test/providers/cpu/tensor/quantize_linear_test.cc

3 participants