Validate g_idx values in MatMulNBits to prevent OOB read by vraspar · Pull Request #27582 · microsoft/onnxruntime

vraspar · 2026-03-06T22:51:25Z

Description

In Dequantize4BitsKernelReOrder (CPU and CUDA EP), values from the g_idx tensor are used directly as array indices into the scales and zero_points buffers without bounds checking. This PR adds value-range validation and tests for the g_idx input tensor in the MatMulNBits operator.
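To make the failure mode concrete, the following is a minimal host-side sketch of the indexing pattern described above. It assumes a row-major scales buffer of shape [n_rows, k_blocks]; the helper names `ScaleOffset` and `OffsetInBuffer` and the exact layout are illustrative assumptions, not code taken from the kernel.

```cpp
#include <cstdint>

// Hypothetical helper (not from the PR): offset into a row-major scales
// buffer of shape [n_rows, k_blocks], where the block is selected by a
// value read from g_idx. Signed arithmetic keeps negative indices visible.
inline std::int64_t ScaleOffset(std::int64_t n, std::int64_t k_blocks,
                                std::int32_t group) {
  return n * k_blocks + static_cast<std::int64_t>(group);
}

// True when the offset lands inside the [n_rows, k_blocks] buffer.
inline bool OffsetInBuffer(std::int64_t offset, std::int64_t n_rows,
                           std::int64_t k_blocks) {
  return offset >= 0 && offset < n_rows * k_blocks;
}
```

Under these assumptions, a group value of `k_blocks` on the last row, or any negative value on the first row, produces an offset outside the buffer. A too-large value on an earlier row can stay inside the buffer yet select a scale belonging to a different row, which is why the check is per value against `k_blocks` rather than against the buffer size.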

Motivation and Context

Copilot

Pull request overview

Adds input validation for the deprecated g_idx (group index) input to MatMulNBits to prevent out-of-bounds reads when it is used to index into per-block scales/zero_points, and adds regression tests to ensure invalid indices are rejected.

Changes:

Add range validation for group_index values in matmul_nbits_helper::CheckInputs ([0, k_blocks)).
Add unit tests that expect INVALID_ARGUMENT on negative and out-of-range g_idx values.
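The range check listed above can be sketched roughly as follows. This is a standalone approximation, not the code in matmul_nbits_helper.h: the `SketchStatus` struct, the function name `ValidateGroupIndex`, and the error wording are assumptions made for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical result type standing in for a status object.
struct SketchStatus {
  bool ok;
  std::string message;
};

// Rejects the first g_idx value outside the half-open range [0, k_blocks).
inline SketchStatus ValidateGroupIndex(const std::vector<std::int32_t>& g_idx,
                                       std::int64_t k_blocks) {
  for (std::size_t i = 0; i < g_idx.size(); ++i) {
    const std::int64_t v = g_idx[i];
    if (v < 0 || v >= k_blocks) {
      return {false, "g_idx[" + std::to_string(i) + "] = " + std::to_string(v) +
                         " is outside [0, " + std::to_string(k_blocks) + ")"};
    }
  }
  return {true, ""};
}
```

A check like this can only inspect values that are readable on the host. One of the later commit titles in this PR mentions ensuring a CPU device type before validating, which is consistent with that limitation.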

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| onnxruntime/contrib_ops/cpu/quantization/matmul_nbits_helper.h | Adds `g_idx` value-range validation in shared input checking used by CPU and CUDA MatMulNBits implementations. |
| onnxruntime/test/contrib_ops/matmul_4bits_test.cc | Adds two negative tests to verify invalid `g_idx` values are rejected. |

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

tianleiwu

Thanks for addressing this OOB read vulnerability — the CPU-side validation logic is well-structured with a clear error message. However, the CUDA EP path still has a gap in release builds.

See inline comments for details.

- Add rid clamping after CUDA_KERNEL_ASSERT in Dequantize4BitsKernelReOrder to prevent OOB memory access in release builds where the assert is a no-op
- Remove unnecessary #ifdef NDEBUG guard around InvalidGIdx tests since CUDA EP is already excluded via OpTester::Run() parameters

Addresses review feedback from tianleiwu.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
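The assert-then-clamp pattern described in this commit message can be illustrated with a host-side sketch. The macro `SKETCH_KERNEL_ASSERT` is a hypothetical stand-in for an assert that compiles to nothing in release builds, and `ClampGroupIndex` is an illustrative name; neither is taken from the CUDA source. The sketch also assumes `k_blocks >= 1`.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical stand-in for an assert that is a no-op in release builds.
#define SKETCH_KERNEL_ASSERT(cond) ((void)0)

// Returns an index guaranteed to lie in [0, k_blocks - 1].
// Assumes k_blocks >= 1. Because the assert above does nothing in this
// configuration, the clamp is the only thing keeping the index in range.
inline std::int32_t ClampGroupIndex(std::int32_t rid, std::int32_t k_blocks) {
  SKETCH_KERNEL_ASSERT(rid >= 0 && rid < k_blocks);
  const std::int32_t upper = k_blocks - 1;
  return std::max(std::min(rid, upper), std::int32_t{0});
}
```

Clamping trades a potential out-of-bounds read for an in-bounds read of a possibly wrong scale, so it acts as a last line of defense rather than a replacement for rejecting invalid input up front.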

tianleiwu

LGTM

Validate g_idx values in MatMulNBits to prevent OOB read

3ef5882

vraspar requested a review from Copilot March 6, 2026 23:00

Copilot started reviewing on behalf of vraspar March 6, 2026 23:01

tianleiwu reviewed Mar 6, 2026

Comment thread onnxruntime/contrib_ops/cpu/quantization/matmul_nbits_helper.h Outdated

Copilot AI reviewed Mar 6, 2026

Comment thread onnxruntime/test/contrib_ops/matmul_4bits_test.cc

yuslepukhin reviewed Mar 6, 2026

Comment thread onnxruntime/contrib_ops/cpu/quantization/matmul_nbits_helper.h Outdated

Enhance group_index validation in CheckInputs to ensure CPU device type and update test for out-of-range g_idx values

a24153b

vraspar requested review from Copilot and yuslepukhin March 24, 2026 21:17

Copilot started reviewing on behalf of vraspar March 24, 2026 21:18

Copilot AI reviewed Mar 24, 2026

Comment thread onnxruntime/contrib_ops/cpu/quantization/matmul_nbits_helper.h

Comment thread onnxruntime/test/contrib_ops/matmul_4bits_test.cc Outdated

Add validation for group_index in Dequantize4BitsKernelReOrder and update tests for out-of-range g_idx values

9788fb8

hariharans29 reviewed Mar 30, 2026

Comment thread onnxruntime/test/contrib_ops/matmul_4bits_test.cc

vraspar added 2 commits April 1, 2026 22:18

Exclude CUDA EP and skip InvalidGIdx tests in debug builds

7964701

Exclude OpenVINO EP from InvalidGIdx tests

f8627a3

tianleiwu requested changes Apr 3, 2026

Comment thread onnxruntime/contrib_ops/cpu/quantization/matmul_nbits_helper.h

Comment thread onnxruntime/contrib_ops/cuda/quantization/dequantize_blockwise_4bits.cu

Comment thread onnxruntime/test/contrib_ops/matmul_4bits_test.cc

tianleiwu approved these changes Apr 7, 2026

vraspar enabled auto-merge (squash) April 7, 2026 20:15

vraspar merged commit 127704c into main Apr 9, 2026
101 of 104 checks passed

vraspar deleted the vrasapar/matmulbits-g-idx branch April 9, 2026 18:06

vraspar mentioned this pull request Apr 10, 2026

Validate token_id bounds in NGramRepeatBlock to prevent OOB write #28039

Merged

vraspar merged 6 commits into main from vrasapar/matmulbits-g-idx

5 participants