Validate seqlens_k against cos_cache bounds in GroupQueryAttention to… by apsonawane · Pull Request #28277 · microsoft/onnxruntime

apsonawane · 2026-04-29T18:59:32Z

Description

Validate seqlens_k values against cos_cache.shape[0] in GroupQueryAttention::Compute() when do_rotary is enabled, to prevent out-of-bounds reads in the rotary embedding lookup.

Root Cause

CheckRotaryCaches() validates cos_cache.shape[0] >= total_sequence_length, but runtime position IDs are derived from seqlens_k (a separate, per-batch input). An attacker can set total_sequence_length small enough to pass the guard while setting seqlens_k[b] far beyond cos_cache.shape[0], causing position_id = seqlens_k[b] to index out of bounds into the cos/sin cache. The resulting heap bytes are used as rotation values and propagate into the inference output.

Fix

Add an explicit bounds check in Compute() that rejects any seqlens_k[b] >= cos_cache.shape[0] before position IDs are computed. This is defense-in-depth alongside the existing RunRotaryEmbedding position_ids validation added in #27597.

Security

Impact: Heap OOB read (CWE-125) — adjacent heap memory leaks into inference output via cos/sin rotation values.
Attack vector: Any GQA-based LLM serving endpoint (Llama, Phi, Mistral) that accepts seqlens_k as an inference input. No model modification required.

Testing

Verified that crafted inputs with seqlens_k exceeding cos_cache dimensions now return INVALID_ARGUMENT instead of silently producing results containing leaked heap data.

… prevent rotary embedding OOB read

Copilot

Pull request overview

This PR hardens the CPU GroupQueryAttention implementation by validating runtime seqlens_k values against the rotary embedding cache length when do_rotary is enabled, preventing out-of-bounds reads from the cos/sin caches that could leak heap data into inference outputs.

Changes:

Add a per-batch bounds check in GroupQueryAttention::Compute() rejecting seqlens_k[b] >= cos_cache.shape[0] under do_rotary_.
Return INVALID_ARGUMENT with a descriptive error when the bound is violated.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

github-actions

You can commit the suggested changes from lintrunner.

github-actions

You can commit the suggested changes from lintrunner.

vraspar

Nice security fix -- the gap between CheckRotaryCaches (validates against total_sequence_length) and the actual position ID derivation (from seqlens_k) is a real and subtle bug. Good catch. A few non-blocking nits below.

Nit 1: CUDA EP coverage — This check only protects the CPU EP. The CUDA GQA kernel also uses do_rotary and derives position IDs similarly. Consider moving this validation into CheckInputs() in group_query_attention_helper.h (around line 282, after CheckRotaryCaches) so all EPs get the protection in one place. That said, seqlens_k may be on GPU in the CUDA path making a host-side loop infeasible, so the current placement is pragmatic.

Nit 2: Negative seqlens_k values — The condition seqlens_k_data[b] >= rotary_cache_max_seq will let negative values through, which would also produce OOB position IDs. Adding seqlens_k_data[b] < 0 to the check would catch that here with a clear error message. Likely already caught downstream by the RunRotaryEmbedding validation from #27597, so not urgent.

Nit 3: Multi-batch test — The two test cases cover the key scenarios well. A possible addition: a batch_size=2 test where seqlens_k = {3, 10} (one valid, one OOB) to verify the loop iterates correctly and the error message includes the right batch index (seqlens_k[1] = 10).

Overall this is clean, well-scoped, and well-documented. The error messages are descriptive and the tests directly exercise the vulnerability path.

Validate seqlens_k against cos_cache bounds in GroupQueryAttention to…

2818327

… prevent rotary embedding OOB read

vraspar requested a review from Copilot April 29, 2026 19:03

vraspar reviewed Apr 29, 2026

View reviewed changes

Comment thread onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc Outdated

Copilot started reviewing on behalf of vraspar April 29, 2026 19:10 View session

Copilot AI reviewed Apr 29, 2026

View reviewed changes

Comment thread onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc Outdated

Comment thread onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc Outdated

Comment thread onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc Outdated

Address comments

caca09b

github-actions Bot reviewed Apr 29, 2026

View reviewed changes

Comment thread onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc Outdated

Comment thread onnxruntime/test/contrib_ops/group_query_attention_op_test.cc Outdated

apsonawane added 3 commits April 29, 2026 17:27

Fix unit tests

e2d30d1

Fix lint errors

d049874

Fix tests

a8f6fd1

github-actions Bot reviewed Apr 30, 2026

View reviewed changes

Comment thread onnxruntime/test/contrib_ops/group_query_attention_op_test.cc Outdated

Fix lint

8b96eca

vraspar reviewed May 1, 2026

View reviewed changes

apsonawane added 2 commits May 1, 2026 13:58

Address comments

aebe7dd

Address comments

7fc1973

apsonawane enabled auto-merge (squash) May 4, 2026 19:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validate seqlens_k against cos_cache bounds in GroupQueryAttention to…#28277

Validate seqlens_k against cos_cache bounds in GroupQueryAttention to…#28277
apsonawane wants to merge 8 commits intomainfrom
asonawane/embedlookup

apsonawane commented Apr 29, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

vraspar left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

apsonawane commented Apr 29, 2026

Description

Root Cause

Fix

Security

Testing

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vraspar left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants