[ROCm][CI] Fix accuracy for llama-nemotron-vl pooling tests#37613
DarkLight1337 merged 5 commits into vllm-project:main from
Conversation
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Testing MI250 to see if issue is resolved (added
Code Review
This pull request addresses an accuracy issue for llama-nemotron-vl pooling tests on ROCm by generalizing the patch to force SDPA for vision encoders. The changes refactor patch_hf_vision_attn_for_rocm to support more model architectures and apply this patch in the relevant tests. Additionally, the relative tolerance for test assertions is increased for ROCm to account for numerical differences. My feedback includes a suggestion to improve the robustness of the patching logic to prevent potential errors with different model structures in the future.
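For context, forcing SDPA on a Hugging Face model comes down to flipping the `_attn_implementation` field on the relevant config. A minimal sketch, with a hypothetical stand-in config class (the real patch lives in `patch_hf_vision_attn_for_rocm` in `tests/models/multimodal/conftest.py` and may differ in detail):

```python
def force_sdpa(config) -> None:
    """Switch a HF-style config to the SDPA attention backend.

    HF configs record the chosen backend in `_attn_implementation`;
    "sdpa" routes attention through
    torch.nn.functional.scaled_dot_product_attention.
    """
    if hasattr(config, "_attn_implementation"):
        config._attn_implementation = "sdpa"


# Hypothetical stand-in for a vision encoder config:
class FakeVisionConfig:
    _attn_implementation = "flash_attention_2"


cfg = FakeVisionConfig()
force_sdpa(cfg)
print(cfg._attn_implementation)  # sdpa
```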
tests/models/multimodal/conftest.py
Outdated
```python
if hasattr(inner, "vision_embedding"):
    vit = inner.vision_embedding[0]
    for layer in vit.encoder.layers:
        if hasattr(layer, "self_attn"):
            layer.self_attn.vision_config._attn_implementation = "sdpa"
    _patch_encoder_layers(vit.encoder)
```
The current implementation assumes that inner.vision_embedding is a non-empty list and that its first element has an encoder attribute. This could lead to IndexError or AttributeError if a model has a vision_embedding attribute with a different structure. To make this patch more robust and prevent future test failures, it's better to add checks for the list's existence and content, as well as for the presence of the encoder attribute.
Suggested change:

```diff
-if hasattr(inner, "vision_embedding"):
-    vit = inner.vision_embedding[0]
-    for layer in vit.encoder.layers:
-        if hasattr(layer, "self_attn"):
-            layer.self_attn.vision_config._attn_implementation = "sdpa"
-    _patch_encoder_layers(vit.encoder)
+if hasattr(inner, "vision_embedding") and inner.vision_embedding:
+    vit = inner.vision_embedding[0]
+    if hasattr(vit, "encoder"):
+        _patch_encoder_layers(vit.encoder)
```
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Test group confirmed passing: https://buildkite.com/vllm/amd-ci/builds/6723/steps/canvas?sid=019d09de-226d-4565-8db4-bd4f91370f0d&tab=output
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
…ject#37613) Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Follow-up for:
Fixes a small accuracy diff, caused by differences between the HF and vLLM attention backends on ROCm, in `mi250_1: Multi-Modal Models (Extended Pooling)`.

Motivation: https://buildkite.com/vllm/amd-ci/builds/6701/steps/canvas?sid=019d07a7-1a1e-445a-8480-1feaf029a19d&tab=output
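The tolerance bump amounts to selecting a looser `rtol` when the tests run on ROCm. A sketch with illustrative values (the actual tolerances and the platform check used by the test suite may differ):

```python
import math

ON_ROCM = True  # in vLLM this would come from a platform check

# SDPA on ROCm accumulates slightly different rounding than the HF
# reference path, so the relative tolerance is relaxed there
# (values below are illustrative, not the ones used in CI).
RTOL = 1e-2 if ON_ROCM else 1e-3


def pooling_close(a: float, b: float) -> bool:
    return math.isclose(a, b, rel_tol=RTOL)


print(pooling_close(0.8123, 0.8171))  # True under the relaxed rtol
```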
cc @kenroche