[ROCm][CI] Fix TP size issue for test_gpt_oss #35887

Merged
gshtras merged 1 commit into vllm-project:main from ROCm:micah/gpt-oss-tp-size on Mar 3, 2026

Conversation

@micah-wil (Contributor) commented on Mar 3, 2026

The Quantized Models Test is allocated to a 1-GPU agent pool in CI, but it tries to run multi-GPU tests (example: https://buildkite.com/vllm/amd-ci/builds/5699/steps/canvas?sid=019cb28b-7107-44a7-adde-1af22fb4f7b7&tab=output#019cb28b-71fb-4bda-bc58-43ef57384abc/L1654)

This PR skips the multi-GPU test cases when there are not enough GPUs available (see the sketch below).

Signed-off-by: Micah Williamson <micah.williamson@amd.com>
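
For reference, a minimal sketch of the kind of guard this PR adds, assuming a hypothetical test function and TP size; the actual parameterization in test_gpt_oss may differ:

import pytest

from vllm.utils.torch_utils import cuda_device_count_stateless

# Hypothetical tensor-parallel size, used only for illustration.
TP_SIZE = 2

@pytest.mark.skipif(
    cuda_device_count_stateless() < TP_SIZE,
    reason="Not enough GPUs available for this tensor-parallel test case.",
)
def test_gpt_oss_multi_gpu():
    ...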
@mergify bot added the gpt-oss (Related to GPT-OSS models) and rocm (Related to AMD ROCm) labels on Mar 3, 2026
@github-project-automation bot moved this to Todo in AMD on Mar 3, 2026

@gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

The pull request effectively addresses the issue of multi-GPU tests failing in single-GPU CI environments by conditionally skipping these tests when insufficient GPUs are available. This is a practical fix that improves CI stability and efficiency. The implementation correctly uses cuda_device_count_stateless to determine available resources.

import pytest
from packaging import version

from vllm.utils.torch_utils import cuda_device_count_stateless

Severity: high

According to PEP 8, imports should generally be grouped in the following order: standard library imports, third-party imports, and then local application/library specific imports. The vllm.utils.torch_utils import is a local application import and should be placed after packaging.version to maintain consistency with common Python style guidelines.

import pytest
from packaging import version

from vllm.utils.torch_utils import cuda_device_count_stateless
References
  1. Imports should be grouped in the following order: standard library, third-party, and local application/library specific imports. Each group should be separated by a blank line. (link)
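
For illustration, the three groups the guideline describes, with a hypothetical standard-library import (os) added so that all three tiers appear, each separated by a blank line:

import os  # standard library

import pytest  # third-party
from packaging import version

from vllm.utils.torch_utils import cuda_device_count_stateless  # local application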

@github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements on Mar 3, 2026
@gshtras enabled auto-merge (squash) on March 3, 2026 at 20:10
@github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Mar 3, 2026
@gshtras merged commit e721300 into vllm-project:main on Mar 3, 2026
16 of 17 checks passed
@github-project-automation bot moved this from Todo to Done in AMD on Mar 3, 2026
@micah-wil deleted the micah/gpt-oss-tp-size branch on March 4, 2026 at 16:23
Copilot AI pushed a commit to machov/vllm that referenced this pull request Mar 10, 2026
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
avinashsingh77 pushed a commit to avinashsingh77/vllm that referenced this pull request Mar 12, 2026
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
wendyliu235 pushed a commit to wendyliu235/vllm-public that referenced this pull request Mar 18, 2026
Signed-off-by: Micah Williamson <micah.williamson@amd.com>

Labels

gpt-oss: Related to GPT-OSS models
ready: ONLY add when PR is ready to merge/full CI is needed
rocm: Related to AMD ROCm

Projects

Status: Done


2 participants