Skip to content

[CI] Migrate more B200 jobs to b200-k8s queue#42356

Merged
vllm-bot merged 2 commits into
mainfrom
ci/b200-k8s-migration
May 12, 2026
Merged

[CI] Migrate more B200 jobs to b200-k8s queue#42356
vllm-bot merged 2 commits into
mainfrom
ci/b200-k8s-migration

Conversation

@khluu
Copy link
Copy Markdown
Member

@khluu khluu commented May 12, 2026

Summary

  • Migrate 4 validated device: b200 jobs to device: b200-k8s across 3 test area configs
  • Affected jobs:
    • kernels-fusedmoe-layer-test-2-b200s (kernels.yaml)
    • moe-refactor-integration-test-b200-dp-temporary (lm_eval.yaml)
    • gpqa-eval-gpt-oss-b200 (lm_eval.yaml)
    • Spec Decode MTP hybrid (spec_decode.yaml)
  • All 4 jobs were validated on the b200-k8s queue in build #65711
  • The remaining 3 B200 jobs are migrated with test fixes in [CI] Migrate remaining B200 jobs to b200-k8s with test fixes #42387

Test plan

  • Triggered build with NOAUTO=1, unblocked all B200 jobs
  • All 4 jobs in this PR passed on b200-k8s queue

🤖 Generated with Claude Code

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Signed-off-by: khluu <khluu000@gmail.com>
@khluu khluu requested a review from Harry-Chen as a code owner May 12, 2026 00:59
Copy link
Copy Markdown

@claude claude Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This repository is configured for manual code reviews. Comment @claude review to trigger a review and subscribe this PR to future pushes, or @claude review once for a one-time review.

Tip: disable this comment in your organization's Code Review settings.

@mergify mergify Bot added the ci/build label May 12, 2026
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates the Buildkite configuration files across several test areas, including kernels, language model evaluation, and speculative decoding, by changing the target device from 'b200' to 'b200-k8s'. I have no feedback to provide as there were no review comments.

Revert 3 jobs that have pre-existing test failures back to the
old b200 queue. These will be migrated in a follow-up PR with
test fixes:
- Spec Decode Eagle: head_dim 192 unsupported on SM100
- Spec Decode Speculators+MTP: CUDA graph hang on DeepSeek
- LM Eval Small Models: server crashes on large models

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Signed-off-by: khluu <khluu000@gmail.com>
@khluu khluu changed the title [CI] Migrate remaining B200 jobs to b200-k8s queue [CI] Migrate more B200 jobs to b200-k8s queue May 12, 2026
@vllm-bot vllm-bot merged commit f69644c into main May 12, 2026
8 checks passed
@vllm-bot vllm-bot deleted the ci/b200-k8s-migration branch May 12, 2026 07:38
weifang231 pushed a commit to weifang231/eb-vllm that referenced this pull request May 13, 2026
mfylcek pushed a commit to mfylcek/vllm that referenced this pull request May 19, 2026
jhu960213 pushed a commit to jhu960213/vllm that referenced this pull request May 20, 2026
h1t35h pushed a commit to h1t35h/vllm that referenced this pull request May 21, 2026
mvanhorn pushed a commit to mvanhorn/vllm that referenced this pull request Jun 4, 2026
Signed-off-by: khluu <khluu000@gmail.com>
Signed-off-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants