[CI] Enable all hf transformers baselines in test_hybrid #23936
Merged
heheda12345 merged 6 commits into vllm-project:main on Sep 2, 2025
Conversation
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Contributor
Code Review
This pull request enables Hugging Face Transformers baselines for all hybrid models in the test suite. This is made possible by a recent fix in transformers v4.55.3 that resolves issues with Mamba-related models. The changes involve removing the HF_UNSUPPORTED_MODELS list and updating the conditions in tests to always run the baseline comparison. Additionally, the minimum required transformers version for BambaForCausalLM and JambaForCausalLM has been correctly updated to 4.55.3. The changes are straightforward, correct, and improve test coverage.
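For illustration, a minimal sketch of the pattern this removes (the helper names and the denylist contents are assumptions for the sketch, not the actual diff):

```python
# Illustrative sketch only; helper names and list contents are assumptions.
from typing import Optional

# Before: Mamba-related models were excluded from the HF baseline.
HF_UNSUPPORTED_MODELS = ["ibm-ai-platform/Bamba-9B", "ai21labs/Jamba-tiny-dev"]

def run_hf_model(model: str, prompts: list[str]) -> list[str]:
    # Stand-in for generating with HF transformers; details omitted.
    return [f"{model}: {p}" for p in prompts]

def maybe_run_hf_baseline(model: str, prompts: list[str]) -> Optional[list[str]]:
    if model in HF_UNSUPPORTED_MODELS:
        return None  # baseline skipped under transformers < 4.55.3
    return run_hf_model(model, prompts)

# After: the denylist is deleted and the baseline always runs, relying on
# the Mamba fix shipped in transformers >= 4.55.3.
def run_hf_baseline(model: str, prompts: list[str]) -> list[str]:
    return run_hf_model(model, prompts)
```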
heheda12345 (Collaborator) reviewed on Aug 29, 2025 and left a comment
Can you also remove the `if hf_outputs is not None` checks?
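A hedged sketch of what that cleanup looks like (function names are assumptions, not the actual test code): once the baseline always runs, the guard around the comparison can go.

```python
# Sketch only; names are assumptions, not the actual test code.

# Before: the comparison was guarded because hf_outputs could be None
# for models on the old denylist.
def check_outputs_guarded(hf_outputs, vllm_outputs) -> None:
    if hf_outputs is not None:
        assert hf_outputs == vllm_outputs

# After: hf_outputs is always produced, so the check is unconditional.
def check_outputs(hf_outputs, vllm_outputs) -> None:
    assert hf_outputs == vllm_outputs
```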
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Author (Member)
@heheda12345 done
845473182 pushed a commit to 845473182/vllm that referenced this pull request on Sep 3, 2025
* 'main' of https://github.com/845473182/vllm: (457 commits)
  * [BugFix] Fix routed_scaling_factor double mul for dots1 and glm4 MoE models (vllm-project#24132)
  * [Misc] Add check for dual_chunk_attention (vllm-project#24070)
  * [Doc]: fix typos in Python comments (vllm-project#24115)
  * [Doc]: fix typos in Python comments (vllm-project#24093)
  * [Compile] Fix Compile Warning for `w4a8_mm_entry.cu` (vllm-project#23660)
  * fix some typos (vllm-project#24071)
  * [V1] Wrapper which plumbs request-level logits processors into vLLM batch-level logits processing (vllm-project#23656)
  * Upgrade xgrammar to 0.1.23 (vllm-project#22988)
  * Update release pipeline post PyTorch 2.8.0 update (vllm-project#24073)
  * [XPU] Fix the bug of LoRA logits on the XPU platform (vllm-project#24081)
  * [CI/Build] Disable SiluMul NVFP4 quant fusion tests (vllm-project#24121)
  * [Bug] R1 Accuracy: Fix `routed_scaling_factor` Double Mul Issue (vllm-project#24119)
  * [AMD][Kernel][Bugfix] Cast offsets tensor bn to tl.int64 to avoid GPU segfault (vllm-project#23692)
  * [CI] Enable all hf transformers baselines in test_hybrid (vllm-project#23936)
  * [Log] Only Print Profiler Results on Rank 0 (vllm-project#23370)
  * Fix weights loading for Apertus (vllm-project#24100)
  * [Metrics] Deprecate TPOT in favor of ITL (vllm-project#24110)
  * [Bugfix] Fix packed_factor missing attribute error (vllm-project#23902)
  * Run ruff format on a few files. (vllm-project#24075)
  * [Bugfix] Fix transform_config parsing in Compressed Tensors (vllm-project#23945)
  * ...
eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request on Sep 9, 2025
[CI] Enable all hf transformers baselines in test_hybrid (vllm-project#23936)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request on Sep 25, 2025
[CI] Enable all hf transformers baselines in test_hybrid (vllm-project#23936)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Purpose
HF transformers recently released v4.55.3, which contains a fix for the Mamba-related issues that prevented us from comparing against transformers as a baseline in the hybrid model tests. I also checked that the two models we listed in `HF_UNSUPPORTED_MODELS` now seem to work fine. This is a useful step towards removing V0 code: once V0 is gone, we will no longer be able to use V0 output as a baseline for the V1 output, so we need to be able to rely on transformers for that.
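A minimal sketch of what the version bump for `BambaForCausalLM` and `JambaForCausalLM` might look like in the test model registry (the dataclass and field names here are assumptions modeled on vLLM's test registry, and the model IDs are assumptions too, not the exact diff):

```python
# Sketch only; the dataclass, field names, and model IDs are assumptions.
from dataclasses import dataclass
from typing import Optional

@dataclass
class HfExampleInfo:
    default: str                               # HF model ID used in tests
    min_transformers_version: Optional[str] = None

TEST_MODELS = {
    # Floor raised to 4.55.3, which carries the Mamba fix.
    "BambaForCausalLM": HfExampleInfo(
        "ibm-ai-platform/Bamba-9B", min_transformers_version="4.55.3"),
    "JambaForCausalLM": HfExampleInfo(
        "ai21labs/Jamba-tiny-dev", min_transformers_version="4.55.3"),
}
```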
cc @heheda12345
Test Plan
I will trigger the Hybrid test in CI.
Test Result
Passing.
Essential Elements of an Effective PR Description Checklist
(Optional) The necessary documentation update, such as updating `supported_models.md` and `examples` for a new model.