Missing updates for Llama4 on main #940
Conversation
Signed-off-by: Luca Calabria <luca.calabria@intel.com>
Pull request overview
This PR ports Llama4-specific fixes from PRs #881, #862, and #884 to the main branch, focusing on improvements to attention scaling and chunked attention layer handling.
Changes:
- Updated the `_get_attn_scale_for_hpu` implementation to remove the closure dependency and match the actual attention scale calculation (sketched below)
- Refactored chunked attention layer detection into a standalone function and changed the signature of `apply_model_specific_patches` to accept `model_runner` instead of `model`
- Consolidated model-specific patches by removing the duplicate `maybe_set_chunked_attention_layers` method from the class
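For illustration, here is a minimal sketch of what removing a closure dependency from the attention scale helper might look like. The names (`make_attn_scale_fn`, `get_attn_scale_for_hpu`) and the temperature-tuning formula are assumptions for this example, not the PR's actual code:

```python
import math


def make_attn_scale_fn(floor_scale: float, attn_scale: float):
    """Before: a factory returning a closure that captures its config."""

    def _get_attn_scale(position: int) -> float:
        # The captured floor_scale/attn_scale live in the closure,
        # which makes the helper harder to test and patch.
        return math.log(math.floor((position + 1) / floor_scale) + 1) * attn_scale + 1.0

    return _get_attn_scale


def get_attn_scale_for_hpu(position: int, floor_scale: float,
                           attn_scale: float) -> float:
    """After: a standalone function with all inputs passed explicitly."""
    return math.log(math.floor((position + 1) / floor_scale) + 1) * attn_scale + 1.0
```

Passing the scale parameters explicitly keeps the HPU path's calculation in lockstep with the model's own formula and avoids stale captured state.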
```python
# add explicit warning
pass
```
The comment 'add explicit warning' suggests that an exception handler should log a warning, but the current implementation silently ignores exceptions. Consider adding a proper warning message using a logger to help with debugging when chunked attention setup fails.
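A minimal sketch of what such a handler could look like, using the standard library logger; the `_set_chunked_attention_layers` helper name is hypothetical:

```python
import logging

logger = logging.getLogger(__name__)


def maybe_set_chunked_attention_layers(model_runner) -> None:
    """Best-effort chunked attention setup that logs instead of swallowing errors."""
    try:
        _set_chunked_attention_layers(model_runner)  # hypothetical helper
    except Exception as exc:
        # Surface the failure so a broken chunked-attention setup is
        # visible during debugging, then fall back to the default path.
        logger.warning(
            "Chunked attention layer setup failed, continuing without it: %s",
            exc)
```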
Signed-off-by: Luca Calabria <luca.calabria@intel.com>
✅ CI Passed. All checks passed successfully against the following vllm commit:
Added missing Llama4 fixes from #881, #862, and #884 on the main branch.