[FIX_FOR_VLLM_CUSTOM=0a54df28471be07b3d668ea21c5e411569d3baea] Fix DynamicNTKScalingRotaryEmbedding and HPUCompressedTensorsConfig by pawel-olejniczak · Pull Request #1479 · vllm-project/vllm-gaudi

pawel-olejniczak · 2026-05-21T20:34:39Z

Root cause

Upstream vLLM at SHA 0a54df28 introduced two API changes that broke vllm-gaudi:

PR Fix error in Dynamic NTK scaling vllm#41277 added a required max_trained_positions parameter to DynamicNTKScalingRotaryEmbedding.__init__(), causing the unit test to fail with TypeError.
PR Remove additional dead code as a follow-up to #42889 vllm#43144 removed sparsity_scheme_map and sparsity_ignore_list from CompressedTensorsConfig.__init__(), causing HPUCompressedTensorsConfig instantiation to fail during e2e tests.

Upstream PR

vllm-project/vllm#41277
Added max_trained_positions to DynamicNTKScalingRotaryEmbedding

vllm-project/vllm#43144
Removed sparsity parameters from CompressedTensorsConfig

Fix

Add max_trained_positions parameter to the rotary embedding unit test.
Remove stale sparsity_scheme_map and sparsity_ignore_list from HPUCompressedTensorsConfig init signature and super() call, plus the unused SparsityCompressionConfig import.

…or upstream vllm@0a54df28 Root cause: Upstream vLLM PRs #41277 and #43144 changed APIs in DynamicNTKScalingRotaryEmbedding (added max_trained_positions) and CompressedTensorsConfig (removed sparsity params). Upstream: vllm-project/vllm#41277, vllm-project/vllm#43144 Fix: Add max_trained_positions to rotary embedding test; remove stale sparsity_scheme_map and sparsity_ignore_list from HPUCompressedTensorsConfig init. Signed-off-by: Paweł Olejniczak <pawelx.olejniczak@intel.com>

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

This PR appears to remove sparsity-compression configuration plumbing from the Gaudi compressed-tensors path and updates a rotary-embedding unit test to include a max_trained_positions field.

Changes:

Removed SparsityCompressionConfig import and sparsity-related __init__ parameters/forwarding in hpu_compressed_tensors.py.
Updated rotary-embedding test config to include max_trained_positions.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
vllm_gaudi/ops/hpu_compressed_tensors.py	Drops sparsity config types and constructor parameters from the HPU compressed-tensors integration.
tests/unit_tests/ops/test_hpu_rotary_embedding.py	Adds `max_trained_positions` to the test configuration for dynamic NTK scaling.

github-actions · 2026-05-22T10:48:18Z

✅ CI Passed

All checks passed successfully against the following vllm commit:
0a54df28471be07b3d668ea21c5e411569d3baea

Copilot AI review requested due to automatic review settings May 21, 2026 20:34

pawel-olejniczak requested review from PatrykWo, adobrzyn, afierka-intel, iboiko-habana, jbyczkow, kamil-kaczor, ksmusz, mgawarkiewicz-intel, michalkuligowski and xuechendi as code owners May 21, 2026 20:34

pawel-olejniczak temporarily deployed to pre-merge-approval May 21, 2026 20:34 — with GitHub Actions Inactive

Copilot AI reviewed May 21, 2026

View reviewed changes

Comment thread vllm_gaudi/ops/hpu_compressed_tensors.py

Comment thread vllm_gaudi/ops/hpu_compressed_tensors.py

Comment thread tests/unit_tests/ops/test_hpu_rotary_embedding.py

github-actions Bot mentioned this pull request May 21, 2026

🚦 Team Review Dashboard #701

Open

iboiko-habana approved these changes May 22, 2026

View reviewed changes

iboiko-habana merged commit 7b7bc8f into vllm-project:main May 22, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FIX_FOR_VLLM_CUSTOM=0a54df28471be07b3d668ea21c5e411569d3baea] Fix DynamicNTKScalingRotaryEmbedding and HPUCompressedTensorsConfig#1479

[FIX_FOR_VLLM_CUSTOM=0a54df28471be07b3d668ea21c5e411569d3baea] Fix DynamicNTKScalingRotaryEmbedding and HPUCompressedTensorsConfig#1479
iboiko-habana merged 1 commit into
vllm-project:mainfrom
pawel-olejniczak:fix/upstream-api-compat-0a54df28-v2

pawel-olejniczak commented May 21, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented May 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pawel-olejniczak commented May 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Root cause

Upstream PR

Fix

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented May 22, 2026

✅ CI Passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pawel-olejniczak commented May 21, 2026 •

edited

Loading