Refactor long-context + LoRA flow by scsudhak-intel · Pull Request #807 · HabanaAI/vllm-fork

scsudhak-intel · 2025-02-10T05:44:08Z

This PR refactors long-context + LoRA flow to align with the upstream main branch vllm-project#12812.

HPU requires special handling while creating long_lora_offsets_tensor in convert_mapping. (refer)

As suggested by the vllm team this PR sets long_lora_context to None while calling convert_mapping. This avoids HPU specific conditions inside convert_mapping and explicitly handles HPU long-lora logic inside overrided _update_base_metadata.

vivekgoe

LGTM!

Refactor long-context + LoRA flow

b0a4e82

scsudhak-intel marked this pull request as ready for review February 10, 2025 05:50

scsudhak-intel requested review from afierka-intel, kzawora-intel, madamczyk-intel, mgawarkiewicz, michalkuligowski and vivekgoe as code owners February 10, 2025 05:50

Merge branch 'habana_main' into lora-long_context

873bb75

vivekgoe approved these changes Feb 11, 2025

View reviewed changes

vivekgoe merged commit 36c7676 into habana_main Feb 11, 2025
27 of 32 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor long-context + LoRA flow#807

Refactor long-context + LoRA flow#807
vivekgoe merged 2 commits intohabana_mainfrom
lora-long_context

scsudhak-intel commented Feb 10, 2025 •

edited by github-actions bot

Loading

Uh oh!

vivekgoe left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

scsudhak-intel commented Feb 10, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vivekgoe left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

scsudhak-intel commented Feb 10, 2025 •

edited by github-actions bot

Loading