Skip to content

Refactor long-context + LoRA flow#807

Merged
vivekgoe merged 2 commits intohabana_mainfrom
lora-long_context
Feb 11, 2025
Merged

Refactor long-context + LoRA flow#807
vivekgoe merged 2 commits intohabana_mainfrom
lora-long_context

Conversation

@scsudhak-intel
Copy link

@scsudhak-intel scsudhak-intel commented Feb 10, 2025

This PR refactors long-context + LoRA flow to align with the upstream main branch vllm-project#12812.

HPU requires special handling while creating long_lora_offsets_tensor in convert_mapping. (refer)

As suggested by the vllm team this PR sets long_lora_context to None while calling convert_mapping. This avoids HPU specific conditions inside convert_mapping and explicitly handles HPU long-lora logic inside overrided _update_base_metadata.

Copy link

@vivekgoe vivekgoe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@vivekgoe vivekgoe merged commit 36c7676 into habana_main Feb 11, 2025
27 of 32 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants