[kv_offload+HMA][6/N]: Split offloading_connector.py#37405

Merged
NickLucche merged 1 commit into vllm-project:main from orozery:offloading-connector-split on Mar 18, 2026

@orozery (Collaborator) commented Mar 18, 2026

offloading_connector.py is quite long and contains multiple logically independent classes.
This PR splits it into three files: a scheduler-side connector, a worker-side connector, and metrics.
This prepares for additional code that will be added to the offloading connector and will further increase its line count.

offloading_connector.py is quite long and contains multiple logically independent classes.
This commit splits it into three files: a scheduler-side connector, a worker-side connector, and metrics.

Signed-off-by: Or Ozeri <oro@il.ibm.com>
@orozery orozery force-pushed the offloading-connector-split branch from ed2160f to 51ffdc3 Compare March 18, 2026 09:44
@orozery orozery changed the title [kv_offload+HMA][5/N]: Split offloading_connector.py [kv_offload+HMA][6/N]: Split offloading_connector.py Mar 18, 2026
@gemini-code-assist bot (Contributor) left a comment

Code Review

The pull request successfully refactors the offloading_connector.py file by splitting its functionality into dedicated common, metrics, scheduler, and worker modules. This significantly improves modularity and maintainability, making the codebase easier to understand and extend. The changes align well with the stated goal of preparing for additional code. However, a critical bug was identified in the metrics.py file where OffloadingOperationMetrics dataclass instances are incorrectly accessed as dictionaries, which will lead to runtime errors.

@orozery (Collaborator, Author) commented Mar 18, 2026

> However, a critical bug was identified in the metrics.py file where OffloadingOperationMetrics dataclass instances are incorrectly accessed as dictionaries, which will lead to runtime errors.

This is a false positive.
The code is untouched by this PR; it was only moved.
These are serialized structs, so they are dictionaries rather than dataclasses.
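
To illustrate the point about serialized structs, here is a minimal sketch of how a dataclass instance becomes a plain dictionary once serialized, so that key access is the correct way to read it. The class name `OperationMetricsSketch`, its fields `op_size` and `op_time`, and the helper functions are hypothetical stand-ins chosen for illustration; they are not taken from vLLM's metrics.py, and the use of `dataclasses.asdict` is an assumption about how such serialization might happen, not a description of the actual code path.

```python
from dataclasses import asdict, dataclass


@dataclass
class OperationMetricsSketch:
    """Hypothetical stand-in for a per-operation metrics record."""

    op_size: int    # bytes moved by the operation (hypothetical field)
    op_time: float  # seconds the operation took (hypothetical field)


def serialize(records):
    """Turn dataclass instances into plain dicts, as a serializer would."""
    return [asdict(record) for record in records]


def total_bytes(serialized):
    """Aggregate over serialized records, which are dicts, using key access."""
    return sum(item["op_size"] for item in serialized)


records = [
    OperationMetricsSketch(op_size=1024, op_time=0.5),
    OperationMetricsSketch(op_size=2048, op_time=0.25),
]
serialized = serialize(records)
total = total_bytes(serialized)
```

In this sketch, `serialized[0]["op_size"]` works while `serialized[0].op_size` would raise `AttributeError`, which is why dictionary-style access on the receiving side is not a bug when the records arrive in serialized form.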

@orozery added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Mar 18, 2026
@NickLucche NickLucche merged commit 525f2ee into vllm-project:main Mar 18, 2026
54 checks passed
fxdawnn pushed a commit to fxdawnn/vllm that referenced this pull request Mar 19, 2026
SouthWest7 pushed a commit to SouthWest7/vllm that referenced this pull request Mar 27, 2026
khairulkabir1661 pushed a commit to khairulkabir1661/vllm that referenced this pull request Mar 27, 2026
Monishver11 pushed a commit to Monishver11/vllm that referenced this pull request Mar 27, 2026
JiantaoXu pushed a commit to JiantaoXu/vllm that referenced this pull request Mar 28, 2026
vrdn-23 pushed a commit to vrdn-23/vllm that referenced this pull request Mar 30, 2026
EricccYang pushed a commit to EricccYang/vllm that referenced this pull request Apr 1, 2026

Labels

kv-connector, ready, v1

2 participants