Commit 00f1347
committed
OffloadingConnector: Fix GPU block tracking bug
This commit fixes a bug in the offloading connector that may result
in incorrect GPU block tracking per request.
It occurs when blocks cannot be allocated on the offloaded medium (prepare_store fails),
and the scheduler output has multiple requests, some of them with new GPU block IDs.
Before this commit, the connector simply returned without processing the rest of the requests,
and their GPU block IDs.
Signed-off-by: Or Ozeri <[email protected]>1 parent 70fbdb2 commit 00f1347
File tree
1 file changed
+3
-2
lines changed- vllm/distributed/kv_transfer/kv_connector/v1
1 file changed
+3
-2
lines changedLines changed: 3 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
278 | 278 | | |
279 | 279 | | |
280 | 280 | | |
281 | | - | |
282 | | - | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
283 | 284 | | |
284 | 285 | | |
285 | 286 | | |
| |||
0 commit comments