[BugFix] Patch inductor memory plan logic#26878
DarkLight1337 merged 5 commits into vllm-project:main
Signed-off-by: Boyuan Feng <boyuan@meta.com>
Code Review
This pull request introduces a monkey patch for a PyTorch 2.9.0 issue related to inductor's memory planning, which was causing test_multi_graph_piecewise_compile to fail. The patch is applied conditionally for PyTorch 2.9.0. The changes also include enabling the failing test for PyTorch versions 2.9.0.dev and newer, assuming the fix is present in those versions. While the overall approach is sound for a temporary workaround, I've found a critical issue in the version comparison logic that could lead to the patch being applied to incorrect PyTorch versions.
commit 6b0c3c3
Author: Boyuan Feng <boyuan@meta.com>
Date: Tue Oct 14 21:30:29 2025 -0700

    nit

    Signed-off-by: Boyuan Feng <boyuan@meta.com>

commit 1016467
Author: Boyuan Feng <boyuan@meta.com>
Date: Tue Oct 14 21:21:47 2025 -0700

    fix multi-graph test

    Signed-off-by: Boyuan Feng <boyuan@meta.com>
    Signed-off-by: ProExpertProg <lgovedic@redhat.com>
This pull request has merge conflicts that must be resolved before it can be
Signed-off-by: Boyuan Feng <boyuan@meta.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Documentation preview: https://vllm--26878.org.readthedocs.build/en/26878/
Signed-off-by: Boyuan Feng <boyuan@meta.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: bbartels <benjamin@bartels.dev>
Signed-off-by: Boyuan Feng <boyuan@meta.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Alberto Perdomo <aperdomo@redhat.com>
Signed-off-by: Boyuan Feng <boyuan@meta.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Boyuan Feng <boyuan@meta.com> Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
This PR fixes a test failure in test_multi_graph_piecewise_compile. The failure comes from a PyTorch issue that is fixed upstream in pytorch/pytorch#165514. To unblock the test, this PR monkey patches vLLM for PyTorch 2.9.0 only.
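For readers unfamiliar with version-gated monkey patches, here is a minimal sketch of the general pattern. It is not the code from this PR: is_exact_release, apply_patch_if_needed, and the stand-in module are hypothetical names chosen for illustration, and the version string is passed in explicitly rather than read from torch.__version__. The point is that the gate should match only the final 2.9.0 release, ignoring local build tags such as +cu128 and rejecting dev or rc builds.

```python
import re
import types


def is_exact_release(version: str, target: str) -> bool:
    """Return True only when `version` is the final release `target`.

    A local build tag such as "+cu128" is ignored. Dev and rc builds such as
    "2.9.0.dev20251001" or "2.9.0rc1" do not match.
    """
    public = version.split("+", 1)[0]  # drop the local build tag, if any
    is_final = re.fullmatch(r"\d+(\.\d+)*", public) is not None
    return is_final and public == target


def apply_patch_if_needed(version: str, module, name: str, replacement) -> bool:
    """Replace `module.<name>` with `replacement` only on the affected release.

    Returns True if the patch was applied, False otherwise.
    """
    if not is_exact_release(version, "2.9.0"):
        return False
    setattr(module, name, replacement)
    return True


# Hypothetical stand-in for the library module that would be patched.
fake_module = types.SimpleNamespace(plan=lambda: "original")

# Patched on the affected release, even with a local build tag.
patched = apply_patch_if_needed("2.9.0+cu128", fake_module, "plan", lambda: "patched")
```

Comparing the parsed public version rather than testing a string prefix avoids accidentally matching builds like 2.9.0.dev or a later 2.9.x release.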