
[BugFix] Fix cache issue in compilation_config#31376

Merged
yewentao256 merged 3 commits into vllm-project:main from BoyuanFeng:bf/fix-cache-compilation-config
Dec 27, 2025

Conversation

@BoyuanFeng (Collaborator) commented Dec 26, 2025

#22204 added get_cached_compilation_config(), decorated with @lru_cache(maxsize=1), to cache compilation-config lookups:

@lru_cache(maxsize=1)
def get_cached_compilation_config():
    """Cache config to avoid repeated calls to get_current_vllm_config()"""
    return get_current_vllm_config().compilation_config

Since get_cached_compilation_config() takes no arguments, its cached value is never refreshed unless get_cached_compilation_config.cache_clear() is called explicitly.
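As a minimal standalone sketch of that behavior (the names here are illustrative, not vLLM's code): an argument-less function under @lru_cache(maxsize=1) has only one cache key, so it keeps returning its first result until the cache is cleared explicitly.

```python
from functools import lru_cache

calls = {"n": 0}

@lru_cache(maxsize=1)
def cached_value():
    # With no arguments there is a single cache key, so the first
    # return value is reused forever until cache_clear() is called.
    calls["n"] += 1
    return calls["n"]

print(cached_value())  # 1
print(cached_value())  # 1: the underlying function is not re-run
cached_value.cache_clear()
print(cached_value())  # 2: refreshed only after the explicit clear
```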

To refresh the config, #22204 calls get_cached_compilation_config.cache_clear() when set_current_vllm_config exits:

@contextmanager
def set_current_vllm_config(vllm_config: VllmConfig, check_compile=False, prefix: Optional[str] = None):
    global _current_vllm_config, _current_prefix
    old_vllm_config = _current_vllm_config
    old_prefix = _current_prefix
    try:
        _current_vllm_config = vllm_config
        _current_prefix = prefix
        yield
    finally:
        _current_vllm_config = old_vllm_config
        _current_prefix = old_prefix
        # Clear the compilation config cache when context changes
        get_cached_compilation_config.cache_clear()  # <----- explicitly clear the cache

However, a bug occurs if the old_vllm_config has already been accessed and cached before set_current_vllm_config(new_config) is entered. In other words:

old_config = get_cached_compilation_config()  # populates the cache
with set_current_vllm_config(new_config):
    # old_config is still returned here instead of new_config
    ...

This PR fixes the issue by clearing the cache when entering the context manager.
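The scenario and the fix can be sketched with toy stand-ins for vLLM's global config state (the string values and the simplified signatures below are illustrative, not vLLM's actual objects):

```python
from contextlib import contextmanager
from functools import lru_cache

# Toy stand-in for vLLM's module-level current config.
_current_config = "default-compilation-config"

@lru_cache(maxsize=1)
def get_cached_compilation_config():
    return _current_config

@contextmanager
def set_current_vllm_config(new_config):
    global _current_config
    # The fix: clear on *entry*, so a value cached before this context
    # manager was entered cannot leak into the new context.
    get_cached_compilation_config.cache_clear()
    old_config = _current_config
    _current_config = new_config
    try:
        yield
    finally:
        _current_config = old_config
        # Clearing only on exit (the original behavior) is not enough:
        # it cannot undo a cache entry created before entry.
        get_cached_compilation_config.cache_clear()

# Reproduce the bug scenario: the old config is accessed and cached first.
assert get_cached_compilation_config() == "default-compilation-config"
with set_current_vllm_config("new-compilation-config"):
    # With the entry-time clear, the new config is visible here; without
    # it, the stale "default-compilation-config" would be returned.
    assert get_cached_compilation_config() == "new-compilation-config"
assert get_cached_compilation_config() == "default-compilation-config"
```

The design choice is simply to clear on both entry and exit, so the cache never outlives the context in which it was populated.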

Signed-off-by: Boyuan Feng <boyuan@meta.com>
@chatgpt-codex-connector

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

@gemini-code-assist bot (Contributor) left a comment:

Code Review

This pull request addresses a caching bug in get_cached_compilation_config by ensuring the cache is cleared upon entering the set_current_vllm_config context manager. This prevents an old, cached configuration from being used when a new configuration is set. The fix is correct and directly solves the issue described. The added code comment clearly explains the necessity of this change. The implementation is sound, and I have no further recommendations.

@yewentao256 (Member) left a comment:

LGTM, thanks for the work!

Is there a specific issue this PR solves? (i.e., a case that would cause trouble without this PR?)

@yewentao256 yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 26, 2025
@BoyuanFeng (Collaborator, Author) commented Dec 26, 2025

@yewentao256 I added a unit test test_cached_compilation_config to show the issue.
tl;dr: prior to this PR, if I call old_config = get_cached_compilation_config() first and then enter with set_current_vllm_config(vllm_config):, the stale old_config is returned inside the context manager instead of vllm_config.compilation_config.

@yewentao256 (Member) left a comment:

LGTM, thanks for the work!

@yewentao256 (Member) left a comment:

Merging this PR, as all CI tests pass.

@yewentao256 yewentao256 merged commit 2f12cd3 into vllm-project:main Dec 27, 2025
46 checks passed
akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Dec 28, 2025
yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Dec 30, 2025
akh64bit pushed a commit to akh64bit/vllm that referenced this pull request Jan 16, 2026

Labels

ready ONLY add when PR is ready to merge/full CI is needed

3 participants