[Misc] Deprecate semantic_cache.backend_config_path and embed backend config inline#1100
Merged
Xunzhuo merged 10 commits intovllm-project:mainfrom Jan 19, 2026
Merged
Conversation
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
d213e2c to
11affa5
Compare
…cheOption initializer to handle the new configuration approach Signed-off-by: Scanf-s <sullung2yo@gmail.com>
…s not given Signed-off-by: Scanf-s <sullung2yo@gmail.com>
Signed-off-by: Scanf-s <sullung2yo@gmail.com>
Signed-off-by: Scanf-s <sullung2yo@gmail.com>
Signed-off-by: Scanf-s <sullung2yo@gmail.com>
Signed-off-by: Scanf-s <sullung2yo@gmail.com>
…ject#1089) Add pluggable model selection algorithms for intelligent routing: - Elo rating system with Bradley-Terry model for preference-based selection - RouterDC for query-to-model embedding matching - AutoMix for POMDP-based cost-quality optimization - Hybrid selector combining multiple methods with configurable weights - Static selector for backwards compatibility Integration: - OpenAIRouter initializes selection registry on startup - req_filter_classification uses configured selector instead of hardcoded first model - Prometheus metrics for selection tracking Signed-off-by: asaadbalum <asaad.balum@gmail.com> Signed-off-by: Scanf-s <sullung2yo@gmail.com>
Signed-off-by: Scanf-s <sullung2yo@gmail.com>
Signed-off-by: Scanf-s <sullung2yo@gmail.com>
d48833a to
7e31f04
Compare
Contributor
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
Xunzhuo
approved these changes
Jan 19, 2026
Member
Xunzhuo
left a comment
There was a problem hiding this comment.
looks good, thanks! would you like to double check if all backend_config_path has been removed as follow-up?
Contributor
Author
|
@Xunzhuo Sure! I can do that for the follow-up task. |
henschwartz
pushed a commit
to henschwartz/semantic-router
that referenced
this pull request
Jan 21, 2026
… config inline (vllm-project#1100) * fix: Refactor Redis and Milvus Cache Config into config.go, Update CacheOption initializer to handle the new configuration approach Signed-off-by: Scanf-s <sullung2yo@gmail.com> * fix: Add fallback logic when proper redis or milvus configuration does not given Signed-off-by: Scanf-s <sullung2yo@gmail.com> * docs: Add sample inline redis configuration example Signed-off-by: Scanf-s <sullung2yo@gmail.com> * docs: Update cache configuration examples Signed-off-by: Scanf-s <sullung2yo@gmail.com> * fix: Update HybridCache Milvus configuration Signed-off-by: Scanf-s <sullung2yo@gmail.com> * chore: Apply code linter Signed-off-by: Scanf-s <sullung2yo@gmail.com> * Feat(selection): implement advanced model selection methods (vllm-project#1089) Add pluggable model selection algorithms for intelligent routing: - Elo rating system with Bradley-Terry model for preference-based selection - RouterDC for query-to-model embedding matching - AutoMix for POMDP-based cost-quality optimization - Hybrid selector combining multiple methods with configurable weights - Static selector for backwards compatibility Integration: - OpenAIRouter initializes selection registry on startup - req_filter_classification uses configured selector instead of hardcoded first model - Prometheus metrics for selection tracking Signed-off-by: asaadbalum <asaad.balum@gmail.com> Signed-off-by: Scanf-s <sullung2yo@gmail.com> * feat: Add inline cache configuration unit tests Signed-off-by: Scanf-s <sullung2yo@gmail.com> * feat: Add cache unit tests Signed-off-by: Scanf-s <sullung2yo@gmail.com> --------- Signed-off-by: Scanf-s <sullung2yo@gmail.com> Signed-off-by: asaadbalum <asaad.balum@gmail.com> Co-authored-by: asaadbalum <154635253+asaadbalum@users.noreply.github.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

Overview
FIX #1022
This pull request adds inline
redis/milvuscache configuration support in a single configuration file.As suggested, previous
filebase cache backend configuration marked asDeprecated.If both provided,
inline configurationtakes prioritySolution
MilvusConfigandRedisConfiginto config.go to resolve circular import issues.backend_config_pathas(Deprecated)in the relevant code sections.HybridCacheto match the currentMilvusCacheimplementation.cache_test.goandconfig_test.goto validate the changesTests
-swhen doinggit commit[Bugfix],[Feat], and[CI].