feat(selection): Add Prometheus metrics for model selection evolution tracking#1124
Merged
rootfs merged 1 commit intovllm-project:mainfrom Jan 21, 2026
Conversation
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
Contributor
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
9572e03 to
629a686
Compare
1deff25 to
c94cdc8
Compare
… tracking Implements comprehensive Prometheus metrics for explainability and traceability of ALL model selection methods evolution. ## New Metrics ### Elo-specific - llm_model_elo_rating: Current Elo rating per model/category - llm_model_feedback_total: Feedback events (wins/losses/ties) - llm_model_rating_change: Distribution of rating changes - llm_model_comparisons_total: Total comparisons per model - llm_model_win_rate: Win rate percentage ### AutoMix-specific - llm_model_automix_verification_prob: Learned verification probability - llm_model_automix_quality: Learned quality score - llm_model_automix_success_rate: Query success rate ### RouterDC-specific - llm_model_routerdc_similarity: Query-model similarity distribution - llm_model_routerdc_affinity: Learned affinity scores ### Hybrid-specific - llm_model_component_agreement: Agreement ratio between component selectors ## Other Changes - Pre-initialize all methods (elo, router_dc, automix, hybrid, static) - Fix GlobalRegistry initialization in router.go - Fix AutoMix deadlock in updateValueFunction - Add Grafana dashboard with method-specific panels - Add comprehensive METRICS.md documentation - Add unit tests for all new metrics Signed-off-by: asaadbalum <asaad.balum@gmail.com>
c94cdc8 to
cda215e
Compare
rootfs
approved these changes
Jan 21, 2026
Collaborator
|
@asaadbalum this is cool! can you record a video of the dashboard over time? we can add to our youtube channel. Thanks |
Collaborator
Author
Sure will do |
henschwartz
pushed a commit
to henschwartz/semantic-router
that referenced
this pull request
Feb 18, 2026
… tracking (vllm-project#1124) Implements comprehensive Prometheus metrics for explainability and traceability of ALL model selection methods evolution. ## New Metrics ### Elo-specific - llm_model_elo_rating: Current Elo rating per model/category - llm_model_feedback_total: Feedback events (wins/losses/ties) - llm_model_rating_change: Distribution of rating changes - llm_model_comparisons_total: Total comparisons per model - llm_model_win_rate: Win rate percentage ### AutoMix-specific - llm_model_automix_verification_prob: Learned verification probability - llm_model_automix_quality: Learned quality score - llm_model_automix_success_rate: Query success rate ### RouterDC-specific - llm_model_routerdc_similarity: Query-model similarity distribution - llm_model_routerdc_affinity: Learned affinity scores ### Hybrid-specific - llm_model_component_agreement: Agreement ratio between component selectors ## Other Changes - Pre-initialize all methods (elo, router_dc, automix, hybrid, static) - Fix GlobalRegistry initialization in router.go - Fix AutoMix deadlock in updateValueFunction - Add Grafana dashboard with method-specific panels - Add comprehensive METRICS.md documentation - Add unit tests for all new metrics Signed-off-by: asaadbalum <asaad.balum@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

Summary
Implements issue #1093 - adds comprehensive Prometheus metrics for explainability and traceability of ALL model selection methods evolution.
This is a follow-up to #987 (Advanced Model Selection Methods) as suggested by @rootfs in PR #1089.
Dashboard Demo
Key Features Shown:
New Metrics
General Selection Metrics (Auto-recorded for ALL methods)
llm_model_selection_totalllm_model_selection_historyllm_model_selection_scorellm_model_selection_confidenceElo-specific Metrics
llm_model_elo_ratingllm_model_feedback_totalllm_model_rating_changellm_model_win_rateAutoMix-specific Metrics
llm_model_automix_verification_probllm_model_automix_qualityllm_model_automix_success_rateRouterDC-specific Metrics
llm_model_routerdc_similarityllm_model_routerdc_affinityHybrid-specific Metrics
llm_model_selection_component_agreementArchitecture
req_filter_classification.gofor ALL methodsFiles Changed
pkg/selection/metrics.gopkg/selection/automix.gopkg/selection/router_dc.gopkg/selection/hybrid.gopkg/selection/elo.gopkg/selection/metrics_test.gopkg/selection/METRICS.mddeploy/docker-compose/addons/model-selection-dashboard.jsonTesting
All tests pass including comprehensive evolution tests for each method:
Acceptance Criteria
Fixes: #1093