-
Notifications
You must be signed in to change notification settings - Fork 688
feat: Add a "model" label to Component metrics #2383
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Co-authored-by: Anant Sharma <[email protected]> Co-authored-by: Ishan Dhanani <[email protected]>
Co-authored-by: Dmitry Tokarev <[email protected]>
Co-authored-by: Dmitry Tokarev <[email protected]>
Signed-off-by: Anish <[email protected]> Co-authored-by: Anish <[email protected]>
…2260) Signed-off-by: Biswa Panda <[email protected]>
…#2319) Signed-off-by: Anish <[email protected]> Co-authored-by: Kristen Kelleher <[email protected]> Co-authored-by: Biswa Panda <[email protected]> Co-authored-by: Neal Vaidya <[email protected]>
Signed-off-by: Anish <[email protected]>
Signed-off-by: Anish <[email protected]>
|
The base isn't right here, there are many extra commits and a huge diff included. Please clean up the branch or start a new one only with the net new changes. |
|
Caution Review failedFailed to post review comments. WalkthroughThis update introduces major enhancements and restructuring to documentation across the project, especially for backend integrations (vLLM, SGLang, TensorRT-LLM) and their deployment guides. It adds detailed feature support matrices, clarifies installation steps (notably for SGLang), consolidates and corrects documentation links, and provides new or updated guides for Kubernetes, SLURM, and multi-node deployments. Minor code changes propagate model-awareness in component and metrics handling. Changes
Sequence Diagram(s)sequenceDiagram
participant User
participant Docs
participant Backend (vLLM/SGLang/TRTLLM)
participant Metrics
participant DeployTool (K8s/SLURM/Helm)
User->>Docs: Reads feature matrix, install, and deployment guides
User->>DeployTool: Follows deployment instructions (K8s/SLURM/Helm)
DeployTool->>Backend: Launches backend with specified model/config
Backend->>Metrics: Reports metrics with model-aware labels
User->>Backend: Sends inference requests
Backend->>User: Returns results
Estimated code review effort🎯 4 (Complex) | ⏱️ ~45 minutes Possibly related PRs
Poem
Note 🔌 MCP (Model Context Protocol) integration is now available in Early Access!Pro users can now connect to remote MCP servers under the Integrations page to get reviews and chat conversations that understand additional development context. Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
SupportNeed help? Create a ticket on our support page for assistance with any issues or questions. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
Documentation and Community
|
…rvice_name has been removed in PR#2349
Overview:
Add a "model" label to Component metrics.
Details:
This pull request introduces model-specific metrics by adding a model field to the Component struct and updating the metrics labeling logic.
Changes
Where should the reviewer start?
lib/runtime/src/component.rs:
modelis added to Component/Endpoint.lib/runtime/src/metrics.rs: how model label is added to the metrics.
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
DIS-360 Add a "model" label to Component metrics
Summary by CodeRabbit