-
Notifications
You must be signed in to change notification settings - Fork 676
fix: fix broken doc links #2308
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Caution Review failedFailed to post review comments. WalkthroughThis update introduces extensive documentation and configuration changes across the codebase, focusing on Kubernetes deployment guides, backend-specific deployment patterns, and configuration refactoring. Notable code changes include a major refactor of port allocation logic for vLLM backends, Grove feature configuration in the operator, and updates to CUDA graph and cache settings in TRTLLM engine configs. Multiple Dockerfiles, Helm charts, and deployment YAMLs are updated for consistency, dependency pinning, and improved health checks. Changes
Sequence Diagram(s)sequenceDiagram
participant User
participant Etcd
participant vLLM Worker
participant Ports Module
User->>vLLM Worker: Start with --dynamo-port-min/max
vLLM Worker->>Ports Module: Request port allocation (with metadata)
Ports Module->>Etcd: Reserve port(s) atomically
Etcd-->>Ports Module: Confirm reservation
Ports Module-->>vLLM Worker: Return allocated port(s)
vLLM Worker->>User: Ready (ports configured)
sequenceDiagram
participant Operator
participant K8s API Server
participant Grove API
participant Helm Chart
Operator->>K8s API Server: Start with Grove config
Operator->>Grove API: Detect Grove availability
Grove API-->>Operator: Grove present?
Operator->>K8s API Server: Set Grove.Enabled and TerminationDelay
Helm Chart->>K8s API Server: Deploy with Grove settings
Estimated code review effort🎯 4 (Complex) | ⏱️ ~40 minutes Possibly related PRs
Poem
Warning There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure. 🔧 golangci-lint (2.2.2)Error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/product/migration-guide for migration instructions Note ⚡️ Unit Test Generation is now available in beta!Learn more here, or try it out under "Finishing Touches" below. Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
SupportNeed help? Create a ticket on our support page for assistance with any issues or questions. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
Documentation and Community
|
Overview:
Cherry-pick for #2309
fix broken doc links for deployment of frameworks
fixes nvbug: https://nvbugspro.nvidia.com/bug/5424387
closes linear: dyn-819
Details:
Where should the reviewer start?
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)
Summary by CodeRabbit
New Features
Bug Fixes
Documentation
Refactor
Chores