Skip to content

docs: update metrics design doc to use new vllm:kv_cache_usage_perc#30041

Merged
markmc merged 2 commits intovllm-project:mainfrom
haitwang-cloud:rename-gpu-cache-metric-deprecation
Dec 4, 2025
Merged

docs: update metrics design doc to use new vllm:kv_cache_usage_perc#30041
markmc merged 2 commits intovllm-project:mainfrom
haitwang-cloud:rename-gpu-cache-metric-deprecation

Conversation

@haitwang-cloud
Copy link
Copy Markdown
Contributor

@haitwang-cloud haitwang-cloud commented Dec 4, 2025

In #18354, the vllm:gpu_cache_usage_perc metric was deprecated and vllm:kv_cache_usage_perc was introduced as its replacement.

See also #27133

@mergify
Copy link
Copy Markdown

mergify bot commented Dec 4, 2025

Documentation preview: https://vllm--30041.org.readthedocs.build/en/30041/

@mergify mergify bot added the documentation Improvements or additions to documentation label Dec 4, 2025
Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request updates metric names in the docs/design/metrics.md file to improve clarity and consistency. The changes involve renaming vllm:time_per_output_token_seconds to vllm:inter_token_latency_seconds_bucket and vllm:gpu_cache_usage_perc to vllm:kv_cache_usage_perc within the Grafana Dashboard section. These modifications align the documented metric names with what is currently implemented in the codebase and used in other parts of the documentation, which is a positive improvement for documentation accuracy. The changes are correct and I have no further comments.

@haitwang-cloud haitwang-cloud force-pushed the rename-gpu-cache-metric-deprecation branch from ab9d454 to 696dc95 Compare December 4, 2025 08:56
@markmc markmc changed the title fix(metrics): update metric names for clarity and consistency docs: update metrics design doc to use new vllm:kv_cache_usage_perc Dec 4, 2025
@markmc markmc added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 4, 2025
Copy link
Copy Markdown
Member

@markmc markmc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you

@markmc markmc enabled auto-merge (squash) December 4, 2025 23:36
@markmc markmc merged commit 690cc3e into vllm-project:main Dec 4, 2025
13 checks passed
dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026
…llm-project#30041)

Signed-off-by: Tim <tim.wang03@sap.com>
Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants