fix(cherry-pick): KV Router bindings docs#3337
Conversation
Signed-off-by: PeaBrane <yanrpei@gmail.com>
|
Caution Review failedFailed to post review comments WalkthroughRepository-wide version bump to 0.5.1 across images, charts, and docs; CLI/docs rename from file to dataset for router benchmarks with new ISL/OSL knobs; router snapshot threshold default increased to 1,000,000 in Python/Rust/docs; Docker build path adjusted for TRT-LLM; NATS stream setup simplified; minor handler/operator/env merge changes; tests updated. Changes
Sequence Diagram(s)sequenceDiagram
autonumber
actor User
participant CLI as real_data_benchmark.py
participant Synth as Synthesizer
participant DS as Input Dataset
participant LoadGen as genai-perf
User->>CLI: Run with --input-dataset and ISL/OSL knobs
CLI->>DS: Check dataset path and line count
alt Synthesis required (min/max ISL/OSL provided or caching criteria)
CLI->>Synth: synthesize_requests(num, max_isl, min_isl, min_osl, max_osl)
Synth->>DS: Read requests
Synth-->>CLI: Filtered/clipped requests (synthetic_trace.jsonl)
CLI->>LoadGen: Invoke with synthetic_trace.jsonl and schedule
else Use dataset directly
CLI->>LoadGen: Invoke with input_dataset and schedule
end
LoadGen-->>CLI: Results/artifacts
CLI-->>User: Benchmark summary
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~25 minutes Poem
Pre-merge checks❌ Failed checks (3 warnings)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Overview:
as titled, main PR in #3308
Summary by CodeRabbit