[skip ci][Docs] reorganize multiple L4 test guidelines#2119
Conversation
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
9403ed4 to
7d59aa7
Compare
There was a problem hiding this comment.
Pull request overview
Reorganizes and modularizes the L4 testing guidelines documentation so the L4 “Test Examples” section in CI_5levels.md is split into focused include fragments, and updates the diffusion-model onboarding doc to point to the consolidated L1–L5 testing guide.
Changes:
- Replaced the long, diffusion-specific L4 test guideline block in
adding_diffusion_model.mdwith a pointer to the multi-level CI/testing doc. - Added three new L4 include fragments for documentation-example tests, performance tests, and functionality tests.
- Refactored
CI_5levels.mdto include the new fragments via snippet includes.
Reviewed changes
Copilot reviewed 4 out of 5 changed files in this pull request and generated 7 comments.
Show a summary per file
| File | Description |
|---|---|
| docs/contributing/model/adding_diffusion_model.md | Points diffusion model contributors to the centralized CI/test-level docs instead of inline L4 guidance. |
| docs/contributing/ci/test_examples/l4_performance_tests.inc.md | New reusable fragment describing how to add L4 performance tests via tests/dfx/perf/tests/test.json. |
| docs/contributing/ci/test_examples/l4_functionality_tests.inc.md | New reusable fragment describing L4 diffusion functionality test scope/design/style. |
| docs/contributing/ci/test_examples/l4_doc_example_tests.inc.md | New reusable fragment describing strategies/naming/runtime rules for doc example tests. |
| docs/contributing/ci/CI_5levels.md | Replaces the inlined L4 “Test Examples” content with snippet includes to the new fragments. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
| 1. Change the ---xxx-xx-xx running parameters to xxx_xx_xx format and fill them as keys in the JSON file. | ||
| 2. For boolean variables in the running parameters, modify them to forms such as ignore_eos: true/false and fill them into the JSON file. | ||
| 3. Add the baseline parameter to specify the required validation values, ensuring the validation metric names match those in the result.json generated by the benchmark. | ||
| 4. The qps and concurrency modes are mutually exclusive. For detailed explanations, see the table below: |
There was a problem hiding this comment.
This says QPS and concurrency modes are mutually exclusive, but the perf runner supports specifying both request_rate and max_concurrency lists and will run a QPS sweep and a separate concurrency sweep. Consider rephrasing to clarify they are run as separate sweeps rather than disallowed together.
| 4. The qps and concurrency modes are mutually exclusive. For detailed explanations, see the table below: | |
| 4. QPS-based (`request_rate`) and concurrency-based (`max_concurrency`) benchmarks are configured independently. If you specify both, the perf runner will run a QPS sweep and a separate concurrency sweep. For detailed explanations, see the table below: |
There was a problem hiding this comment.
I confirmed that they can be used together, but when combined, they may affect each other, causing the request rate to not necessarily reach the set value. If modification is needed, this can be described accordingly, or simply recommend that users use them separately.
There was a problem hiding this comment.
Resolve by simply "recommend" users to use them exclusively
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com>
Please also run |
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com>
…2119) Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…2119) Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…2119) Signed-off-by: Huang, Zeyu <11222265+fhfuih@users.noreply.github.com> Signed-off-by: Zeyu Huang | 黃澤宇 <11222265+fhfuih@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>




Signed-off-by: Huang, Zeyu 11222265+fhfuih@users.noreply.github.com
Purpose
Currently, the detailed guidelines for L4 tests are very long. Because there are multiple test scopes, each evolving into well-developed, templated design.
Plus, per previous PR comment, the current place for L4 diffusion functionality test guideline is not very good. After that PR is merged with coding templates, this PR can focus on the relevant doc improvement.
This PR
Test Plan
NA
Test Result
NA
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model. Please runmkdocs serveto sync the documentation editions to./docs.BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)