[skip CI][Docs] Add Qwen3-Omni and Qwen3-TTS performance blog and figures#1837
Signed-off-by: CHEN <116010019@link.cuhk.edu.cn> Co-authored-by: linyueqian <linyueqian@outlook.com>
ce5beca to 4f8713c
Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
Replaced the existing YouTube iframe with a new one. Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com>
**Qwen3-TTS** (H200, concurrency 1):

<table><tr>
<td><img src="figures/tts/Mean_E2EL_(ms)_vllm_omni_vs_transformers.png" alt="Qwen3-TTS E2EL: vLLM vs HF" width="100%"/></td>
Wouldn't it be better to use https://user-images.githubusercontent.com/xxx/xxxx/xxx.png rather than uploading these pictures to the repository?
The total size of all PNGs is only about 2–3 MB, which is negligible for the repository. Keeping the images alongside the blog content in the same version ensures consistency.
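For anyone who wants to check the size figure locally, here is a minimal sketch that sums the PNG sizes under a directory. The helper name `total_png_bytes` is hypothetical, and the `figures` path is an assumption based on the `img src` in the diff above; adjust it to wherever the blog images actually live.

```python
from pathlib import Path


def total_png_bytes(root: str) -> int:
    """Return the combined on-disk size, in bytes, of all .png files under root."""
    # rglob walks the directory tree recursively; the is_file check skips
    # any directory that happens to be named like "something.png".
    return sum(p.stat().st_size for p in Path(root).rglob("*.png") if p.is_file())


if __name__ == "__main__":
    # "figures" is an assumed location, not confirmed by the PR.
    size = total_png_bytes("figures")
    print(f"Total PNG size: {size / 1_000_000:.2f} MB")
```

Note that this measures the working-tree size only; every later revision of a binary image also stays in the Git history.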
…ugfix qwen3 omni and tts blog Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
1229ce5 to e341e6b
Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
Updated TTS benchmark results with latest vLLM v0.18.0 / vllm-omni v0.18.0rc2 data (H200). Key changes:
Headline numbers (concurrency 1):
@Sy0307 @JuanPZuluaga - could you take a look at these results and see if they align with what you're seeing?
Signed-off-by: linyueqian <linyueqian@outlook.com>
158bd71 to e43e72f
Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
@yinpeiqi please also check whether the corresponding descriptions and results are consistent with the paper.
Signed-off-by: CHEN <116010019@link.cuhk.edu.cn>
@linyueqian I ran a benchmark locally with the latest main:
Results on RTX 6000 Ada vs. results from PR (H200) (I'm seeing a slightly faster TTFP)
Thank you very much!
…ures (vllm-project#1837) Signed-off-by: CHEN <116010019@link.cuhk.edu.cn> Signed-off-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Signed-off-by: linyueqian <linyueqian@outlook.com> Co-authored-by: Yueqian Lin <70319226+linyueqian@users.noreply.github.com> Co-authored-by: linyueqian <linyueqian@outlook.com>
Purpose
Add Qwen3-Omni and Qwen3-TTS performance blog and figures
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model. Please run mkdocs serve to sync the documentation editions to ./docs.
BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)