Skip to content

Conversation

@jinzhen-lin
Copy link
Contributor

@jinzhen-lin jinzhen-lin commented Aug 7, 2025

This pr add mxfp4 and bias support for marlin kernel. (for gpt-oss)

Performance test on gpt-oss-20b + RTX4090 + TP1 (tokens / s)

bs triton marlin
1 97.9 97.8
2 190.2 191.8
4 348 379.6
8 551.2 622.3
16 849.5 1063.9
32 1513.5 1769.5
64 2380.4 3142
128 3659.5 4670.6

Performance test on gpt-oss-20b + A100 + TP1 (tokens / s)

bs triton marlin
1 124.4 125.8
2 210.8 246
4 322.8 477.6
8 515.2 879
16 814.4 1398.4
32 1395 2367.7
64 2349.1 3756.5
128 3533 5887.8

Performance test on gpt-oss-120b + A100 + TP2 (tokens / s)

bs triton marlin
1 110.4 117.5
2 193.8 227.1
4 345.2 427.1
8 515.3 731.9
16 833.6 1283.1
32 1280 2054.2
64 1868.7 3052.7
128 2777.8 4172.4

Note: not benchmark test, just copy the Avg generation throughput: value from vllm log

@github-actions
Copy link

github-actions bot commented Aug 7, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request adds support for MXFP4 and bias in the Marlin kernel. The changes are extensive, touching kernel implementations, generation scripts, and Python bindings. The core logic for handling the new MXFP4 format and incorporating bias seems well-integrated. A new scalar type for MXFP4 scales is introduced, and the necessary dequantization logic is added. Tests have also been updated to cover these new features. My main concern is a critical inconsistency in the definition of the new scalar type between C++ and Python, which could lead to incorrect behavior.

@mergify mergify bot added the performance Performance-related issues label Aug 7, 2025
@mgoin mgoin self-assigned this Aug 7, 2025
Copy link
Member

@mgoin mgoin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very nice work! I think most of the places that need to updated look properly addressed. I left some comments from initial pass

@mergify mergify bot added the ci/build label Aug 8, 2025
@mergify
Copy link

mergify bot commented Aug 8, 2025

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @jinzhen-lin.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

@jinzhen-lin jinzhen-lin mentioned this pull request Aug 8, 2025
1 task
@ehartford
Copy link
Contributor

Hero!

lengrongfu and others added 13 commits August 9, 2025 13:03
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
… Dispatch (vllm-project#21837)

Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
@jinzhen-lin
Copy link
Contributor Author

@mgoin The failure is fixed. I previously omitted the permute operation on the bias (Qwen1.5 has bias in attention layer).

The metric is ok in my local test. However, the metrics of the test results fluctuate significantly. I run the same test multi-times:

|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  |0.4450|±  |0.0137|
|     |       |strict-match    |     5|exact_match|↑  |0.2752|±  |0.0123|
|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  |0.4738|±  |0.0138|
|     |       |strict-match    |     5|exact_match|↑  |0.3010|±  |0.0126|
|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  |0.4647|±  |0.0137|
|     |       |strict-match    |     5|exact_match|↑  |0.2835|±  |0.0124|

So the ground_truth 0.465 / 0.3 may be too high. Also see the test failure in another pr mentioned by @bnellnm .

@simon-mo
Copy link
Collaborator

Please run GPQA

pip install gpt-oss, and run OPENAI_API_KEY=empty python -m gpt_oss.evals --model openai/gpt-oss-120b --eval gpqa --n-threads 128

@simon-mo
Copy link
Collaborator

Looking forward to ship this!

@jinzhen-lin
Copy link
Contributor Author

Please run GPQA

pip install gpt-oss, and run OPENAI_API_KEY=empty python -m gpt_oss.evals --model openai/gpt-oss-120b --eval gpqa --n-threads 128

The test result of gpt-oss-20b:

# reasoning_effort=low
{'chars': 53.39962121212121, 'chars:std': 208.81521548256066, 'score': 0.5650252525252525, 'score:std': 0.49575368534588543}

# reasoning_effort=medium
{'chars': 40.38762626262626, 'chars:std': 182.659349965226, 'score': 0.6647727272727273, 'score:std': 0.47206985536793966}

The reasoning_effort=high setting take too many time to run. The results of reasoning_effort=low/medium settings
seem ok.

@mgoin mgoin removed this from Tool Calling Aug 13, 2025
Signed-off-by: Jinzhen Lin <[email protected]>
@mgoin mgoin enabled auto-merge (squash) August 14, 2025 13:58
@simon-mo
Copy link
Collaborator

I ran some GPQA on H100 as well. LGTM

20B

Low
{'chars': np.float64(49.21022727272727), 'chars:std': np.float64(199.19166667350962), 'score': np.float64(0.5669191919191919), 'score:std': np.float64(0.4955015860245882)}

Medium
{'chars': np.float64(40.28787878787879), 'chars:std': np.float64(184.79641529424072), 'score': np.float64(0.6843434343434344), 'score:std': np.float64(0.4647768262451001)}

120B

[
{'eval_name': 'gpqa', 'model_name': 'openai-gpt-oss-120b-low_temp1.0_20250813_150155', 'metric': 0.6407828282828283},
{'eval_name': 'gpqa', 'model_name': 'openai-gpt-oss-120b-medium_temp1.0_20250813_150155', 'metric': 0.7102272727272727},
{'eval_name': 'gpqa', 'model_name': 'openai-gpt-oss-120b-high_temp1.0_20250813_150155', 'metric': 0.7916666666666666}]

@simon-mo simon-mo disabled auto-merge August 14, 2025 18:23
@simon-mo simon-mo merged commit 33c63e9 into vllm-project:main Aug 14, 2025
73 checks passed
yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 19, 2025
…llm-project#22428)

Signed-off-by: rongfu.leng <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Animesh Jain <[email protected]>
Signed-off-by: Rui Qiao <[email protected]>
Signed-off-by: Xiongfei Wei <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: kf <[email protected]>
Signed-off-by: vllmellm <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Dipika Sikka <[email protected]>
Signed-off-by: Sage Moore <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
Signed-off-by: Chih-Chieh Yang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Signed-off-by: yan <[email protected]>
Signed-off-by: Yan Ma <[email protected]>
Signed-off-by: Xiao Liu <[email protected]>
Signed-off-by: jiahanc <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: LopezCastroRoberto <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Haibin Lin <[email protected]>
Signed-off-by: David Ben-David <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
Signed-off-by: zitian.zhao <[email protected]>
Signed-off-by: 22quinn <[email protected]>
Signed-off-by: Abirdcfly <[email protected]>
Signed-off-by: Giancarlo Delfin <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: huangweixiao <[email protected]>
Signed-off-by: alyosha-swamy <[email protected]>
Signed-off-by: Eric Hanley <[email protected]>
Signed-off-by: Abatom <[email protected]>
Signed-off-by: CLFutureX <[email protected]>
Signed-off-by: Linkun Chen <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: tlipoca9 <[email protected]>
Signed-off-by: elvischenv <[email protected]>
Signed-off-by: zitian zhao <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: wang.yuqi <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Siyuan Liu <[email protected]>
Signed-off-by: Benjamin Chislett <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Signed-off-by: LucasWilkinson <[email protected]>
Signed-off-by: Zhang Jason <[email protected]>
Signed-off-by: Yongye Zhu <[email protected]>
Signed-off-by: asafg <[email protected]>
Signed-off-by: Siyuan Fu <[email protected]>
Signed-off-by: Lain <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Tao He <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: QscQ <[email protected]>
Signed-off-by: qingjun <[email protected]>
Signed-off-by: Syed Muhammad Bin Asif <[email protected]>
Signed-off-by: Lionel Villard <[email protected]>
Signed-off-by: ycyaw66 <[email protected]>
Signed-off-by: David Chen <[email protected]>
Signed-off-by: Linkun <[email protected]>
Signed-off-by: Moritz Sanft <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
Signed-off-by: Adrian Garcia <[email protected]>
Signed-off-by: shaojunqi <[email protected]>
Signed-off-by: Ricardo Decal <[email protected]>
Signed-off-by: Andrew Chan <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: Andrew Sansom <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Po-Han Huang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Signed-off-by: XIn Li <[email protected]>
Signed-off-by: Junhao Li <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: Guy Stone <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: yyw <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: Pradyun Ramadorai <[email protected]>
Signed-off-by: Pradyun92 <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Co-authored-by: rongfu.leng <[email protected]>
Co-authored-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Harry Mellor <[email protected]>
Co-authored-by: Jee Jee Li <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Animesh Jain <[email protected]>
Co-authored-by: Rui Qiao <[email protected]>
Co-authored-by: XiongfeiWei <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Wentao Ye <[email protected]>
Co-authored-by: JartX <[email protected]>
Co-authored-by: fhl2000 <[email protected]>
Co-authored-by: vllmellm <[email protected]>
Co-authored-by: kf <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
Co-authored-by: tjtanaavllm <[email protected]>
Co-authored-by: Yong Hoon Shin <[email protected]>
Co-authored-by: Chih-Chieh Yang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Vadim Gimpelson <[email protected]>
Co-authored-by: Yuxuan Zhang <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Thomas Parnell <[email protected]>
Co-authored-by: Yan Ma <[email protected]>
Co-authored-by: Xiao <[email protected]>
Co-authored-by: jiahanc <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Co-authored-by: Roberto L. Castro <[email protected]>
Co-authored-by: Ning Xie <[email protected]>
Co-authored-by: H <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Li, Jiang <[email protected]>
Co-authored-by: TankNee <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Seiji Eicher <[email protected]>
Co-authored-by: ZiTian.Zhao <[email protected]>
Co-authored-by: 22quinn <[email protected]>
Co-authored-by: Abirdcfly <[email protected]>
Co-authored-by: Giancarlo Delfin <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Weixiao Huang <[email protected]>
Co-authored-by: Raghav Ravishankar <[email protected]>
Co-authored-by: ericehanley <[email protected]>
Co-authored-by: Zhonghua Deng <[email protected]>
Co-authored-by: Po-Han Huang (NVIDIA) <[email protected]>
Co-authored-by: PiteXChen <[email protected]>
Co-authored-by: lkchen <[email protected]>
Co-authored-by: TJian <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: tlipoca9 <[email protected]>
Co-authored-by: elvischenv <[email protected]>
Co-authored-by: wang.yuqi <[email protected]>
Co-authored-by: Benji Beck <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: Siyuan Liu <[email protected]>
Co-authored-by: Benjamin Chislett <[email protected]>
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chen Zhang <[email protected]>
Co-authored-by: Hongxia Yang <[email protected]>
Co-authored-by: Minseok Lee <[email protected]>
Co-authored-by: Yongye Zhu <[email protected]>
Co-authored-by: Lucas Wilkinson <[email protected]>
Co-authored-by: Zhang Jason <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]>
Co-authored-by: asafg <[email protected]>
Co-authored-by: Lain <[email protected]>
Co-authored-by: tc-mb <[email protected]>
Co-authored-by: imning3 <[email protected]>
Co-authored-by: Maximilien de Bayser <[email protected]>
Co-authored-by: Kunshang Ji <[email protected]>
Co-authored-by: Tao He <[email protected]>
Co-authored-by: qscqesze <[email protected]>
Co-authored-by: Syed Muhammad Bin Asif <[email protected]>
Co-authored-by: Lionel Villard <[email protected]>
Co-authored-by: WeiQing Chen <[email protected]>
Co-authored-by: ycyaw66 <[email protected]>
Co-authored-by: Moritz Sanft <[email protected]>
Co-authored-by: Ming Yang <[email protected]>
Co-authored-by: Adrián García García <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: JaceyShao <[email protected]>
Co-authored-by: shaojunqi <[email protected]>
Co-authored-by: Ricardo Decal <[email protected]>
Co-authored-by: Andrew Chan <[email protected]>
Co-authored-by: fxmarty-amd <[email protected]>
Co-authored-by: Andrew Sansom <[email protected]>
Co-authored-by: Zhiyu <[email protected]>
Co-authored-by: Shu Wang <[email protected]>
Co-authored-by: XIn Li <[email protected]>
Co-authored-by: Junhao Li <[email protected]>
Co-authored-by: Chauncey <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: Hong Hanh <[email protected]>
Co-authored-by: Daniel Serebrenik <[email protected]>
Co-authored-by: yewentao256 <[email protected]>
Co-authored-by: Guy Stone <[email protected]>
Co-authored-by: yyweiss <[email protected]>
Co-authored-by: Pradyun92 <[email protected]>
Co-authored-by: Pradyun Ramadorai <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
@simon-mo
Copy link
Collaborator

simon-mo commented Aug 20, 2025

Forgot to post my local perf test record, putting here for reference

With Marlin: H100

vllm bench serve --dataset-name hf --dataset-path AI-MO/aimo-validation-aime --model openai/gpt-oss-120b --port 8001 --num-prompts 2048


============ Serving Benchmark Result ============
Successful requests:                     2048      
Benchmark duration (s):                  287.20    
Total input tokens:                      214965    
Total generated tokens:                  4051696   
Request throughput (req/s):              7.13      
Output token throughput (tok/s):         14107.40  
Total Token throughput (tok/s):          14855.88  
---------------Time to First Token----------------
Mean TTFT (ms):                          44399.41  
Median TTFT (ms):                        4974.10   
P99 TTFT (ms):                           160217.06 
-----Time per Output Token (excl. 1st token)------
Mean TPOT (ms):                          57.83     
Median TPOT (ms):                        59.46     
P99 TPOT (ms):                           68.26     
---------------Inter-token Latency----------------
Mean ITL (ms):                           55.48     
Median ITL (ms):                         53.24     
P99 ITL (ms):                            169.69    
==================================================

============ Serving Benchmark Result ============
Successful requests:                     2048
Benchmark duration (s):                  288.44
Total input tokens:                      214965
Total generated tokens:                  4048325
Request throughput (req/s):              7.10
Output token throughput (tok/s):         14035.29
Total Token throughput (tok/s):          14780.56
---------------Time to First Token----------------
Mean TTFT (ms):                          45371.57
Median TTFT (ms):                        5395.25
P99 TTFT (ms):                           163403.50
-----Time per Output Token (excl. 1st token)------
Mean TPOT (ms):                          58.56
Median TPOT (ms):                        60.77
P99 TPOT (ms):                           68.30
---------------Inter-token Latency----------------
Mean ITL (ms):                           56.11
Median ITL (ms):                         53.49
P99 ITL (ms):                            145.67
==================================================

With Triton matmul ogs


uv pip install -U --pre torch==2.9.0.dev20250804+cu128 triton==3.4.0+git663e04e8 triton-kernels --index-url https://wheels.vllm.ai/gpt-oss --extra-index-url https://download.pytorch.org/whl/nightly/cu128 --index-strategy unsafe-best-match




============ Serving Benchmark Result ============
Successful requests:                     2048
Benchmark duration (s):                  276.56
Total input tokens:                      214965
Total generated tokens:                  4086045
Request throughput (req/s):              7.41
Output token throughput (tok/s):         14774.54
Total Token throughput (tok/s):          15551.83
---------------Time to First Token----------------
Mean TTFT (ms):                          50542.82
Median TTFT (ms):                        11954.13
P99 TTFT (ms):                           164304.18
-----Time per Output Token (excl. 1st token)------
Mean TPOT (ms):                          57.00
Median TPOT (ms):                        59.84
P99 TPOT (ms):                           82.02
---------------Inter-token Latency----------------
Mean ITL (ms):                           53.19
Median ITL (ms):                         46.46
P99 ITL (ms):                            186.68
==================================================

divakar-amd pushed a commit to divakar-amd/vllm_upstream that referenced this pull request Aug 20, 2025
…llm-project#22428)

Signed-off-by: rongfu.leng <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Animesh Jain <[email protected]>
Signed-off-by: Rui Qiao <[email protected]>
Signed-off-by: Xiongfei Wei <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: kf <[email protected]>
Signed-off-by: vllmellm <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Dipika Sikka <[email protected]>
Signed-off-by: Sage Moore <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
Signed-off-by: Chih-Chieh Yang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Signed-off-by: yan <[email protected]>
Signed-off-by: Yan Ma <[email protected]>
Signed-off-by: Xiao Liu <[email protected]>
Signed-off-by: jiahanc <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: LopezCastroRoberto <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Haibin Lin <[email protected]>
Signed-off-by: David Ben-David <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
Signed-off-by: zitian.zhao <[email protected]>
Signed-off-by: 22quinn <[email protected]>
Signed-off-by: Abirdcfly <[email protected]>
Signed-off-by: Giancarlo Delfin <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: huangweixiao <[email protected]>
Signed-off-by: alyosha-swamy <[email protected]>
Signed-off-by: Eric Hanley <[email protected]>
Signed-off-by: Abatom <[email protected]>
Signed-off-by: CLFutureX <[email protected]>
Signed-off-by: Linkun Chen <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: tlipoca9 <[email protected]>
Signed-off-by: elvischenv <[email protected]>
Signed-off-by: zitian zhao <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: wang.yuqi <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Siyuan Liu <[email protected]>
Signed-off-by: Benjamin Chislett <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Signed-off-by: LucasWilkinson <[email protected]>
Signed-off-by: Zhang Jason <[email protected]>
Signed-off-by: Yongye Zhu <[email protected]>
Signed-off-by: asafg <[email protected]>
Signed-off-by: Siyuan Fu <[email protected]>
Signed-off-by: Lain <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Tao He <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: QscQ <[email protected]>
Signed-off-by: qingjun <[email protected]>
Signed-off-by: Syed Muhammad Bin Asif <[email protected]>
Signed-off-by: Lionel Villard <[email protected]>
Signed-off-by: ycyaw66 <[email protected]>
Signed-off-by: David Chen <[email protected]>
Signed-off-by: Linkun <[email protected]>
Signed-off-by: Moritz Sanft <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
Signed-off-by: Adrian Garcia <[email protected]>
Signed-off-by: shaojunqi <[email protected]>
Signed-off-by: Ricardo Decal <[email protected]>
Signed-off-by: Andrew Chan <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: Andrew Sansom <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Po-Han Huang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Signed-off-by: XIn Li <[email protected]>
Signed-off-by: Junhao Li <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: Guy Stone <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: yyw <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: Pradyun Ramadorai <[email protected]>
Signed-off-by: Pradyun92 <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Co-authored-by: rongfu.leng <[email protected]>
Co-authored-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Harry Mellor <[email protected]>
Co-authored-by: Jee Jee Li <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Animesh Jain <[email protected]>
Co-authored-by: Rui Qiao <[email protected]>
Co-authored-by: XiongfeiWei <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Wentao Ye <[email protected]>
Co-authored-by: JartX <[email protected]>
Co-authored-by: fhl2000 <[email protected]>
Co-authored-by: vllmellm <[email protected]>
Co-authored-by: kf <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
Co-authored-by: tjtanaavllm <[email protected]>
Co-authored-by: Yong Hoon Shin <[email protected]>
Co-authored-by: Chih-Chieh Yang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Vadim Gimpelson <[email protected]>
Co-authored-by: Yuxuan Zhang <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Thomas Parnell <[email protected]>
Co-authored-by: Yan Ma <[email protected]>
Co-authored-by: Xiao <[email protected]>
Co-authored-by: jiahanc <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Co-authored-by: Roberto L. Castro <[email protected]>
Co-authored-by: Ning Xie <[email protected]>
Co-authored-by: H <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Li, Jiang <[email protected]>
Co-authored-by: TankNee <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Seiji Eicher <[email protected]>
Co-authored-by: ZiTian.Zhao <[email protected]>
Co-authored-by: 22quinn <[email protected]>
Co-authored-by: Abirdcfly <[email protected]>
Co-authored-by: Giancarlo Delfin <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Weixiao Huang <[email protected]>
Co-authored-by: Raghav Ravishankar <[email protected]>
Co-authored-by: ericehanley <[email protected]>
Co-authored-by: Zhonghua Deng <[email protected]>
Co-authored-by: Po-Han Huang (NVIDIA) <[email protected]>
Co-authored-by: PiteXChen <[email protected]>
Co-authored-by: lkchen <[email protected]>
Co-authored-by: TJian <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: tlipoca9 <[email protected]>
Co-authored-by: elvischenv <[email protected]>
Co-authored-by: wang.yuqi <[email protected]>
Co-authored-by: Benji Beck <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: Siyuan Liu <[email protected]>
Co-authored-by: Benjamin Chislett <[email protected]>
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chen Zhang <[email protected]>
Co-authored-by: Hongxia Yang <[email protected]>
Co-authored-by: Minseok Lee <[email protected]>
Co-authored-by: Yongye Zhu <[email protected]>
Co-authored-by: Lucas Wilkinson <[email protected]>
Co-authored-by: Zhang Jason <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]>
Co-authored-by: asafg <[email protected]>
Co-authored-by: Lain <[email protected]>
Co-authored-by: tc-mb <[email protected]>
Co-authored-by: imning3 <[email protected]>
Co-authored-by: Maximilien de Bayser <[email protected]>
Co-authored-by: Kunshang Ji <[email protected]>
Co-authored-by: Tao He <[email protected]>
Co-authored-by: qscqesze <[email protected]>
Co-authored-by: Syed Muhammad Bin Asif <[email protected]>
Co-authored-by: Lionel Villard <[email protected]>
Co-authored-by: WeiQing Chen <[email protected]>
Co-authored-by: ycyaw66 <[email protected]>
Co-authored-by: Moritz Sanft <[email protected]>
Co-authored-by: Ming Yang <[email protected]>
Co-authored-by: Adrián García García <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: JaceyShao <[email protected]>
Co-authored-by: shaojunqi <[email protected]>
Co-authored-by: Ricardo Decal <[email protected]>
Co-authored-by: Andrew Chan <[email protected]>
Co-authored-by: fxmarty-amd <[email protected]>
Co-authored-by: Andrew Sansom <[email protected]>
Co-authored-by: Zhiyu <[email protected]>
Co-authored-by: Shu Wang <[email protected]>
Co-authored-by: XIn Li <[email protected]>
Co-authored-by: Junhao Li <[email protected]>
Co-authored-by: Chauncey <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: Hong Hanh <[email protected]>
Co-authored-by: Daniel Serebrenik <[email protected]>
Co-authored-by: yewentao256 <[email protected]>
Co-authored-by: Guy Stone <[email protected]>
Co-authored-by: yyweiss <[email protected]>
Co-authored-by: Pradyun92 <[email protected]>
Co-authored-by: Pradyun Ramadorai <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
djmmoss pushed a commit to djmmoss/vllm that referenced this pull request Aug 21, 2025
…llm-project#22428)

Signed-off-by: rongfu.leng <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Animesh Jain <[email protected]>
Signed-off-by: Rui Qiao <[email protected]>
Signed-off-by: Xiongfei Wei <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: kf <[email protected]>
Signed-off-by: vllmellm <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Dipika Sikka <[email protected]>
Signed-off-by: Sage Moore <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
Signed-off-by: Chih-Chieh Yang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Signed-off-by: yan <[email protected]>
Signed-off-by: Yan Ma <[email protected]>
Signed-off-by: Xiao Liu <[email protected]>
Signed-off-by: jiahanc <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: LopezCastroRoberto <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Haibin Lin <[email protected]>
Signed-off-by: David Ben-David <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
Signed-off-by: zitian.zhao <[email protected]>
Signed-off-by: 22quinn <[email protected]>
Signed-off-by: Abirdcfly <[email protected]>
Signed-off-by: Giancarlo Delfin <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: huangweixiao <[email protected]>
Signed-off-by: alyosha-swamy <[email protected]>
Signed-off-by: Eric Hanley <[email protected]>
Signed-off-by: Abatom <[email protected]>
Signed-off-by: CLFutureX <[email protected]>
Signed-off-by: Linkun Chen <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: tlipoca9 <[email protected]>
Signed-off-by: elvischenv <[email protected]>
Signed-off-by: zitian zhao <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: wang.yuqi <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Siyuan Liu <[email protected]>
Signed-off-by: Benjamin Chislett <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Signed-off-by: LucasWilkinson <[email protected]>
Signed-off-by: Zhang Jason <[email protected]>
Signed-off-by: Yongye Zhu <[email protected]>
Signed-off-by: asafg <[email protected]>
Signed-off-by: Siyuan Fu <[email protected]>
Signed-off-by: Lain <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Tao He <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: QscQ <[email protected]>
Signed-off-by: qingjun <[email protected]>
Signed-off-by: Syed Muhammad Bin Asif <[email protected]>
Signed-off-by: Lionel Villard <[email protected]>
Signed-off-by: ycyaw66 <[email protected]>
Signed-off-by: David Chen <[email protected]>
Signed-off-by: Linkun <[email protected]>
Signed-off-by: Moritz Sanft <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
Signed-off-by: Adrian Garcia <[email protected]>
Signed-off-by: shaojunqi <[email protected]>
Signed-off-by: Ricardo Decal <[email protected]>
Signed-off-by: Andrew Chan <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: Andrew Sansom <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Po-Han Huang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Signed-off-by: XIn Li <[email protected]>
Signed-off-by: Junhao Li <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: Guy Stone <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: yyw <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: Pradyun Ramadorai <[email protected]>
Signed-off-by: Pradyun92 <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Co-authored-by: rongfu.leng <[email protected]>
Co-authored-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Harry Mellor <[email protected]>
Co-authored-by: Jee Jee Li <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Animesh Jain <[email protected]>
Co-authored-by: Rui Qiao <[email protected]>
Co-authored-by: XiongfeiWei <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Wentao Ye <[email protected]>
Co-authored-by: JartX <[email protected]>
Co-authored-by: fhl2000 <[email protected]>
Co-authored-by: vllmellm <[email protected]>
Co-authored-by: kf <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
Co-authored-by: tjtanaavllm <[email protected]>
Co-authored-by: Yong Hoon Shin <[email protected]>
Co-authored-by: Chih-Chieh Yang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Vadim Gimpelson <[email protected]>
Co-authored-by: Yuxuan Zhang <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Thomas Parnell <[email protected]>
Co-authored-by: Yan Ma <[email protected]>
Co-authored-by: Xiao <[email protected]>
Co-authored-by: jiahanc <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Co-authored-by: Roberto L. Castro <[email protected]>
Co-authored-by: Ning Xie <[email protected]>
Co-authored-by: H <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Li, Jiang <[email protected]>
Co-authored-by: TankNee <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Seiji Eicher <[email protected]>
Co-authored-by: ZiTian.Zhao <[email protected]>
Co-authored-by: 22quinn <[email protected]>
Co-authored-by: Abirdcfly <[email protected]>
Co-authored-by: Giancarlo Delfin <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Weixiao Huang <[email protected]>
Co-authored-by: Raghav Ravishankar <[email protected]>
Co-authored-by: ericehanley <[email protected]>
Co-authored-by: Zhonghua Deng <[email protected]>
Co-authored-by: Po-Han Huang (NVIDIA) <[email protected]>
Co-authored-by: PiteXChen <[email protected]>
Co-authored-by: lkchen <[email protected]>
Co-authored-by: TJian <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: tlipoca9 <[email protected]>
Co-authored-by: elvischenv <[email protected]>
Co-authored-by: wang.yuqi <[email protected]>
Co-authored-by: Benji Beck <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: Siyuan Liu <[email protected]>
Co-authored-by: Benjamin Chislett <[email protected]>
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chen Zhang <[email protected]>
Co-authored-by: Hongxia Yang <[email protected]>
Co-authored-by: Minseok Lee <[email protected]>
Co-authored-by: Yongye Zhu <[email protected]>
Co-authored-by: Lucas Wilkinson <[email protected]>
Co-authored-by: Zhang Jason <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]>
Co-authored-by: asafg <[email protected]>
Co-authored-by: Lain <[email protected]>
Co-authored-by: tc-mb <[email protected]>
Co-authored-by: imning3 <[email protected]>
Co-authored-by: Maximilien de Bayser <[email protected]>
Co-authored-by: Kunshang Ji <[email protected]>
Co-authored-by: Tao He <[email protected]>
Co-authored-by: qscqesze <[email protected]>
Co-authored-by: Syed Muhammad Bin Asif <[email protected]>
Co-authored-by: Lionel Villard <[email protected]>
Co-authored-by: WeiQing Chen <[email protected]>
Co-authored-by: ycyaw66 <[email protected]>
Co-authored-by: Moritz Sanft <[email protected]>
Co-authored-by: Ming Yang <[email protected]>
Co-authored-by: Adrián García García <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: JaceyShao <[email protected]>
Co-authored-by: shaojunqi <[email protected]>
Co-authored-by: Ricardo Decal <[email protected]>
Co-authored-by: Andrew Chan <[email protected]>
Co-authored-by: fxmarty-amd <[email protected]>
Co-authored-by: Andrew Sansom <[email protected]>
Co-authored-by: Zhiyu <[email protected]>
Co-authored-by: Shu Wang <[email protected]>
Co-authored-by: XIn Li <[email protected]>
Co-authored-by: Junhao Li <[email protected]>
Co-authored-by: Chauncey <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: Hong Hanh <[email protected]>
Co-authored-by: Daniel Serebrenik <[email protected]>
Co-authored-by: yewentao256 <[email protected]>
Co-authored-by: Guy Stone <[email protected]>
Co-authored-by: yyweiss <[email protected]>
Co-authored-by: Pradyun92 <[email protected]>
Co-authored-by: Pradyun Ramadorai <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025
…llm-project#22428)

Signed-off-by: rongfu.leng <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Animesh Jain <[email protected]>
Signed-off-by: Rui Qiao <[email protected]>
Signed-off-by: Xiongfei Wei <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: kf <[email protected]>
Signed-off-by: vllmellm <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Dipika Sikka <[email protected]>
Signed-off-by: Sage Moore <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
Signed-off-by: Chih-Chieh Yang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Signed-off-by: yan <[email protected]>
Signed-off-by: Yan Ma <[email protected]>
Signed-off-by: Xiao Liu <[email protected]>
Signed-off-by: jiahanc <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: LopezCastroRoberto <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Haibin Lin <[email protected]>
Signed-off-by: David Ben-David <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
Signed-off-by: zitian.zhao <[email protected]>
Signed-off-by: 22quinn <[email protected]>
Signed-off-by: Abirdcfly <[email protected]>
Signed-off-by: Giancarlo Delfin <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: huangweixiao <[email protected]>
Signed-off-by: alyosha-swamy <[email protected]>
Signed-off-by: Eric Hanley <[email protected]>
Signed-off-by: Abatom <[email protected]>
Signed-off-by: CLFutureX <[email protected]>
Signed-off-by: Linkun Chen <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: tlipoca9 <[email protected]>
Signed-off-by: elvischenv <[email protected]>
Signed-off-by: zitian zhao <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: wang.yuqi <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Siyuan Liu <[email protected]>
Signed-off-by: Benjamin Chislett <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Signed-off-by: LucasWilkinson <[email protected]>
Signed-off-by: Zhang Jason <[email protected]>
Signed-off-by: Yongye Zhu <[email protected]>
Signed-off-by: asafg <[email protected]>
Signed-off-by: Siyuan Fu <[email protected]>
Signed-off-by: Lain <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Tao He <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: QscQ <[email protected]>
Signed-off-by: qingjun <[email protected]>
Signed-off-by: Syed Muhammad Bin Asif <[email protected]>
Signed-off-by: Lionel Villard <[email protected]>
Signed-off-by: ycyaw66 <[email protected]>
Signed-off-by: David Chen <[email protected]>
Signed-off-by: Linkun <[email protected]>
Signed-off-by: Moritz Sanft <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
Signed-off-by: Adrian Garcia <[email protected]>
Signed-off-by: shaojunqi <[email protected]>
Signed-off-by: Ricardo Decal <[email protected]>
Signed-off-by: Andrew Chan <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: Andrew Sansom <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Po-Han Huang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Signed-off-by: XIn Li <[email protected]>
Signed-off-by: Junhao Li <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: Guy Stone <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: yyw <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: Pradyun Ramadorai <[email protected]>
Signed-off-by: Pradyun92 <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Co-authored-by: rongfu.leng <[email protected]>
Co-authored-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Harry Mellor <[email protected]>
Co-authored-by: Jee Jee Li <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Animesh Jain <[email protected]>
Co-authored-by: Rui Qiao <[email protected]>
Co-authored-by: XiongfeiWei <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Wentao Ye <[email protected]>
Co-authored-by: JartX <[email protected]>
Co-authored-by: fhl2000 <[email protected]>
Co-authored-by: vllmellm <[email protected]>
Co-authored-by: kf <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
Co-authored-by: tjtanaavllm <[email protected]>
Co-authored-by: Yong Hoon Shin <[email protected]>
Co-authored-by: Chih-Chieh Yang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Vadim Gimpelson <[email protected]>
Co-authored-by: Yuxuan Zhang <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Thomas Parnell <[email protected]>
Co-authored-by: Yan Ma <[email protected]>
Co-authored-by: Xiao <[email protected]>
Co-authored-by: jiahanc <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Co-authored-by: Roberto L. Castro <[email protected]>
Co-authored-by: Ning Xie <[email protected]>
Co-authored-by: H <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Li, Jiang <[email protected]>
Co-authored-by: TankNee <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Seiji Eicher <[email protected]>
Co-authored-by: ZiTian.Zhao <[email protected]>
Co-authored-by: 22quinn <[email protected]>
Co-authored-by: Abirdcfly <[email protected]>
Co-authored-by: Giancarlo Delfin <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Weixiao Huang <[email protected]>
Co-authored-by: Raghav Ravishankar <[email protected]>
Co-authored-by: ericehanley <[email protected]>
Co-authored-by: Zhonghua Deng <[email protected]>
Co-authored-by: Po-Han Huang (NVIDIA) <[email protected]>
Co-authored-by: PiteXChen <[email protected]>
Co-authored-by: lkchen <[email protected]>
Co-authored-by: TJian <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: tlipoca9 <[email protected]>
Co-authored-by: elvischenv <[email protected]>
Co-authored-by: wang.yuqi <[email protected]>
Co-authored-by: Benji Beck <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: Siyuan Liu <[email protected]>
Co-authored-by: Benjamin Chislett <[email protected]>
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chen Zhang <[email protected]>
Co-authored-by: Hongxia Yang <[email protected]>
Co-authored-by: Minseok Lee <[email protected]>
Co-authored-by: Yongye Zhu <[email protected]>
Co-authored-by: Lucas Wilkinson <[email protected]>
Co-authored-by: Zhang Jason <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]>
Co-authored-by: asafg <[email protected]>
Co-authored-by: Lain <[email protected]>
Co-authored-by: tc-mb <[email protected]>
Co-authored-by: imning3 <[email protected]>
Co-authored-by: Maximilien de Bayser <[email protected]>
Co-authored-by: Kunshang Ji <[email protected]>
Co-authored-by: Tao He <[email protected]>
Co-authored-by: qscqesze <[email protected]>
Co-authored-by: Syed Muhammad Bin Asif <[email protected]>
Co-authored-by: Lionel Villard <[email protected]>
Co-authored-by: WeiQing Chen <[email protected]>
Co-authored-by: ycyaw66 <[email protected]>
Co-authored-by: Moritz Sanft <[email protected]>
Co-authored-by: Ming Yang <[email protected]>
Co-authored-by: Adrián García García <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: JaceyShao <[email protected]>
Co-authored-by: shaojunqi <[email protected]>
Co-authored-by: Ricardo Decal <[email protected]>
Co-authored-by: Andrew Chan <[email protected]>
Co-authored-by: fxmarty-amd <[email protected]>
Co-authored-by: Andrew Sansom <[email protected]>
Co-authored-by: Zhiyu <[email protected]>
Co-authored-by: Shu Wang <[email protected]>
Co-authored-by: XIn Li <[email protected]>
Co-authored-by: Junhao Li <[email protected]>
Co-authored-by: Chauncey <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: Hong Hanh <[email protected]>
Co-authored-by: Daniel Serebrenik <[email protected]>
Co-authored-by: yewentao256 <[email protected]>
Co-authored-by: Guy Stone <[email protected]>
Co-authored-by: yyweiss <[email protected]>
Co-authored-by: Pradyun92 <[email protected]>
Co-authored-by: Pradyun Ramadorai <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025
…llm-project#22428)

Signed-off-by: rongfu.leng <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Animesh Jain <[email protected]>
Signed-off-by: Rui Qiao <[email protected]>
Signed-off-by: Xiongfei Wei <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: kf <[email protected]>
Signed-off-by: vllmellm <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Dipika Sikka <[email protected]>
Signed-off-by: Sage Moore <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
Signed-off-by: Chih-Chieh Yang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Signed-off-by: yan <[email protected]>
Signed-off-by: Yan Ma <[email protected]>
Signed-off-by: Xiao Liu <[email protected]>
Signed-off-by: jiahanc <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: LopezCastroRoberto <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Haibin Lin <[email protected]>
Signed-off-by: David Ben-David <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
Signed-off-by: zitian.zhao <[email protected]>
Signed-off-by: 22quinn <[email protected]>
Signed-off-by: Abirdcfly <[email protected]>
Signed-off-by: Giancarlo Delfin <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: huangweixiao <[email protected]>
Signed-off-by: alyosha-swamy <[email protected]>
Signed-off-by: Eric Hanley <[email protected]>
Signed-off-by: Abatom <[email protected]>
Signed-off-by: CLFutureX <[email protected]>
Signed-off-by: Linkun Chen <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: tlipoca9 <[email protected]>
Signed-off-by: elvischenv <[email protected]>
Signed-off-by: zitian zhao <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: wang.yuqi <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Siyuan Liu <[email protected]>
Signed-off-by: Benjamin Chislett <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Signed-off-by: LucasWilkinson <[email protected]>
Signed-off-by: Zhang Jason <[email protected]>
Signed-off-by: Yongye Zhu <[email protected]>
Signed-off-by: asafg <[email protected]>
Signed-off-by: Siyuan Fu <[email protected]>
Signed-off-by: Lain <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Tao He <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: QscQ <[email protected]>
Signed-off-by: qingjun <[email protected]>
Signed-off-by: Syed Muhammad Bin Asif <[email protected]>
Signed-off-by: Lionel Villard <[email protected]>
Signed-off-by: ycyaw66 <[email protected]>
Signed-off-by: David Chen <[email protected]>
Signed-off-by: Linkun <[email protected]>
Signed-off-by: Moritz Sanft <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
Signed-off-by: Adrian Garcia <[email protected]>
Signed-off-by: shaojunqi <[email protected]>
Signed-off-by: Ricardo Decal <[email protected]>
Signed-off-by: Andrew Chan <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: Andrew Sansom <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Po-Han Huang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Signed-off-by: XIn Li <[email protected]>
Signed-off-by: Junhao Li <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: Guy Stone <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: yyw <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: Pradyun Ramadorai <[email protected]>
Signed-off-by: Pradyun92 <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Co-authored-by: rongfu.leng <[email protected]>
Co-authored-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Harry Mellor <[email protected]>
Co-authored-by: Jee Jee Li <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Animesh Jain <[email protected]>
Co-authored-by: Rui Qiao <[email protected]>
Co-authored-by: XiongfeiWei <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Wentao Ye <[email protected]>
Co-authored-by: JartX <[email protected]>
Co-authored-by: fhl2000 <[email protected]>
Co-authored-by: vllmellm <[email protected]>
Co-authored-by: kf <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
Co-authored-by: tjtanaavllm <[email protected]>
Co-authored-by: Yong Hoon Shin <[email protected]>
Co-authored-by: Chih-Chieh Yang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Vadim Gimpelson <[email protected]>
Co-authored-by: Yuxuan Zhang <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Thomas Parnell <[email protected]>
Co-authored-by: Yan Ma <[email protected]>
Co-authored-by: Xiao <[email protected]>
Co-authored-by: jiahanc <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Co-authored-by: Roberto L. Castro <[email protected]>
Co-authored-by: Ning Xie <[email protected]>
Co-authored-by: H <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Li, Jiang <[email protected]>
Co-authored-by: TankNee <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Seiji Eicher <[email protected]>
Co-authored-by: ZiTian.Zhao <[email protected]>
Co-authored-by: 22quinn <[email protected]>
Co-authored-by: Abirdcfly <[email protected]>
Co-authored-by: Giancarlo Delfin <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Weixiao Huang <[email protected]>
Co-authored-by: Raghav Ravishankar <[email protected]>
Co-authored-by: ericehanley <[email protected]>
Co-authored-by: Zhonghua Deng <[email protected]>
Co-authored-by: Po-Han Huang (NVIDIA) <[email protected]>
Co-authored-by: PiteXChen <[email protected]>
Co-authored-by: lkchen <[email protected]>
Co-authored-by: TJian <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: tlipoca9 <[email protected]>
Co-authored-by: elvischenv <[email protected]>
Co-authored-by: wang.yuqi <[email protected]>
Co-authored-by: Benji Beck <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: Siyuan Liu <[email protected]>
Co-authored-by: Benjamin Chislett <[email protected]>
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chen Zhang <[email protected]>
Co-authored-by: Hongxia Yang <[email protected]>
Co-authored-by: Minseok Lee <[email protected]>
Co-authored-by: Yongye Zhu <[email protected]>
Co-authored-by: Lucas Wilkinson <[email protected]>
Co-authored-by: Zhang Jason <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]>
Co-authored-by: asafg <[email protected]>
Co-authored-by: Lain <[email protected]>
Co-authored-by: tc-mb <[email protected]>
Co-authored-by: imning3 <[email protected]>
Co-authored-by: Maximilien de Bayser <[email protected]>
Co-authored-by: Kunshang Ji <[email protected]>
Co-authored-by: Tao He <[email protected]>
Co-authored-by: qscqesze <[email protected]>
Co-authored-by: Syed Muhammad Bin Asif <[email protected]>
Co-authored-by: Lionel Villard <[email protected]>
Co-authored-by: WeiQing Chen <[email protected]>
Co-authored-by: ycyaw66 <[email protected]>
Co-authored-by: Moritz Sanft <[email protected]>
Co-authored-by: Ming Yang <[email protected]>
Co-authored-by: Adrián García García <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: JaceyShao <[email protected]>
Co-authored-by: shaojunqi <[email protected]>
Co-authored-by: Ricardo Decal <[email protected]>
Co-authored-by: Andrew Chan <[email protected]>
Co-authored-by: fxmarty-amd <[email protected]>
Co-authored-by: Andrew Sansom <[email protected]>
Co-authored-by: Zhiyu <[email protected]>
Co-authored-by: Shu Wang <[email protected]>
Co-authored-by: XIn Li <[email protected]>
Co-authored-by: Junhao Li <[email protected]>
Co-authored-by: Chauncey <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: Hong Hanh <[email protected]>
Co-authored-by: Daniel Serebrenik <[email protected]>
Co-authored-by: yewentao256 <[email protected]>
Co-authored-by: Guy Stone <[email protected]>
Co-authored-by: yyweiss <[email protected]>
Co-authored-by: Pradyun92 <[email protected]>
Co-authored-by: Pradyun Ramadorai <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Signed-off-by: Xiao Yu <[email protected]>
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025
…llm-project#22428)

Signed-off-by: rongfu.leng <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Signed-off-by: Huzaifa Sidhpurwala <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Animesh Jain <[email protected]>
Signed-off-by: Rui Qiao <[email protected]>
Signed-off-by: Xiongfei Wei <[email protected]>
Signed-off-by: Nick Hill <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: kf <[email protected]>
Signed-off-by: vllmellm <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Dipika Sikka <[email protected]>
Signed-off-by: Sage Moore <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: Yong Hoon Shin <[email protected]>
Signed-off-by: Chih-Chieh-Yang <[email protected]>
Signed-off-by: Roger Wang <[email protected]>
Signed-off-by: Vadim Gimpelson <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
Signed-off-by: Chih-Chieh Yang <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Thomas Parnell <[email protected]>
Signed-off-by: yan <[email protected]>
Signed-off-by: Yan Ma <[email protected]>
Signed-off-by: Xiao Liu <[email protected]>
Signed-off-by: jiahanc <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: LopezCastroRoberto <[email protected]>
Signed-off-by: Andy Xie <[email protected]>
Signed-off-by: Haibin Lin <[email protected]>
Signed-off-by: David Ben-David <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Seiji Eicher <[email protected]>
Signed-off-by: zitian.zhao <[email protected]>
Signed-off-by: 22quinn <[email protected]>
Signed-off-by: Abirdcfly <[email protected]>
Signed-off-by: Giancarlo Delfin <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: huangweixiao <[email protected]>
Signed-off-by: alyosha-swamy <[email protected]>
Signed-off-by: Eric Hanley <[email protected]>
Signed-off-by: Abatom <[email protected]>
Signed-off-by: CLFutureX <[email protected]>
Signed-off-by: Linkun Chen <[email protected]>
Signed-off-by: tjtanaa <[email protected]>
Signed-off-by: Gregory Shtrasberg <[email protected]>
Signed-off-by: tlipoca9 <[email protected]>
Signed-off-by: elvischenv <[email protected]>
Signed-off-by: zitian zhao <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: wang.yuqi <[email protected]>
Signed-off-by: Benji Beck <[email protected]>
Signed-off-by: Siyuan Liu <[email protected]>
Signed-off-by: Benjamin Chislett <[email protected]>
Signed-off-by: isotr0py <[email protected]>
Signed-off-by: Chen Zhang <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Signed-off-by: LucasWilkinson <[email protected]>
Signed-off-by: Zhang Jason <[email protected]>
Signed-off-by: Yongye Zhu <[email protected]>
Signed-off-by: asafg <[email protected]>
Signed-off-by: Siyuan Fu <[email protected]>
Signed-off-by: Lain <[email protected]>
Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Tao He <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: QscQ <[email protected]>
Signed-off-by: qingjun <[email protected]>
Signed-off-by: Syed Muhammad Bin Asif <[email protected]>
Signed-off-by: Lionel Villard <[email protected]>
Signed-off-by: ycyaw66 <[email protected]>
Signed-off-by: David Chen <[email protected]>
Signed-off-by: Linkun <[email protected]>
Signed-off-by: Moritz Sanft <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
Signed-off-by: Adrian Garcia <[email protected]>
Signed-off-by: shaojunqi <[email protected]>
Signed-off-by: Ricardo Decal <[email protected]>
Signed-off-by: Andrew Chan <[email protected]>
Signed-off-by: Felix Marty <[email protected]>
Signed-off-by: Andrew Sansom <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Shu Wang <[email protected]>
Signed-off-by: Po-Han Huang <[email protected]>
Signed-off-by: Shu Wang. <[email protected]>
Signed-off-by: XIn Li <[email protected]>
Signed-off-by: Junhao Li <[email protected]>
Signed-off-by: chaunceyjiang <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: iAmir97 <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: Guy Stone <[email protected]>
Signed-off-by: <[email protected]>
Signed-off-by: yyw <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: Pradyun Ramadorai <[email protected]>
Signed-off-by: Pradyun92 <[email protected]>
Signed-off-by: Jinzhen Lin <[email protected]>
Co-authored-by: rongfu.leng <[email protected]>
Co-authored-by: Huzaifa Sidhpurwala <[email protected]>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Russell Bryant <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Varun Sundar Rabindranath <[email protected]>
Co-authored-by: Harry Mellor <[email protected]>
Co-authored-by: Jee Jee Li <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: Animesh Jain <[email protected]>
Co-authored-by: Rui Qiao <[email protected]>
Co-authored-by: XiongfeiWei <[email protected]>
Co-authored-by: Nick Hill <[email protected]>
Co-authored-by: Wentao Ye <[email protected]>
Co-authored-by: JartX <[email protected]>
Co-authored-by: fhl2000 <[email protected]>
Co-authored-by: vllmellm <[email protected]>
Co-authored-by: kf <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Co-authored-by: Dipika Sikka <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
Co-authored-by: tjtanaavllm <[email protected]>
Co-authored-by: Yong Hoon Shin <[email protected]>
Co-authored-by: Chih-Chieh Yang <[email protected]>
Co-authored-by: Roger Wang <[email protected]>
Co-authored-by: Vadim Gimpelson <[email protected]>
Co-authored-by: Yuxuan Zhang <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Thomas Parnell <[email protected]>
Co-authored-by: Yan Ma <[email protected]>
Co-authored-by: Xiao <[email protected]>
Co-authored-by: jiahanc <[email protected]>
Co-authored-by: Isotr0py <[email protected]>
Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Co-authored-by: Roberto L. Castro <[email protected]>
Co-authored-by: Ning Xie <[email protected]>
Co-authored-by: H <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: David Ben-David <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
Co-authored-by: Li, Jiang <[email protected]>
Co-authored-by: TankNee <[email protected]>
Co-authored-by: Cyrus Leung <[email protected]>
Co-authored-by: Seiji Eicher <[email protected]>
Co-authored-by: ZiTian.Zhao <[email protected]>
Co-authored-by: 22quinn <[email protected]>
Co-authored-by: Abirdcfly <[email protected]>
Co-authored-by: Giancarlo Delfin <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Chenxi Yang <[email protected]>
Co-authored-by: Tyler Michael Smith <[email protected]>
Co-authored-by: Weixiao Huang <[email protected]>
Co-authored-by: Raghav Ravishankar <[email protected]>
Co-authored-by: ericehanley <[email protected]>
Co-authored-by: Zhonghua Deng <[email protected]>
Co-authored-by: Po-Han Huang (NVIDIA) <[email protected]>
Co-authored-by: PiteXChen <[email protected]>
Co-authored-by: lkchen <[email protected]>
Co-authored-by: TJian <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: tlipoca9 <[email protected]>
Co-authored-by: elvischenv <[email protected]>
Co-authored-by: wang.yuqi <[email protected]>
Co-authored-by: Benji Beck <[email protected]>
Co-authored-by: youkaichao <[email protected]>
Co-authored-by: Siyuan Liu <[email protected]>
Co-authored-by: Benjamin Chislett <[email protected]>
Co-authored-by: LiuXiaoxuanPKU <[email protected]>
Co-authored-by: simon-mo <[email protected]>
Co-authored-by: Chen Zhang <[email protected]>
Co-authored-by: Hongxia Yang <[email protected]>
Co-authored-by: Minseok Lee <[email protected]>
Co-authored-by: Yongye Zhu <[email protected]>
Co-authored-by: Lucas Wilkinson <[email protected]>
Co-authored-by: Zhang Jason <[email protected]>
Co-authored-by: Asaf Joseph Gardin <[email protected]>
Co-authored-by: asafg <[email protected]>
Co-authored-by: Lain <[email protected]>
Co-authored-by: tc-mb <[email protected]>
Co-authored-by: imning3 <[email protected]>
Co-authored-by: Maximilien de Bayser <[email protected]>
Co-authored-by: Kunshang Ji <[email protected]>
Co-authored-by: Tao He <[email protected]>
Co-authored-by: qscqesze <[email protected]>
Co-authored-by: Syed Muhammad Bin Asif <[email protected]>
Co-authored-by: Lionel Villard <[email protected]>
Co-authored-by: WeiQing Chen <[email protected]>
Co-authored-by: ycyaw66 <[email protected]>
Co-authored-by: Moritz Sanft <[email protected]>
Co-authored-by: Ming Yang <[email protected]>
Co-authored-by: Adrián García García <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: JaceyShao <[email protected]>
Co-authored-by: shaojunqi <[email protected]>
Co-authored-by: Ricardo Decal <[email protected]>
Co-authored-by: Andrew Chan <[email protected]>
Co-authored-by: fxmarty-amd <[email protected]>
Co-authored-by: Andrew Sansom <[email protected]>
Co-authored-by: Zhiyu <[email protected]>
Co-authored-by: Shu Wang <[email protected]>
Co-authored-by: XIn Li <[email protected]>
Co-authored-by: Junhao Li <[email protected]>
Co-authored-by: Chauncey <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: iAmir97 <[email protected]>
Co-authored-by: Hong Hanh <[email protected]>
Co-authored-by: Daniel Serebrenik <[email protected]>
Co-authored-by: yewentao256 <[email protected]>
Co-authored-by: Guy Stone <[email protected]>
Co-authored-by: yyweiss <[email protected]>
Co-authored-by: Pradyun92 <[email protected]>
Co-authored-by: Pradyun Ramadorai <[email protected]>
Co-authored-by: Nicolò Lucchesi <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/build performance Performance-related issues quantization ready ONLY add when PR is ready to merge/full CI is needed

Projects

None yet

Development

Successfully merging this pull request may close these issues.