Commit e38137d
白永斌
Merge branch 'model_register' of https://github.com/dsxsteven/vllm_splitPR into model_register
* 'model_register' of https://github.com/dsxsteven/vllm_splitPR: (138 commits)
Retrieve `sliding_window` from text config in Gemma3 MM (vllm-project#25085)
[Docs] Fix API Reference (vllm-project#25140)
[Kernel] Better inf handling for grouped topk cu (vllm-project#24886)
[CLI] Use streaming in CLI chat and completion commands (vllm-project#23769)
[benchmark] add peak throughput metrics and plot (vllm-project#23867)
[Spec Decode] Efficient padded speculation (vllm-project#24539)
[V0 Deprecation] Remove more V0 tests (vllm-project#25117)
[EPLB] Add EPLB support for hunyuan_v1 (vllm-project#23078)
[XPU] Whisper model support on XPU Platform (vllm-project#25123)
Mark prompt logprobs as incompatible with prompt embeds at API level (vllm-project#25077)
[Model] enable data parallel for InternVL vision encoder (vllm-project#23909)
[Kernels] Overlap shared experts with combine instead of dispatch (vllm-project#24254)
[Bugfix][Qwen3-Next] add prefixes to shared_expert in qwen3-next and mlp in qwen2moe to successfully load ignored params in quantized models (vllm-project#24960)
[Core][MM] Cleanup `MultiModalCache` (vllm-project#25006)
[Docs] Clean up the contributing README (vllm-project#25099)
[MM Encoder] Apply DP ViT for Qwen3-VL model series (vllm-project#24955)
[Kernels] Enable DeepGEMM by default (vllm-project#24462)
[V0 Deprecation] Skip PP test (vllm-project#25128)
[V0 Deprecation] Remove misc V0 tests (vllm-project#25118)
[V0 Deprecation] Remove V0 Tracing & Metrics tests (vllm-project#25115)
...
File tree
526 files changed (+20833, -29953 lines)
- .buildkite
- nightly-benchmarks
- scripts
- hardware_ci
- .github
- benchmarks
- auto_tune
- kernels
- multi_turn
- csrc
- attention/mla
- cutlass_sm100_mla/device
- cpu
- moe
- quantization
- compressed_tensors
- cutlass_w4a8
- cutlass_w8a8/c3x
- fp4
- fp8
- fused_kernels
- hadamard/hadacore
- rocm
- docker
- docs
- configuration
- contributing
- model
- design
- features
- getting_started/installation
- cpu
- gpu
- models
- serving
- usage
- examples
- offline_inference
- logits_processor
- pooling
- online_serving
- dashboards
- grafana
- perses
- openai_embedding_long_text
- pooling
- requirements
- tests
- async_engine
- basic_correctness
- compile
- core
- block
- e2e
- detokenizer
- distributed
- encoder_decoder
- engine
- entrypoints
- openai
- correctness
- tool_parsers
- evals
- gpt_oss
- gsm8k/configs
- kernels
- attention
- moe
- modular_kernel_tools
- quantization
- lora
- metrics
- model_executor
- models
- language/generation
- multimodal
- generation
- pooling
- processing
- mq_llm_engine
- multimodal
- quantization
- samplers
- tokenization
- tracing
- utils_
- v1
- attention
- core
- e2e
- engine
- entrypoints/llm
- logits_processors
- spec_decode
- tpu/worker
- tracing
- worker
- worker
- tools
- vllm
- adapter_commons
- assets
- attention
- backends
- layers
- ops
- benchmarks
- lib
- compilation
- config
- distributed
- device_communicators
- eplb
- kv_transfer/kv_connector/v1
- engine
- multiprocessing
- output_processor
- entrypoints
- cli
- openai
- executor
- inputs
- lora
- layers
- model_executor
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- transform
- schemes
- quark
- utils
- rotary_embedding
- models
- warmup
- multimodal
- platforms
- plugins/io_processors
- transformers_utils
- chat_templates
- configs
- tokenizers
- triton_utils
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- metrics
- sample/logits_processor
- spec_decode
- structured_output
- worker
- worker