Commit e44a6e5
authored
Merge branch 'main' into andy/v1-sd-with-probs
Signed-off-by: Andy Lo <[email protected]>
File tree
520 files changed (+18821, -11804 lines changed)
- .buildkite
- nightly-benchmarks
- scripts
- tests
- scripts
- hardware_ci
- .github
- workflows
- benchmarks
- auto_tune
- disagg_benchmarks
- kernels
- multi_turn
- cmake
- external_projects
- csrc
- attention/mla
- cpu
- mamba/mamba_ssm
- moe
- quantization
- cutlass_w4a8
- fp4
- machete
- docker
- docs
- community
- configuration
- contributing
- deployment
- frameworks
- integrations
- design
- features
- getting_started/installation
- cpu
- gpu
- mkdocs/hooks
- models
- serving
- usage
- examples
- offline_inference
- logits_processor
- online_serving
- prometheus_grafana
- requirements
- tests
- async_engine
- benchmarks
- compile
- piecewise
- core
- block/e2e
- distributed
- engine
- entrypoints
- llm
- offline_mode
- openai
- correctness
- kernels
- attention
- mamba
- moe
- quantization
- kv_transfer
- lora
- models
- language
- generation
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- multimodal
- neuron
- 1_core
- 2_core
- plugins_tests
- plugins/prithvi_io_processor_plugin
- prithvi_io_processor
- quantization
- samplers
- tool_use
- tpu
- utils_
- v1
- attention
- core
- cudagraph
- e2e
- engine
- entrypoints
- llm
- openai/responses
- executor
- kv_connector/unit
- logits_processors
- metrics
- sample
- spec_decode
- tpu
- worker
- tools
- profiler
- vllm
- assets
- attention
- backends
- mla
- ops
- utils
- benchmarks
- lib
- compilation
- config
- distributed
- device_communicators
- kv_transfer
- kv_connector
- v1
- p2p
- kv_pipe
- engine
- multiprocessing
- entrypoints
- openai
- tool_parsers
- executor
- inputs
- lora
- punica_wrapper
- model_executor
- layers
- fused_moe
- configs
- mamba
- ops
- quantization
- compressed_tensors
- transform
- quark
- utils
- rotary_embedding
- model_loader
- models
- multimodal
- platforms
- plugins/io_processors
- reasoning
- third_party
- transformers_utils
- configs
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- metrics
- sample
- logits_processor
- ops
- tpu
- spec_decode
- structured_output
- worker
- worker