Skip to content

Commit cc30ccf

Browse files
b8zhongxwu-intel
authored andcommitted
[Llama4] Add docs note about enable multimodal (sgl-project#6235)
1 parent 47ccebb commit cc30ccf

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

docs/references/llama4.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ python3 -m sglang.launch_server --model-path meta-llama/Llama-4-Scout-17B-16E-In
1919
- **OOM Mitigation**: Adjust `--context-length` to avoid a GPU out-of-memory issue. For the Scout model, we recommend setting this value up to 1M on 8\*H100 and up to 2.5M on 8\*H200. For the Maverick model, we don't need to set context length on 8\*H200.
2020

2121
- **Chat Template**: Add `--chat-template llama-4` for chat completion tasks.
22+
- **Enable Multi-Modal**: Add `--enable-multimodal` for multi-modal capabilities.
2223

2324
## Benchmarking Results
2425

0 commit comments

Comments
 (0)