Skip to content

Commit 2ee475a

Browse files
committed
Update config.
Signed-off-by: Yuxian Qiu <[email protected]>
1 parent 37f5d3c commit 2ee475a

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/models/core/deepseek_v3/deployment-guide-for-trt-llm-deepseek-r1.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -78,7 +78,7 @@ cat << EOF > ${EXTRA_LLM_API_FILE}
7878
enable_attention_dp: true
7979
cuda_graph_config:
8080
enable_padding: true
81-
max_batch_size: 1024
81+
max_batch_size: 128
8282
kv_cache_config:
8383
dtype: fp8
8484
stream_interval: 10

0 commit comments

Comments
 (0)