[Core] Increase default max_num_batched_tokens for multimodal models #20672
| Job | Run time |
|---|---|
| | 1m 49s |
| | 1m 36s |
| | 1m 36s |
| | 1m 4s |
| | 1m 7s |
| Total | 7m 12s |