Commit 51e98e4

[Bugfix] Disable prefix caching by default for benchmark (#18771)
Signed-off-by: cascade812 <[email protected]>
1 parent e56f44d commit 51e98e4

1 file changed: +1 -1 lines changed

vllm/benchmarks/latency.py

Lines changed: 1 addition & 1 deletion
@@ -82,7 +82,7 @@ def add_cli_args(parser: argparse.ArgumentParser):
     parser = EngineArgs.add_cli_args(parser)
     # V1 enables prefix caching by default which skews the latency
     # numbers. We need to disable prefix caching by default.
-    parser.set_defaults(enable_prefix_caching=True)
+    parser.set_defaults(enable_prefix_caching=False)
 
 
 def main(args: argparse.Namespace):
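
For reviewers who want to check the effect of the one-line change, here is a minimal sketch of how `set_defaults` interacts with a flag that is already registered on the parser. `add_engine_args` is a hypothetical stand-in for `EngineArgs.add_cli_args`, and the use of `argparse.BooleanOptionalAction` with `default=None` is an assumption; the real flag definition in vLLM may differ.

```python
import argparse


def add_engine_args(parser: argparse.ArgumentParser) -> argparse.ArgumentParser:
    # Hypothetical stand-in for EngineArgs.add_cli_args. The real vLLM
    # definition of this flag may use a different action or default.
    parser.add_argument(
        "--enable-prefix-caching",
        action=argparse.BooleanOptionalAction,
        default=None,
    )
    return parser


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    parser = add_engine_args(parser)
    # Parser-level defaults override the argument-level default, so the
    # benchmark starts with prefix caching off unless the user opts in.
    parser.set_defaults(enable_prefix_caching=False)
    return parser


# No flag given: the overridden default applies.
default_args = build_parser().parse_args([])
# Flag given explicitly: the command line still wins over the default.
explicit_args = build_parser().parse_args(["--enable-prefix-caching"])

print(default_args.enable_prefix_caching)
print(explicit_args.enable_prefix_caching)
```

Under these assumptions the first call yields `False` and the second yields `True`: the change only moves the default, and passing `--enable-prefix-caching` on the command line still turns the feature on for a benchmark run.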