Commit 682c6b6

Update GPU memory utilization
Signed-off-by: luka <[email protected]>
1 parent 4e92ff8 commit 682c6b6

1 file changed, +1 -2 lines changed

tests/compile/piecewise/test_full_cudagraph.py

Lines changed: 1 addition & 2 deletions
@@ -49,8 +49,7 @@ def llm_pair(request):
     )
     piecewise = LLM(
         model=model,
-        # TODO set to 0.45 after V1 GPU model runner is fixed.
-        gpu_memory_utilization=0.9,
+        gpu_memory_utilization=0.45,
         trust_remote_code=True,
         max_model_len=1024,
         compilation_config=CompilationConfig(),
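
Note on the new value: `gpu_memory_utilization` is the fraction of GPU memory an engine may claim. The sketch below illustrates the budget arithmetic behind a value like 0.45. It assumes the `llm_pair` fixture keeps two engines alive on the same GPU at once, which the fixture name suggests but this hunk does not show. The helper `fits_on_one_gpu` and its `headroom` parameter are hypothetical and are not part of vLLM or of this test file.

```python
def fits_on_one_gpu(fractions, headroom=0.05):
    """Return True if the requested memory fractions can coexist on one GPU.

    `fractions` holds one gpu_memory_utilization value per engine.
    `headroom` is a hypothetical safety margin left unclaimed for
    everything else on the device (an assumption, not a vLLM setting).
    """
    for f in fractions:
        if not 0.0 < f <= 1.0:
            raise ValueError(f"fraction {f!r} must be in (0, 1]")
    # The engines fit only if their combined share leaves the margin free.
    return sum(fractions) <= 1.0 - headroom


# Two engines at 0.45 each claim 0.9 of the device in total.
print(fits_on_one_gpu([0.45, 0.45]))
# One engine at 0.9 next to one at 0.45 would oversubscribe the device.
print(fits_on_one_gpu([0.9, 0.45]))
```

Under these assumptions, two engines at 0.45 fit together while a 0.9 and 0.45 pair does not.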
