-
-
Notifications
You must be signed in to change notification settings - Fork 18.7k
[ROCm][CI] Added MI325 mirrors (stage C) #35239
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
3bb985d
1bc1a45
c0e038e
439d4d1
f2455dc
d3d4ed0
05750df
a59425f
d7939a1
dfb8207
b6a5ea7
0fd15b1
3eefd34
10094ce
93e4651
ff531c3
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -86,6 +86,11 @@ steps: | |
| - python3 offline_inference/spec_decode.py --test --method eagle --num_spec_tokens 3 --dataset-name hf --dataset-path philschmid/mt-bench --num-prompts 80 --temp 0 --top-p 1.0 --top-k -1 --tp 1 --enable-chunked-prefill --max-model-len 2048 | ||
| # https://github.com/vllm-project/vllm/pull/26682 uses slightly more memory in PyTorch 2.9+ causing this test to OOM in 1xL4 GPU | ||
| - python3 offline_inference/spec_decode.py --test --method eagle3 --num_spec_tokens 3 --dataset-name hf --dataset-path philschmid/mt-bench --num-prompts 80 --temp 0 --top-p 1.0 --top-k -1 --tp 1 --enable-chunked-prefill --max-model-len 1536 | ||
| mirror: | ||
| amd: | ||
| device: mi325_1 | ||
| depends_on: | ||
| - image-build-amd | ||
|
Comment on lines
+90
to
+94
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This |
||
|
|
||
| - label: Metrics, Tracing (2 GPUs) | ||
| timeout_in_minutes: 20 | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -32,3 +32,8 @@ steps: | |
| - pytest -v -s entrypoints/openai/test_oot_registration.py # it needs a clean process | ||
| - pytest -v -s models/test_oot_registration.py # it needs a clean process | ||
| - pytest -v -s plugins/lora_resolvers # unit tests for in-tree lora resolver plugins | ||
| mirror: | ||
| amd: | ||
| device: mi325_2 | ||
| depends_on: | ||
| - image-build-amd | ||
|
Member
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'll see if I need |
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This
mirrorconfiguration is duplicated for the 'Entrypoints Integration (v1)' step later in this file (lines 90-94). To improve maintainability and reduce redundancy, consider using YAML anchors. This will make future updates to the mirror configuration easier and less error-prone.For example, you could define an anchor (e.g., at the top of the file, before
steps:) and then reference it: