-
Notifications
You must be signed in to change notification settings - Fork 355
Switch the pytorch tests to run on the L4 runners #2195
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Dan Huang <[email protected]>
Signed-off-by: Dan Huang <[email protected]>
Signed-off-by: Dan Huang <[email protected]>
Signed-off-by: Dan Huang <[email protected]>
|
Note Gemini is unable to generate a summary for this pull request due to the file types involved not being currently supported. |
|
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed. |
dsikka
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we do this for the base tests as well?
I tried to run the base tests with L4 runners but they ran OOM so I still keep the base tests to run on H100 and only run the pytorch tests on L4. |
SUMMARY:
We have the GCP L4 runners back and expanded to multiple zones. Switching the pytorch tests to run on the L4 runners so the test jobs can be distributed on more runners and not only rely on the H100 runners. The base tests in the workflow ran OOM on L4 so we'll still run them on the H100.
TEST PLAN:
Had a successful run for the test-check.yaml workflow: https://github.com/vllm-project/llm-compressor/actions/runs/20787678167/job/59701632932