
[ROCm][CI] Update GSM8K eval config to use fp8-and-mixed models list #37619

Merged
DarkLight1337 merged 1 commit into vllm-project:main from ROCm:akaratza_fix_lm_eval_models_mi355 on Mar 20, 2026


@AndreasKaratzas
Collaborator

Follow-up for:

Updated the GSM8K correctness eval in the AMD CI pipeline to use the models-mi3xx-fp8-and-mixed.txt config list instead of models-mi3xx-fp8.txt. This addresses the failure in mi325_2: LM Eval Small Models (B200-MI325).

Motivation: https://buildkite.com/vllm/amd-ci/builds/6701/steps/canvas?sid=019d07a7-1a87-47bb-866e-99e0fb04c660&tab=output

cc @kenroche

Signed-off-by: Andreas Karatzas <akaratza@amd.com>
@mergify mergify bot added the ci/build and rocm (Related to AMD ROCm) labels Mar 20, 2026
@github-project-automation github-project-automation bot moved this to Todo in AMD Mar 20, 2026
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request updates the GSM8K evaluation configuration in the AMD CI pipeline to use a new list of models, including FP8 and mixed-precision models, to address a CI failure. However, I've identified a critical issue with the file path for the new model list in the CI configuration. The specified path does not match the file's actual location, which will cause the CI job to fail. My review includes a suggested fix for this path.

@AndreasKaratzas AndreasKaratzas added the ready (ONLY add when PR is ready to merge/full CI is needed) label Mar 20, 2026
@AndreasKaratzas
Collaborator Author

Testing MI325 to see if issue is resolved (added ready label).

@AndreasKaratzas
Collaborator Author

@AndreasKaratzas AndreasKaratzas marked this pull request as ready for review March 20, 2026 05:47
@AndreasKaratzas AndreasKaratzas requested a review from mgoin as a code owner March 20, 2026 05:47
@DarkLight1337 DarkLight1337 merged commit 9cfd4eb into vllm-project:main Mar 20, 2026
14 of 15 checks passed
@github-project-automation github-project-automation bot moved this from Todo to Done in AMD Mar 20, 2026
@AndreasKaratzas AndreasKaratzas deleted the akaratza_fix_lm_eval_models_mi355 branch March 20, 2026 15:16
chooper26 pushed a commit to intellistream/vllm-hust that referenced this pull request Mar 21, 2026

Labels

ci/build, ready (ONLY add when PR is ready to merge/full CI is needed), rocm (Related to AMD ROCm)

Projects

Status: Done


2 participants