
[Test][Accuracy] Add accuracy evaluation config for InternVL3_5-8B #3964

Merged
MengqingCao merged 1 commit into vllm-project:main from gcanlin:acc-internvl3_5 on Nov 12, 2025
Conversation

@gcanlin (Collaborator) commented Nov 4, 2025

What this PR does / why we need it?

To continuously monitor the accuracy of the InternVL3_5-8B model, this PR adds the corresponding configuration file to the CI. We need to add the `-hf` suffix to avoid incompatibility with the `lm-eval` preprocessor.

Does this PR introduce any user-facing change?

None.

How was this patch tested?

`pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py --config ./tests/e2e/models/configs/InternVL3_5-8B.yaml`
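The PR body does not inline the new file. Based on how other lm-eval correctness configs under `tests/e2e/models/configs/` are typically laid out, it plausibly resembles the sketch below; the model ID, task name, and threshold are illustrative assumptions, not the merged contents:

```yaml
# Sketch of tests/e2e/models/configs/InternVL3_5-8B.yaml (illustrative values).
# The `-hf` suffix on the model ID selects the HF-format checkpoint that the
# lm-eval preprocessor can handle, per the PR description.
model_name: "OpenGVLab/InternVL3_5-8B-hf"
tasks:
- name: "gsm8k"
  metrics:
  - name: "exact_match,strict-match"
    value: 0.74   # placeholder accuracy floor, not a measured number
```

The harness then asserts that the measured metric stays at or above the recorded value, turning accuracy drift into a CI failure.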

github-actions bot (Contributor) commented Nov 4, 2025

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:

  • A PR should do only one thing; smaller PRs enable faster reviews.
  • Every PR should include unit tests and end-to-end tests to ensure it works and is not broken by future PRs.
  • Write the commit message by filling out the PR description to help reviewers and future developers understand.

If CI fails, you can run the linting and testing checks locally according to Contributing and Testing.

gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request adds a new test configuration file to evaluate the accuracy of the InternVL3_5-8B model. While the configuration is straightforward and appears to serve its purpose, it introduces a critical security vulnerability by enabling trust_remote_code. This setting can lead to arbitrary code execution in the CI environment. My review includes a critical comment detailing this risk and suggesting potential mitigations.

Comment thread tests/e2e/models/configs/InternVL3_5-8B.yaml
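The bot's concern is that `trust_remote_code` lets the Hub repository ship arbitrary Python that runs in CI. A common mitigation is to allow it only when the model revision is pinned to an immutable commit SHA, so the remote code that executes is fixed and auditable. The helper below is a hedged sketch of that policy; the function name and the kwargs shape are assumptions for illustration, not part of this PR or of vLLM's API:

```python
import re

def safe_model_kwargs(model_name, revision=None, trust_remote_code=False):
    """Build loader kwargs, refusing unpinned remote code.

    Tags and branches are mutable, so only a full 40-character git commit
    SHA counts as a pin: the code fetched for that revision can never
    silently change underneath the CI job.
    """
    if trust_remote_code:
        if revision is None or not re.fullmatch(r"[0-9a-f]{40}", revision):
            raise ValueError(
                "trust_remote_code requires pinning `revision` "
                "to a full commit SHA"
            )
    kwargs = {"model": model_name, "trust_remote_code": trust_remote_code}
    if revision is not None:
        kwargs["revision"] = revision
    return kwargs
```

With a guard like this, a config that sets `trust_remote_code: true` without a pinned commit fails fast at load time instead of executing whatever the repository currently serves.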
github-actions bot (Contributor) commented Nov 4, 2025

This pull request has conflicts, please resolve those before we can evaluate the pull request.

Signed-off-by: gcanlin <canlinguosdu@gmail.com>
@MengqingCao added the ready (read for review), ready-for-test (start test by label for PR), and accuracy-test (enable all accuracy test for PR) labels, and removed the ready (read for review) label, on Nov 8, 2025
@MengqingCao (Collaborator) left a comment


LGTM

@gcanlin (Collaborator, Author) commented Nov 12, 2025

@MengqingCao CI has passed. Could you please merge this PR?

@MengqingCao MengqingCao merged commit 1c677c3 into vllm-project:main Nov 12, 2025
60 of 61 checks passed
luolun pushed a commit to luolun/vllm-ascend that referenced this pull request on Nov 19, 2025
…llm-project#3964)

- vLLM version: v0.11.0
- vLLM main: vllm-project/vllm@83f478b

Signed-off-by: gcanlin <canlinguosdu@gmail.com>
Signed-off-by: luolun <luolun1995@cmbchina.com>
hwhaokun pushed a commit to hwhaokun/vllm-ascend that referenced this pull request on Nov 19, 2025
…llm-project#3964)

Signed-off-by: gcanlin <canlinguosdu@gmail.com>
Signed-off-by: hwhaokun <haokun0405@163.com>
NSDie pushed a commit to NSDie/vllm-ascend that referenced this pull request on Nov 24, 2025
…llm-project#3964)

Signed-off-by: gcanlin <canlinguosdu@gmail.com>
Signed-off-by: nsdie <yeyifan@huawei.com>
Clorist33 pushed a commit to Clorist33/vllm-ascend that referenced this pull request on Dec 10, 2025
…llm-project#3964)

Signed-off-by: gcanlin <canlinguosdu@gmail.com>

Labels

accuracy-test (enable all accuracy test for PR), module:tests, ready (read for review), ready-for-test (start test by label for PR)
