[Test][Accuracy] Add accuracy evaluation config for InternVL3_5-8B#3964
[Test][Accuracy] Add accuracy evaluation config for InternVL3_5-8B#3964MengqingCao merged 1 commit intovllm-project:mainfrom
Conversation
|
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according Contributing and Testing. |
There was a problem hiding this comment.
Code Review
This pull request adds a new test configuration file to evaluate the accuracy of the InternVL3_5-8B model. While the configuration is straightforward and appears to serve its purpose, it introduces a critical security vulnerability by enabling trust_remote_code. This setting can lead to arbitrary code execution in the CI environment. My review includes a critical comment detailing this risk and suggesting potential mitigations.
|
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
088b748 to
ed28694
Compare
Signed-off-by: gcanlin <canlinguosdu@gmail.com>
ed28694 to
2afecee
Compare
|
@MengqingCao CI has passed. Could you please merge this PR? |
…llm-project#3964) ### What this PR does / why we need it? To continuously monitor the accuracy of the InternVL3_5-8B model, this PR adds the corresponding configuration file to the CI. We need to add the `-hf` suffix to avoid incompatibility with the `lm-eval` preprocessor. ### How was this patch tested? `pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py --config ./tests/e2e/models/configs/InternVL3_5-8B.yaml` - vLLM version: v0.11.0 - vLLM main: vllm-project/vllm@83f478b Signed-off-by: gcanlin <canlinguosdu@gmail.com> Signed-off-by: luolun <luolun1995@cmbchina.com>
…llm-project#3964) ### What this PR does / why we need it? To continuously monitor the accuracy of the InternVL3_5-8B model, this PR adds the corresponding configuration file to the CI. We need to add the `-hf` suffix to avoid incompatibility with the `lm-eval` preprocessor. ### How was this patch tested? `pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py --config ./tests/e2e/models/configs/InternVL3_5-8B.yaml` - vLLM version: v0.11.0 - vLLM main: vllm-project/vllm@83f478b Signed-off-by: gcanlin <canlinguosdu@gmail.com> Signed-off-by: hwhaokun <haokun0405@163.com>
…llm-project#3964) ### What this PR does / why we need it? To continuously monitor the accuracy of the InternVL3_5-8B model, this PR adds the corresponding configuration file to the CI. We need to add the `-hf` suffix to avoid incompatibility with the `lm-eval` preprocessor. ### How was this patch tested? `pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py --config ./tests/e2e/models/configs/InternVL3_5-8B.yaml` - vLLM version: v0.11.0 - vLLM main: vllm-project/vllm@83f478b Signed-off-by: gcanlin <canlinguosdu@gmail.com> Signed-off-by: nsdie <yeyifan@huawei.com>
…llm-project#3964) ### What this PR does / why we need it? To continuously monitor the accuracy of the InternVL3_5-8B model, this PR adds the corresponding configuration file to the CI. We need to add the `-hf` suffix to avoid incompatibility with the `lm-eval` preprocessor. ### How was this patch tested? `pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py --config ./tests/e2e/models/configs/InternVL3_5-8B.yaml` - vLLM version: v0.11.0 - vLLM main: vllm-project/vllm@83f478b Signed-off-by: gcanlin <canlinguosdu@gmail.com>
What this PR does / why we need it?
To continuously monitor the accuracy of the InternVL3_5-8B model, this PR adds the corresponding configuration file to the CI. We need to add the
-hfsuffix to avoid incompatibility with thelm-evalpreprocessor.Does this PR introduce any user-facing change?
None.
How was this patch tested?
pytest -sv ./tests/e2e/models/test_lm_eval_correctness.py --config ./tests/e2e/models/configs/InternVL3_5-8B.yaml