
[ci] add pytorch kvint testcase into function regression #2584

Merged: 43 commits into InternLM:main on Oct 16, 2024

Conversation

@zhulinJulia24 (Collaborator) commented on Oct 11, 2024

after #2438

  1. [tested] add a pytorch kvint testcase
  2. [tested] review the test config against the supported models; most models supported by turbomind are compatible with kvint and 4-bit quantization. The configuration YAML file is modified to make it easier to maintain.
  3. [testing] add the llama3.2 model to the pytorch backend
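Point 2 above restructures the test configuration so that backend and quantization coverage live in one place. A minimal hypothetical sketch of such a layout (keys and model names here are illustrative, not the actual contents of autotest/config.yaml in this PR):

```yaml
# Illustrative sketch only -- not the PR's real config.
# Models are grouped by the engine that serves them, and quantization
# exceptions are listed once per backend instead of per test case.
turbomind_chat_model:
  - meta-llama/Meta-Llama-3-8B-Instruct
pytorch_chat_model:
  - meta-llama/Llama-3.2-3B-Instruct
turbomind_quantization:
  no_kvint4:
    - example-org/model-without-kvint4-support
```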

@zhulinJulia24 (Collaborator, Author) commented:

@lvhan028 Qwen2-VL is not supported by the turbomind backend.

The error is:

lmdeploy convert qwen /nvme/qa_test_models/Qwen/Qwen2-VL-2B-Instruct --dst-path /nvme/qa_test_models/autotest_model/workspace_Qwen/Qwen2-VL-2B-Instruct

2024-10-12 09:44:10,786 - lmdeploy - WARNING - converter.py:324 - The argument `<model_name>` is deprecated and unused now. It will be removed on 2024.12.31. It was originally used to specify the name of the built-in chat template, but now it is substituted with a clearer parameter `--chat-template`
Unrecognized keys in `rope_scaling` for 'rope_type'='default': {'mrope_section'}
Traceback (most recent call last):
  File "/opt/py3/bin/lmdeploy", line 8, in <module>
    sys.exit(run())
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/cli/entrypoint.py", line 42, in run
    args.run(args)
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/cli/cli.py", line 165, in convert
    main(**kwargs)
  File "/opt/py3/lib/python3.10/site-packages/lmdeploy/turbomind/deploy/converter.py", line 334, in main
    assert is_supported(model_path), (
AssertionError: turbomind does not support /nvme/qa_test_models/Qwen/Qwen2-VL-2B-Instruct. Plz try pytorch engine instead.
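The assertion above is raised by turbomind's converter, which rejects model architectures it cannot convert and suggests the pytorch engine instead. As a hypothetical simplified sketch (not lmdeploy's actual `is_supported` implementation), the backend-selection logic a test suite might apply looks like this; the support table below is illustrative only:

```python
# Hypothetical sketch: choose an engine backend for a model under test.
# The support set is illustrative, not lmdeploy's real data.

# Architectures the turbomind engine can convert (illustrative subset).
TURBOMIND_SUPPORTED = {"Qwen2ForCausalLM", "LlamaForCausalLM"}

def choose_backend(architecture: str) -> str:
    """Return 'turbomind' when the architecture is supported; otherwise
    fall back to the pytorch engine, as the error message advises."""
    if architecture in TURBOMIND_SUPPORTED:
        return "turbomind"
    return "pytorch"

print(choose_backend("Qwen2VLForConditionalGeneration"))  # pytorch
print(choose_backend("LlamaForCausalLM"))                 # turbomind
```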

Should we update this table?
(screenshot of the supported-models table)

Review thread on autotest/config.yaml (outdated, resolved).
@lvhan028 merged commit b689cbc into InternLM:main on Oct 16, 2024
4 checks passed