expand coverage of gpt2 models by twaka · Pull Request #271 · vllm-project/vllm

twaka · 2023-06-27T03:22:06Z

Thanks for the amazing project!
I noticed that some gpt2 models name their weights differently from https://huggingface.co/gpt2

For example, DialoGPT-small and mGPT are saved with the `transformer.` prefix already prepended to each weight name, and their checkpoints also include `masked_bias`.

transformer.wte.weight
transformer.wpe.weight
transformer.h.0.attn.c_attn.weight
transformer.h.0.attn.c_attn.bias
transformer.h.0.attn.bias
transformer.h.0.attn.masked_bias
...

As a result, loading these models fails with errors such as KeyError: 'transformer.transformer.wte.weight'.
This PR fixes these model-loading errors by checking for an existing `transformer.` prefix and ignoring `masked_bias`.
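The idea can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the helper name `normalize_gpt2_weight_names` is hypothetical, and skipping `.attn.bias` (the causal-mask buffer in Hugging Face GPT-2 checkpoints) alongside `masked_bias` is an assumption of this sketch.

```python
from typing import Iterable, Iterator, Tuple


def normalize_gpt2_weight_names(
    weights: Iterable[Tuple[str, object]],
    prefix: str = "transformer.",
) -> Iterator[Tuple[str, object]]:
    """Yield (name, tensor) pairs with a consistent prefix (hypothetical helper)."""
    for name, tensor in weights:
        # Assumption: these entries are attention-mask buffers, not trained
        # parameters, so they are skipped instead of being loaded.
        if name.endswith(".attn.bias") or name.endswith(".attn.masked_bias"):
            continue
        # Prepend the prefix only when the checkpoint has not already done so,
        # which avoids names like 'transformer.transformer.wte.weight'.
        if not name.startswith(prefix):
            name = prefix + name
        yield name, tensor


# Usage with placeholder values standing in for tensors.
checkpoint = [
    ("transformer.wte.weight", 1),            # prefix already present
    ("wpe.weight", 2),                        # prefix missing
    ("transformer.h.0.attn.c_attn.bias", 3),  # real parameter, kept
    ("transformer.h.0.attn.bias", 4),         # mask buffer, skipped
    ("h.0.attn.masked_bias", 5),              # mask buffer, skipped
]
normalized = dict(normalize_gpt2_weight_names(checkpoint))
```

With this approach, checkpoints saved with or without the prefix end up with the same set of parameter names.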

zhuohan123

LGTM! Thanks for your contribution!

### What this PR does / why we need it?
Re-arch on tutorials, move single npu / multi npu / multi node to index.
- Unify docker run cmd
- Use dropdown to hide build from source installation doc
- Re-arch tutorials to include Qwen/QwQ/DeepSeek
- Make QwQ doc work

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
CI test

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

expand coverage of gpt2 model loading

e0785b5

zhuohan123 approved these changes Jun 27, 2023


zhuohan123 merged commit 4026a04 into vllm-project:main Jun 27, 2023

michaelfeil pushed a commit to michaelfeil/vllm that referenced this pull request Jul 1, 2023

expand coverage of gpt2 model loading (vllm-project#271)

270931b

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

expand coverage of gpt2 model loading (vllm-project#271)

9c9280a

yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024

[CI/Build] include NOTICE in package dist-info (vllm-project#271)

4fabcfc

includes the NOTICE file alongside the license files in the nm-vllm*.dist-info directory

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Sep 24, 2024

Attn MetaData dtype should be same as model dtype (vllm-project#271)

f858d43

Attn MetaData was hard coded to bfloat16, leading to a runtime error for float32 model instantiation.

mht-sharma pushed a commit to mht-sharma/vllm that referenced this pull request Dec 9, 2024

Update P3L.py (vllm-project#271)

8f3bf8b

use_beam_search is no longer in sampling params

zhuohan123 merged 1 commit into vllm-project:main from twaka:main
