fix-qwen2vl-no-position_ids by simonJJJ · Pull Request #33487 · huggingface/transformers

simonJJJ · 2024-09-14T10:13:02Z

What does this PR do?

We encountered some issues with position_ids==None when fine-tuning the qwen2vl model, because the model_inputs do not pass the position_ids arg.

Before submitting

[] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

ArthurZucker

Makes sense! Do you want to add a small test for training? 🤗

…one (#276) ## Summary When `position_ids` is None, we should call `get_rope_index` to create 3D rope index The code was copied from here: huggingface/transformers#33487. ## Testing Done I am using qwen2-vl to train the grounding task. The red box shows the results before fixing, and the green box shows the results after fixing (correct results). <img width="146" alt="WechatIMG4500_副本" src="https://github.com/user-attachments/assets/e7a42f89-e19b-4b53-b84a-0d62d981e54a"> - Hardware Type: 3090 - [x] run `make test` to ensure correctness - [x] run `make checkstyle` to ensure code style - [x] run `make test-convergence` to ensure convergence --------- Co-authored-by: Shao Tang <tangshao28@gmail.com>

zhangfaen · 2024-10-13T07:14:19Z

I am trying to finetune Qwen2-VL for Object detection on coco dataset ( https://github.com/zhangfaen/finetune-Qwen2-VL )

Without this PR, my finetuned model performance is bad.
With this PR (by cherry picking), my finetuned model performance is really good.

Any progress for this PR to be merged?

Thank you

zytx121 · 2024-10-14T13:07:34Z

I am trying to finetune Qwen2-VL for Object detection on coco dataset ( https://github.com/zhangfaen/finetune-Qwen2-VL )

Without this PR, my finetuned model performance is bad. With this PR (by cherry picking), my finetuned model performance is really good.

Any progress for this PR to be merged?

Thank you

@zhangfaen Strange, after I modified this code, the fine-tuning performance still did not improve. Is there anything else that needs to be modified?

zhangfaen · 2024-10-14T15:40:51Z

I am trying to finetune Qwen2-VL for Object detection on coco dataset ( https://github.com/zhangfaen/finetune-Qwen2-VL )
Without this PR, my finetuned model performance is bad. With this PR (by cherry picking), my finetuned model performance is really good.
Any progress for this PR to be merged?
Thank you

@zhangfaen Strange, after I modified this code, the fine-tuning performance still did not improve. Is there anything else that needs to be modified?

for what task and against what dataset are you finetuning qwen2-vl ?

simonJJJ · 2024-10-28T06:03:54Z

hi @ArthurZucker, would u mind taking a look at merging this PR? The community is waiting for it.

zhangfaen · 2024-10-28T07:28:34Z

hi @ArthurZucker, would u mind taking a look at merging this PR? The community is waiting for it.

+1 @ArthurZucker

ArthurZucker

Yep sorry I approved I don't know why it was lost in tracks!

Betty-J · 2024-11-14T03:02:49Z

Yep sorry I approved I don't know why it was lost in tracks!

Hi there, when I used pip install transformers, I installed version 4.46.2, which still does not include the updates mentioned here https://github.com/huggingface/transformers/blob/main/src/transformers/models/qwen2_vl/modeling_qwen2_vl.py#L1723.

fix-qwen2vl-no-position_ids

b1e3f6b

LysandreJik requested a review from ArthurZucker September 17, 2024 18:29

sunzjz mentioned this pull request Sep 19, 2024

qwen2vl训练需要修改position_ids问题吗 hiyouga/LlamaFactory#5477

Closed

1 task

Jintao-Huang mentioned this pull request Sep 19, 2024

fix qwen2vl position_ids modelscope/ms-swift#2051

Merged

1 task

ArthurZucker approved these changes Sep 21, 2024

View reviewed changes

Sanster mentioned this pull request Sep 27, 2024

fix qwen2-vl: create correct rope position_ids when position_ids is None linkedin/Liger-Kernel#276

Merged

3 tasks

pange1802703882 mentioned this pull request Oct 22, 2024

关于微调定位框的问题? modelscope/ms-swift#2317

Open

ShuaiBai623 mentioned this pull request Oct 28, 2024

Was M-ROPE used in the training of Qwen2-VL? QwenLM/Qwen3-VL#488

Closed

ArthurZucker reviewed Oct 29, 2024

View reviewed changes

ArthurZucker merged commit 0ab0a42 into huggingface:main Oct 29, 2024

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024

fix-qwen2vl-no-position_ids (huggingface#33487)

e4dd180

AIFFFENG mentioned this pull request Dec 11, 2024

关于temporal_patch_size的问题 QwenLM/Qwen3-VL#121

Closed

This was referenced Dec 17, 2024

请问支持视觉定位吗？ QwenLM/Qwen3-VL#9

Closed

[Bug]: Qwen2vl vllm grounding任务效果不如transformers推理 vllm-project/vllm#11254

Open

Qwen2vl vllm grounding任务效果不如transformers推理 QwenLM/Qwen3-VL#601

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix-qwen2vl-no-position_ids#33487

fix-qwen2vl-no-position_ids#33487
ArthurZucker merged 1 commit intohuggingface:mainfrom
simonJJJ:fix_qwen2vl_no_position_ids

simonJJJ commented Sep 14, 2024 •

edited

Loading

Uh oh!

ArthurZucker left a comment

Uh oh!

zhangfaen commented Oct 13, 2024 •

edited

Loading

Uh oh!

zytx121 commented Oct 14, 2024

Uh oh!

zhangfaen commented Oct 14, 2024

Uh oh!

simonJJJ commented Oct 28, 2024

Uh oh!

zhangfaen commented Oct 28, 2024

Uh oh!

ArthurZucker left a comment

Uh oh!

Betty-J commented Nov 14, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

simonJJJ commented Sep 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

zhangfaen commented Oct 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zytx121 commented Oct 14, 2024

Uh oh!

zhangfaen commented Oct 14, 2024

Uh oh!

simonJJJ commented Oct 28, 2024

Uh oh!

zhangfaen commented Oct 28, 2024

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Betty-J commented Nov 14, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

simonJJJ commented Sep 14, 2024 •

edited

Loading

zhangfaen commented Oct 13, 2024 •

edited

Loading