fix-qwen2vl-no-position_ids#33487
Conversation
ArthurZucker
left a comment
There was a problem hiding this comment.
Makes sense! Do you want to add a small test for training? 🤗
…one (#276) ## Summary When `position_ids` is None, we should call `get_rope_index` to create 3D rope index The code was copied from here: huggingface/transformers#33487. ## Testing Done I am using qwen2-vl to train the grounding task. The red box shows the results before fixing, and the green box shows the results after fixing (correct results). <img width="146" alt="WechatIMG4500_副本" src="https://github.com/user-attachments/assets/e7a42f89-e19b-4b53-b84a-0d62d981e54a"> - Hardware Type: 3090 - [x] run `make test` to ensure correctness - [x] run `make checkstyle` to ensure code style - [x] run `make test-convergence` to ensure convergence --------- Co-authored-by: Shao Tang <tangshao28@gmail.com>
|
I am trying to finetune Qwen2-VL for Object detection on coco dataset ( https://github.com/zhangfaen/finetune-Qwen2-VL ) Without this PR, my finetuned model performance is bad. Any progress for this PR to be merged? Thank you |
@zhangfaen Strange, after I modified this code, the fine-tuning performance still did not improve. Is there anything else that needs to be modified? |
for what task and against what dataset are you finetuning qwen2-vl ? |
|
hi @ArthurZucker, would u mind taking a look at merging this PR? The community is waiting for it. |
|
ArthurZucker
left a comment
There was a problem hiding this comment.
Yep sorry I approved I don't know why it was lost in tracks!
Hi there, when I used |
What does this PR do?
Fixes # (issue) (issue)
We encountered some issues with
position_ids==Nonewhen fine-tuning the qwen2vl model, because the model_inputs do not pass theposition_idsarg.Before submitting
Pull Request section?
to it if that's the case.
documentation guidelines, and
here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.