
Fix the issue with media_offset in owl3 when batch_size > 1. #2100

Merged · 2 commits · Sep 23, 2024

Conversation

LukeForeverYoung (Contributor)

PR type

  • [x] Bug Fix
  • [ ] New Feature
  • [ ] Document Updates
  • [ ] More Models or Datasets Support

PR information

Write the detailed information belonging to this PR.

When the batch size is greater than 1, the media_offset in mPLUG-Owl3 needs to be padded; we use the last value of each media_offset to fill the padding. Additionally, note that the images in a batch are stacked in order, so each sample's media_offset values must be incremented by the number of images preceding that sample in the batch.
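The two steps described above (shift each sample's offsets by the cumulative image count, then pad with the last value) can be sketched as follows. This is an illustrative reconstruction, not the actual mPLUG-Owl3 code; the function and parameter names are hypothetical.

```python
def collate_media_offsets(media_offsets, image_counts):
    """Batch per-sample media_offset lists (hypothetical sketch).

    media_offsets: list of per-sample lists of image indices.
    image_counts:  number of images each sample contributes to the
                   batch-wide stacked image tensor.
    """
    max_len = max(len(m) for m in media_offsets)
    batched = []
    cumulative = 0  # images contributed by earlier samples in the batch
    for offsets, n_images in zip(media_offsets, image_counts):
        # Images are stacked in order, so shift this sample's indices
        # past all images that precede it.
        shifted = [o + cumulative for o in offsets]
        # Pad with the last value so padded positions still reference
        # a valid, already-seen image.
        shifted += [shifted[-1]] * (max_len - len(shifted))
        batched.append(shifted)
        cumulative += n_images
    return batched
```

For example, with two samples whose offsets are `[0, 1]` and `[0]` (two images and one image respectively), the second sample's offset is shifted to `2` and padded to match the first sample's length.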

@Jintao-Huang (Collaborator)

Thank you very much.

@yingdachen yingdachen merged commit 9cc72e3 into modelscope:main Sep 23, 2024
1 of 2 checks passed
Jintao-Huang pushed a commit that referenced this pull request Sep 23, 2024
* mplugowl3 mediaoffset issue

* padding of mediaoffset
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request Sep 26, 2024
* commit '57b3b9e46aa01bdc5c29b5e3d1e2da0582c9b282': (23 commits)
  fix not impl bug (modelscope#2134)
  Support fine-tuning MLLama. (modelscope#2132)
  Support for fine-tuning and deployment of the Llama 3.2 series models. (modelscope#2130)
  support got-ocr2 (modelscope#2123)
  [TorchAcc] fix: fix find_labels and can_return_loss (modelscope#2120)
  fix qwen2-audio (modelscope#2116)
  Fix qwen2-vl zero2/3 (modelscope#2114)
  support vllm & qwen2-vl video (modelscope#2110)
  Support for fine-tuning Llama 3.1 Omni. (modelscope#2106)
  fix infer device_map (modelscope#2105)
  fix cpu infer device_map (modelscope#2103)
  fix dataset preprocess (modelscope#2102)
  fix deploy openai compat (modelscope#2101)
  Fix the issue with media_offset in owl3 when batch_size > 1. (modelscope#2100)
  fix vllm tokenizer (modelscope#2099)
  Support for fine-tuning Pixtral-12B. (modelscope#2090)
  fix multiprocess remove_columns (modelscope#2088)
  fix qwen2.5 template (modelscope#2081)
  dynamic vit gradient_checkpointing (modelscope#2071)
  Support Mistral-small-inst-2409 (modelscope#2077)
  ...