-
Notifications
You must be signed in to change notification settings - Fork 278
[WIP][feat] Initial support for VLMs, add Qwen2VL SFT test and Qwen2.5VL GRPO test #188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
635 commits
Select commit
Hold shift + click to select a range
ef2da68
vlm_sft_test
antoinegg1 8007f17
vlm_sft_test
antoinegg1 719081c
.
garrett4wade 2ce1ece
.
garrett4wade 1e51b2c
Fix unresolved issue in SFTTrainer PR (#139)
nuzant 09f339f
Fix unresolved issue in SFTTrainer PR (#139)
nuzant 0659174
Merge branch 'fw/refactor' of https://github.com/inclusionAI/AReaL in…
garrett4wade df5ee49
Merge branch 'fw/refactor' of https://github.com/inclusionAI/AReaL in…
garrett4wade eddfada
Merge branch 'fw/refactor2' of https://code.alipay.com/inclusionAI/AR…
garrett4wade d1f863c
Merge branch 'fw/refactor2' of https://code.alipay.com/inclusionAI/AR…
garrett4wade ae79dda
image_process0701
antoinegg1 3be8639
image_process0701
antoinegg1 3eba150
image_process0701_2
antoinegg1 e19a0dc
image_process0701_2
antoinegg1 3c36e4e
image_process0701_3
antoinegg1 bda8514
image_process0701_3
antoinegg1 a19b855
.
garrett4wade ab7503a
.
garrett4wade 4b8f824
.
garrett4wade a5299b1
.
garrett4wade c4b4d90
.
garrett4wade 3a8796b
.
garrett4wade c2fe048
imageprocess0702
antoinegg1 334a2b0
imageprocess0702
antoinegg1 14cad10
image_process0702_2
antoinegg1 e3929d1
image_process0702_2
antoinegg1 4139b22
image_process0702_3
antoinegg1 f9006ba
image_process0702_3
antoinegg1 40df511
image_process0702_4
antoinegg1 9583123
image_process0702_4
antoinegg1 ae463cc
image_process0702_5
antoinegg1 109be6a
image_process0702_5
antoinegg1 685045f
image_process0703_1
antoinegg1 618826e
image_process0703_1
antoinegg1 2bb0be3
0703_2
antoinegg1 ea0c65c
0703_2
antoinegg1 dd647e1
0703_3
antoinegg1 838774b
0703_3
antoinegg1 0e293e5
0703_4
antoinegg1 9669c85
0703_4
antoinegg1 53657b1
0703_4
antoinegg1 2900f8c
0703_4
antoinegg1 c12cc5e
0703_5
antoinegg1 c8d6d4c
0703_5
antoinegg1 a8e7a99
0703_6
antoinegg1 942a39b
0703_6
antoinegg1 dae6bec
0703_7
antoinegg1 a68f931
0703_7
antoinegg1 640a6be
0703_8
antoinegg1 20f7605
0703_8
antoinegg1 9b8a2d1
0703_9
antoinegg1 2c9f014
0703_9
antoinegg1 17af243
0703_11
antoinegg1 573cb7e
0703_11
antoinegg1 0766d19
0703_12
antoinegg1 a9a37f6
0703_12
antoinegg1 35505a2
0703_13
antoinegg1 f00c545
0703_13
antoinegg1 080f637
0703_14
antoinegg1 5db1b68
0703_14
antoinegg1 59dd80c
0703_15
antoinegg1 193d052
0703_15
antoinegg1 99633db
0703_16
antoinegg1 c75230e
0703_16
antoinegg1 5e2923b
0703-17
antoinegg1 5f0061d
0703-17
antoinegg1 f8d1211
0703_18
antoinegg1 03f54ab
0703_18
antoinegg1 19d7f94
0703_18
antoinegg1 1a6eb4a
0703_18
antoinegg1 86cbe43
0703_19
antoinegg1 6e13ea6
0703_19
antoinegg1 c04ee32
0704_1
antoinegg1 c96ceb9
0704_1
antoinegg1 aa1de1f
0704_2
antoinegg1 164c957
0704_2
antoinegg1 43ced6f
0704_3
antoinegg1 84258a2
0704_3
antoinegg1 8f62c9d
.
garrett4wade 89a8d8c
.
garrett4wade 6d3073b
0707_1
antoinegg1 32100c6
0707_1
antoinegg1 79af776
0707_2
antoinegg1 5f6bdcc
0707_2
antoinegg1 17ed423
Merge branch 'lcy/refactor' into fw/refactor
antoinegg1 9cdf903
Merge branch 'lcy/refactor' into fw/refactor
antoinegg1 7d9f41b
0703_3
antoinegg1 3132862
0703_3
antoinegg1 d15f131
r
antoinegg1 db590ea
p
antoinegg1 fe3c27f
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 5010107
Merge branch 'fw/refactor' of https://code.alipay.com/inclusionAI/ARe…
garrett4wade 43b07b3
fix
antoinegg1 5409498
fix
antoinegg1 132b755
fix
antoinegg1 9c0c094
refactor
antoinegg1 34a64a9
0707_6
antoinegg1 9dd893c
0707_7
antoinegg1 645b58c
refactor1
antoinegg1 90f4cf0
0707_undone
antoinegg1 b006b31
f
antoinegg1 aced39b
0708_1
antoinegg1 6018376
Merge remote-tracking branch 'origin/lcy/refactor' into lcy/refactor
antoinegg1 74a2eba
0708_2
antoinegg1 fcfa067
0708_3
antoinegg1 b584cd2
0708_7
antoinegg1 3d3f682
0708_4
antoinegg1 184f9e8
0709_1
antoinegg1 2b6f962
0709_2
antoinegg1 e7991fc
0709_3
antoinegg1 223cafd
0709_4
antoinegg1 c01052a
0709_5
antoinegg1 605342d
0709_
antoinegg1 7379a9d
0709_6
antoinegg1 8a7d656
0709_7
antoinegg1 92f144e
0709_7
antoinegg1 3eaf620
0709_8
antoinegg1 2edcd2a
0709_9
antoinegg1 0cd58b5
0710_1
antoinegg1 496413f
0710_2
antoinegg1 e57cb20
0710_2
antoinegg1 3122d90
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 50cf951
0710_3
antoinegg1 27c06b9
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 622781d
0710_3
antoinegg1 0d3c579
0710_3
antoinegg1 0640d5a
0710_5
antoinegg1 e1f2853
0710_4
antoinegg1 8c7affe
merge_lite
antoinegg1 395affa
merge_2
antoinegg1 4c0cd02
merge_3
antoinegg1 16e087d
0711_1
antoinegg1 fc51cd5
0711_2
antoinegg1 2af8cd5
0711_3
antoinegg1 b3fed3c
0711_4
antoinegg1 cad4488
0711_6
antoinegg1 437f7a7
0711_7
antoinegg1 35ab78b
0711_8
antoinegg1 04e432e
0711_8
antoinegg1 d6ff9e7
0711_9
antoinegg1 036aa9a
0711_10
antoinegg1 10a3731
0711-11
antoinegg1 434d2f5
PullRequest: 353 [Lite] Add gradient checkpointing to FSDPEngine
nuzant d8038b2
PullRequest: 354 [lite] GRPO pre-commit: minor changes in FSDP engine
garrett4wade d79f0dc
0714_1
antoinegg1 eb524e8
0714_2
antoinegg1 84cd936
0714_3
antoinegg1 b74f240
0714_3
antoinegg1 101c4e9
0714_5
antoinegg1 724628e
PullRequest: 355 [Lite] GRPO pre-commit 2: Refactor RemoteSGLangEngin…
garrett4wade 8a15551
PullRequest: 357 [lite] GRPO pre-commit 3: Fix typos and experiment u…
garrett4wade 3f95968
PullRequest: 358 [lite] Support GRPO training locally with the GSM8k …
garrett4wade 69f5450
merge1
antoinegg1 0435aa5
merge2
antoinegg1 e960c17
0715_1
antoinegg1 ec60071
0715_2
antoinegg1 325ef6e
0715_2
antoinegg1 c75dcaf
merge
garrett4wade 712a4ab
0716_1
antoinegg1 5efd861
0716_2
antoinegg1 b2bd639
PullRequest: 368 [lite] Refactor train engine after merging contribut…
garrett4wade b56f599
PullRequest: 371 [lite] [fix] fix misc bugs in GRPO implementation
garrett4wade 74fcc38
0716_3
antoinegg1 1cbb642
0716_4
antoinegg1 2419d44
0716_4
antoinegg1 8596ef4
0716_5
antoinegg1 3c7c739
0717_1
antoinegg1 871b25a
0717_3
antoinegg1 0a2b9db
0717_3
antoinegg1 510313b
0717_4
antoinegg1 ce796f2
0717_5
antoinegg1 e9dc112
0717_6
antoinegg1 587544b
0717_6
antoinegg1 a032333
0717_6
antoinegg1 c0176b5
0718_1
antoinegg1 0e27a10
0718_2
antoinegg1 a08043e
0718_4
antoinegg1 090850a
0718_5
antoinegg1 ddabd9c
PullRequest: 370 [lite] Add Slurm Launcher and Ray Launcher
nuzant ade6a1d
Merge remote-tracking branch 'origin/lite' into lcy/refactor
antoinegg1 c8952f0
merge_0721
antoinegg1 25b65a2
0721_1
antoinegg1 2f1b679
PullRequest: 392 [lite] Fix several bugs regarding RL learning and ad…
garrett4wade 588ffd2
0721_2
antoinegg1 a157510
0721_3
antoinegg1 8f26371
merge_0721_2
antoinegg1 9c4da33
Merge branch 'lite' of https://github.com/inclusionAI/AReaL into lite
garrett4wade 9fcc177
0721_4
antoinegg1 ab5db3f
.
garrett4wade 4dd4a22
.
garrett4wade 339e87a
0721_formal
antoinegg1 67760d3
0721_formal
antoinegg1 60ac722
0721_merge3
antoinegg1 a2d6d21
0721_merge4
antoinegg1 b4e8215
0721_merge5
antoinegg1 475c35c
0721_6
antoinegg1 aed6a90
Merge remote-tracking branch 'backup/lite' into lcy/refactor
antoinegg1 c295614
0721_merge6
antoinegg1 f451dbd
0721_merge7
antoinegg1 80862b7
0721_8
antoinegg1 79e2a81
0722_1
antoinegg1 3d2f7a9
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 7199ce2
0722_2
antoinegg1 eba0b5f
0722_3
antoinegg1 229f101
0722_4
antoinegg1 ea12141
0722_4
antoinegg1 c27a51b
0722_5
antoinegg1 5c0662f
0722_6
antoinegg1 af2f80c
0722_7
antoinegg1 8815be6
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 eff8f09
0723_1
antoinegg1 6bde86a
reformatted
antoinegg1 52c9447
clang-reformatted
antoinegg1 25884f5
clang-reformatted2
antoinegg1 391bd85
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 dd14838
0723_1
antoinegg1 9ec2c3f
0723_1
antoinegg1 4041afb
0723_1
antoinegg1 2a2e2fe
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 8e82c59
0723_merge3
antoinegg1 d12dec2
0723_4
antoinegg1 82442b8
Merge branch 'lcy/refactor' of https://code.alipay.com/inclusionAI/AR…
antoinegg1 00b5d87
0723_reformatted_5
antoinegg1 9a16605
0724_1
antoinegg1 6c28d52
0724_1
antoinegg1 c816a3c
0724_merge1
antoinegg1 e97e33f
0724_merge2
antoinegg1 176ec4b
0724_merge3
antoinegg1 5118cfa
0724_merge3
antoinegg1 5690b52
0724_merge4
antoinegg1 84e2d75
Merge remote-tracking branch 'backup/lite' into lcy/refactor
antoinegg1 1bc9310
0724_merge5
antoinegg1 13fc236
0724_merge6
antoinegg1 27fd51a
0724_merge7
antoinegg1 e705db1
0724_merge8
antoinegg1 6aeeabf
0724_4
antoinegg1 f5924b1
0724_merge7
antoinegg1 84be9c9
Merge remote-tracking branch 'backup/lite' into lcy/refactor
antoinegg1 6255ad5
0724-merge8
antoinegg1 b8549ac
0724_merge8
antoinegg1 4198cd6
0725_1
antoinegg1 3c272ff
0725_6
antoinegg1 8eaced4
0725_7
antoinegg1 4f8b17f
0725_4padded_image
antoinegg1 cc3c6bb
0725_9padded_image
antoinegg1 60ac19a
0725_10padded_image
antoinegg1 fb1796d
0725_11
antoinegg1 a4ad671
0725
antoinegg1 6b8bfcf
0725_12
antoinegg1 4ff813a
0725_format
antoinegg1 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,47 @@ | ||
| from typing import Optional | ||
|
|
||
| import transformers | ||
|
|
||
| VALID_DATASETS = ["gsm8k", "clevr_count_70k"] | ||
|
|
||
|
|
||
| def get_custom_dataset( | ||
| path: str, | ||
| rank: int, | ||
| world_size: int, | ||
| type: str = "sft", | ||
| split: Optional[str] = None, | ||
| tokenizer: Optional[transformers.PreTrainedTokenizerFast] = None, | ||
| processor: Optional[transformers.AutoProcessor] = None, | ||
| **kwargs, | ||
| ): | ||
|
|
||
| if "gsm8k" in path and type == "sft": | ||
| from examples.arealite.dataset.gsm8k import get_gsm8k_sft_dataset | ||
|
|
||
| return get_gsm8k_sft_dataset(path, split, tokenizer, rank, world_size, **kwargs) | ||
| elif "gsm8k" in path and type == "rl": | ||
| from examples.arealite.dataset.gsm8k import get_gsm8k_rl_dataset | ||
|
|
||
| return get_gsm8k_rl_dataset(path, split, rank, world_size, **kwargs) | ||
| elif "clevr_count_70k" in path and type == "sft": | ||
| from examples.arealite.dataset.clevr_count_70k import ( | ||
| get_clevr_count_70k_sft_dataset, | ||
| ) | ||
|
|
||
| return get_clevr_count_70k_sft_dataset( | ||
| path, split, processor, rank, world_size, **kwargs | ||
| ) | ||
| elif "clevr_count_70k" in path and type == "rl": | ||
| from examples.arealite.dataset.clevr_count_70k import ( | ||
| get_clevr_count_70k_rl_dataset, | ||
| ) | ||
|
|
||
| return get_clevr_count_70k_rl_dataset( | ||
| path, split, processor, rank, world_size, **kwargs | ||
| ) | ||
| else: | ||
| raise ValueError( | ||
| f"Dataset {path} with split {split} and training type {type} is not supported. " | ||
| f"Supported datasets are: {VALID_DATASETS}. " | ||
| ) | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.