-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Add MMMU benchmark results #4491
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
hnyls2002
merged 44 commits into
sgl-project:main
from
ravi03071991:ravi/benchmark_mmmu
Apr 25, 2025
Merged
Changes from 1 commit
Commits
Show all changes
44 commits
Select commit
Hold shift + click to select a range
0111828
Add MMMU benchmark results
6d4ce17
Update sglang qwen2.5 and minicpmv
02ed3b2
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 3bf70ea
Update metrics and run instructions
8106d2a
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 0870ca5
Update with latest results
82c18f2
Update minicpmv model results
6213f61
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 12d67d0
fix lint
47eeb56
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 60e905a
Update with llava based models and mllama
e7d9d4c
Update for Deepseek VL2
8fe39d9
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 e7a9b0f
update for MiniCPM-O-2_6
c9191b5
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 4ab2d1e
update
a001556
Update
7131879
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 e37958f
Add CI test for VLM models
c59be27
fix lints
98f88ad
remove static results
8730039
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 4320166
Update with lmm_evals
2912e47
update
1d926e6
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 cb9373a
remove env
aa5d7d4
Merge remote-tracking branch 'origin/ravi/benchmark_mmmu' into ravi/b…
6a49c71
Update CI install, run suit
1fd6495
change to python3
8916c64
Update with suggestions
40a319c
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 5d63bde
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 392e1dd
Update
539dd58
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 9e759c3
Update to mmmu_val
0d61208
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 5630f07
Update score for gemma3 model
2ba021b
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 3821bf5
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 7be94da
Update openai api key and base env variable names
58c7a78
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 88b8cf7
fix lmms-eval branch
3c95028
mem-fraction-static arguement
e8c645d
Merge branch 'main' into ravi/benchmark_mmmu
ravi03071991 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
could we use the latest one to always stand updated?
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The tests are breaking due to recent changes on
lmms-eval. Some code related to environment variables was modified, which caused the tests to fail - that’s why I fixed it to a specific version. If similar changes happen in the future, the tests might break again. What do you think?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it's okay!