-
Notifications
You must be signed in to change notification settings - Fork 125
Split VisualSTS into eng and multi #133
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
hmm, what a hacky way of using the aggregate task ;) Like it though. We could also do this for the BRIGHT tasks (on the old leaderboard the split appears as different tasks) Should we by default exclude aggregate tasks for |
Thanks! And I don't have a strong preference. I guess if they are excluded by default, a parameter would added to "enable" them? e.g. |
hmm yea. Let us just include them for now, but could imagine that we could include a lot of "duplicates". |
|
looks great. Scores are matching with the aggregated results in the paper too. Do we have all models here? |
@gowitheflow-1998 |
To create the VisualSTS(eng) and VisualSTS(multi) columns in the LB for the MIEB benchmarks (to align with the paper), we split the tasks by using aggregate tasks, so that we don't need to rerun results again, and only need to generate new ones.
Checklist
make test.make pre-push.Adding a model checklist
mteb/models/directory. Instruction to add a model can be found here in the following PR: feat: Add MIEB and MIEB-lite as benchmarks mteb#2035