Results for BarExamQA PR #240

abdurrahmanbutler · 2025-07-21T05:22:11Z

Hi,
I’m submitting this pull request to push the results of intfloat/multilingual-e5-small and sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 on BarExamQA.

This PR is connected to embeddings-benchmark/mteb#2916, which adds BarExamQA to MTEB.

This pull request is being submitted courtesy of Isaacus, a legal AI research company.

Checklist

My model has a model sheet, report or similar
My model has a reference implementation in mteb/models/; this can be an API. Instructions on how to add a model can be found here
- No, but there is an existing PR ___
The results submitted were obtained using the reference implementation
My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
I solemnly swear that for all results submitted I have not trained on the evaluation dataset, including training splits. If I have, I have disclosed it clearly.

umarbutler

I can confirm that I have reviewed and approve this PR on behalf of Isaacus.

Samoed · 2025-07-21T06:43:27Z

...lts/intfloat__multilingual-e5-small/c007d7ef6fd86656326059b28395a7a03a7c5846/model_meta.json

I think you've run the model incorrectly. You should load the model with mteb.get_model.

Hey @Samoed,
These results were generated by following the instructions for adding a dataset to MTEB: https://github.com/embeddings-benchmark/mteb/blob/main/docs/adding_a_dataset.md#submit-a-pr

The exact same code was used:

    from mteb import MTEB
    from sentence_transformers import SentenceTransformer

    # Define the sentence-transformers model name
    model_name = "sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2"

    model = SentenceTransformer(model_name)
    evaluation = MTEB(tasks=[YourNewTask()])

I see. I think those instructions are a bit outdated. Thanks for pointing that out!

Can you rerun this model using the updated instructions in embeddings-benchmark/mteb#2922?

Yep, I reran the model using mteb.get_model and it seems to produce the right model metadata. I believe the sentence-transformers version of the model is newer, which led to the different metadata.
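For anyone rerunning results the same way, here is a minimal sketch of the mteb.get_model route discussed above. It is an illustration rather than the exact script used for this PR: the function name run_with_mteb_loader and the output_folder default are hypothetical, and the calls assume an mteb 1.x release that exposes mteb.get_model, mteb.get_tasks and mteb.MTEB.

```python
def run_with_mteb_loader(model_name, task_name, output_folder="results"):
    """Evaluate one model on one task, loading the model through mteb.

    Hypothetical helper for illustration; assumes an mteb 1.x API.
    """
    # Imported lazily so the function can be defined without mteb installed.
    import mteb

    # Load through mteb rather than SentenceTransformer(model_name) directly,
    # so that mteb supplies the model metadata it has registered for the model.
    model = mteb.get_model(model_name)

    # Look the task up by name instead of instantiating the task class by hand.
    tasks = mteb.get_tasks(tasks=[task_name])

    evaluation = mteb.MTEB(tasks=tasks)
    return evaluation.run(model, output_folder=output_folder)
```

Calling run_with_mteb_loader("intfloat/multilingual-e5-small", "BarExamQA") would download the model and dataset, so it needs network access and an mteb version that already includes the BarExamQA task.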

results/intfloat__multilingual-e5-small/c007d7ef6fd86656326059b28395a7a03a7c5846/BarExamQA.json
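The path above suggests the layout of a results file: the model name with "/" replaced by "__", then the model revision, then the task name. The sketch below rebuilds that path for checking a submission locally. The helper name result_path is hypothetical, and the layout is inferred only from the paths shown in this thread.

```python
from pathlib import PurePosixPath


def result_path(model_name, revision, task_name, root="results"):
    """Build the expected results path for one model revision and task.

    Layout inferred from paths in this PR, not from the mteb source.
    """
    # "intfloat/multilingual-e5-small" -> "intfloat__multilingual-e5-small"
    model_dir = model_name.replace("/", "__")
    return PurePosixPath(root) / model_dir / revision / f"{task_name}.json"


path = result_path(
    "intfloat/multilingual-e5-small",
    "c007d7ef6fd86656326059b28395a7a03a7c5846",
    "BarExamQA",
)
print(path)
```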

abdurrahmanbutler added 2 commits July 21, 2025 15:19

Added results for BarExamQA PR

be6e8e6

Added results for BarExamQA PR

e32652d

umarbutler reviewed Jul 21, 2025

Samoed reviewed Jul 21, 2025

This was referenced Jul 21, 2025

dataset: add BarExamQA dataset embeddings-benchmark/mteb#2916

Merged

Use mteb.get_model in adding_a_dataset.md embeddings-benchmark/mteb#2922

Merged

Samoed approved these changes Jul 21, 2025

abdurrahmanbutler added 3 commits July 21, 2025 19:30

Merge branch 'embeddings-benchmark:main' into results-for-BarExamQA

9c8b7c8

deleting dir from using sentence transformers model

eb5c2bd

New results generated using mteb.get_model()

341bfbd

Samoed merged commit f4a723d into embeddings-benchmark:main Jul 21, 2025
2 of 3 checks passed


3 participants
