Add Gemma-Embeddings-v0.8 Retrieval Results by nicholasmonath · Pull Request #59 · embeddings-benchmark/results

nicholasmonath · 2024-12-02T19:56:24Z

Add Gofer-Embeddings-v0.8 results on the retrieval subset of tasks in MTEB.

KennethEnevoldsen · 2024-12-03T13:42:18Z

Thanks for the PR, @nicholasmonath. It seems like multiple fields, such as the MTEB version, are not specified, as shown by the tests. It also seems like the model_meta is not filled out.

Samoed · 2024-12-03T16:35:46Z

Could you provide the script you used to run MTEB? It seems a bit unusual that the original results didn’t include the MTEB version and evaluation time.

nicholasmonath · 2024-12-03T23:31:36Z

Hi @KennethEnevoldsen and @Samoed. Thank you for your comments. We wrote a sanitizer to remove sensitive info like timings, but we realized that our sanitizer was overly sensitive and removed even necessary fields. We are working on updating the pull request. We will add back the MTEB version that we used (we noticed that it is actually an older version 1.0.3) and model_meta. However, we are still required to avoid evaluation time due to the sensitivity of the infrastructure that we use.
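A sanitizer along these lines could look roughly like the sketch below. It is an illustration only, not the script actually used: the field names (`evaluation_time`, `kg_co2_emissions`, `mteb_version`, `task_name`, `scores`) and the choice to null out sensitive values rather than drop the keys are assumptions.

```python
import copy

# Assumed field names; adjust to the actual result-file schema.
SENSITIVE_FIELDS = {"evaluation_time", "kg_co2_emissions"}
REQUIRED_FIELDS = {"mteb_version", "task_name", "scores"}


def sanitize_result(result: dict) -> dict:
    """Return a copy with sensitive fields nulled and required fields intact."""
    missing = REQUIRED_FIELDS - result.keys()
    if missing:
        # Fail loudly instead of silently emitting an incomplete file.
        raise ValueError(f"result is missing required fields: {sorted(missing)}")
    cleaned = copy.deepcopy(result)
    for field in SENSITIVE_FIELDS & cleaned.keys():
        cleaned[field] = None  # keep the key so the file layout is unchanged
    return cleaned
```

Checking the required fields before writing anything is what would prevent an overly aggressive sanitizer from stripping the MTEB version again.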

KennethEnevoldsen · 2024-12-04T07:58:08Z

Excluding runtime and CO2 emissions is fine. However, 1.0.3 is quite an old version, and I would strongly recommend running it on the latest version of mteb. The scores should be approximately the same (there may be minor differences, since the seed changed in older versions of the code, along with other code changes). We also standardized the result format in later versions of MTEB. If your model is prompt-based, newer versions of the benchmark allow you to integrate that as well.
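One way to catch results produced with an old release before submitting is to compare the recorded version against a chosen minimum. The sketch below assumes each result is a dict with an `mteb_version` key holding a plain dotted numeric string; the minimum is a hypothetical threshold chosen by the caller, not a documented requirement.

```python
def parse_version(version: str) -> tuple[int, ...]:
    """Turn a dotted release string such as '1.21.7' into (1, 21, 7)."""
    return tuple(int(part) for part in version.split("."))


def is_outdated(result: dict, minimum: str) -> bool:
    """True if the result's recorded mteb_version is older than `minimum`."""
    return parse_version(result["mteb_version"]) < parse_version(minimum)
```

Comparing integer tuples rather than raw strings matters here, because a string comparison would rank "1.9.0" above "1.21.7".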

nicholasmonath · 2024-12-09T11:56:03Z

Hi @KennethEnevoldsen, thank you for your comments and your time reviewing this PR. We have now updated our MTEB version to 1.21.7. The latest files now have this version, and we have only sanitized the evaluation time. Please let us know if you have any questions or concerns.

KennethEnevoldsen

Thanks for the update. There are a few issues remaining:

results/google__Gemma-Embeddings-v0.8/d6813d20532a97ea8e30fc285397d5105316511f/ArguAna.json

results/google__Gemma-Embeddings-v0.8/d6813d20532a97ea8e30fc285397d5105316511f/model_meta.json

nicholasmonath · 2024-12-10T21:45:19Z

Thank you so much @KennethEnevoldsen! Sorry about those remaining issues. I believe I have resolved them all now. Please do let me know if there is anything else.

nicholasmonath · 2024-12-12T03:56:28Z

Hi @KennethEnevoldsen, thank you again for all of your help with this pull request. I wanted to check in about when the results will appear on the leaderboard. I thought they might appear after today's update, but I don't see them yet. Please let me know if there is anything more you need from my side.

Thanks very much!

KennethEnevoldsen · 2024-12-12T04:17:56Z

For them to appear on the current leaderboard, you will have to update paths.json (see the snippet in results.py). If adding new models, also add their names to results.py.

(we are close to having a new leaderboard ready where this will no longer be necessary)
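As a rough illustration of what regenerating such an index might involve, the sketch below walks a results directory laid out as `results/<model>/<revision>/<Task>.json` (the layout visible in this PR) and maps each model directory to its result files. The output schema (model name mapped to a list of paths), the exclusion of `model_meta.json`, and both function names are assumptions; the snippet in results.py remains the authoritative way to build paths.json.

```python
import json
from pathlib import Path


def collect_result_paths(results_dir: str) -> dict[str, list[str]]:
    """Map each model directory name to its sorted result-file paths."""
    root = Path(results_dir)
    paths: dict[str, list[str]] = {}
    for model_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        files = sorted(
            f.as_posix()
            for f in model_dir.rglob("*.json")
            if f.name != "model_meta.json"  # assumed not to be a task result
        )
        if files:
            paths[model_dir.name] = files
    return paths


def write_paths_json(results_dir: str, out_file: str) -> None:
    """Write the collected mapping to `out_file` as indented JSON."""
    with open(out_file, "w", encoding="utf-8") as fh:
        json.dump(collect_result_paths(results_dir), fh, indent=2)
```

Under these assumptions, calling `write_paths_json("results", "paths.json")` from the repository root would produce a file with one entry per model directory.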

nicholasmonath · 2024-12-12T04:52:32Z

Thank you for your quick reply and the information, @KennethEnevoldsen! I have updated paths.json here: #69

Note that it looks like the MODELS in results.py are automatically pulled from this line: https://github.com/embeddings-benchmark/results/blob/main/results.py#L295, so I did not modify this file.

nicholasmonath added 6 commits December 2, 2024 19:47

Add Gofer-Embeddings-v0.8 Retrieval Results

f2fe004

Set revision to be external

f4e4fea

Fix revision ID

2577635

Fix formatting

3df2119

Fix formatting

cecfff9

Fix formatting

eff7c50

Merge branch 'embeddings-benchmark:main' into main

b5a7196

nicholasmonath and others added 5 commits December 9, 2024 11:11

model naming

28c5baf

Update to improved model and results using mteb version 1.21.7

1be252c

Merge branch 'main' of github.com:nicholasmonath/results

0b5e079

Merge branch 'embeddings-benchmark:main' into main

c4366a5

Merge branch 'main' of github.com:nicholasmonath/results

48e738f

nicholasmonath changed the title ~~Add Gofer-Embeddings-v0.8 Retrieval Results~~ Add Gemma-Embeddings-v0.8 Retrieval Results Dec 9, 2024

KennethEnevoldsen reviewed Dec 10, 2024

nicholasmonath and others added 5 commits December 10, 2024 16:26

Update results/google__Gemma-Embeddings-v0.8/d6813d20532a97ea8e30fc28…

618b029

…5397d5105316511f/model_meta.json Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

Fix -1 evaluation time

aca915a

Merge branch 'main' of github.com:nicholasmonath/results

1ac505f

Fix revision hash / model meta data

be81705

Merge branch 'embeddings-benchmark:main' into main

8944407

KennethEnevoldsen approved these changes Dec 11, 2024

KennethEnevoldsen merged commit 2a8b9de into embeddings-benchmark:main Dec 11, 2024

nicholasmonath mentioned this pull request Dec 12, 2024

Update paths.json to include Gemma-Embedings-v0.8 Results #69

Merged

Samoed mentioned this pull request Dec 12, 2025

remove results of model with missing implementations in MTEB #362

Merged

KennethEnevoldsen merged 17 commits into embeddings-benchmark:main from nicholasmonath:main
