feat: experiment with different dimensions for OpenAI models on `MTEB(Medical)` by dbuades · Pull Request #78 · embeddings-benchmark/results

dbuades · 2024-12-20T13:38:49Z

As discussed in #71, this PR evaluates both text-embeddings-3-small (256, 512, 768, 1024) and text-embeddings-3-large (256, 512, 768, 1024, 1536) on the MTEB(Medical) benchmark.

This is a total of 9 combinations, currently stored in 9 separate folders. Code used to run the models can be found in this commit, following the approach described here.
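As a rough illustration of the grid above, the sketch below lists the 9 model/dimension pairs and shows one way to get a shorter embedding: keep the first components and rescale to unit length. OpenAI documents this as equivalent in spirit to the `dimensions` request parameter, but the PR does not say which route it used. The labels copy the spelling used in this PR's commit messages (the OpenAI API identifiers are `text-embedding-3-small` and `text-embedding-3-large`). The helper names `experiment_names` and `shorten_embedding` are hypothetical and do not come from the linked commit.

```python
import math

# Dimension grid from the PR description. Labels follow the PR's commit
# messages; how the result folders are actually named is an assumption.
DIMENSIONS = {
    "text-embeddings-3-small": [256, 512, 768, 1024],
    "text-embeddings-3-large": [256, 512, 768, 1024, 1536],
}


def experiment_names(grid):
    """Return one '<model>-<dim>' label per (model, dimension) pair."""
    return [f"{model}-{dim}" for model, dims in grid.items() for dim in dims]


def shorten_embedding(vector, dim):
    """Keep the first `dim` components and rescale to unit L2 norm."""
    if not 0 < dim <= len(vector):
        raise ValueError("dim must be between 1 and len(vector)")
    head = vector[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return list(head)
    return [x / norm for x in head]


names = experiment_names(DIMENSIONS)
print(len(names))  # 4 small + 5 large = 9 combinations
```

Re-normalizing after truncation matters because cosine-similarity code often assumes unit-length vectors; a truncated vector no longer has norm 1 on its own.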

I'll leave the PR in draft for now and we can use it to explore different ways to display experiments in the leaderboard without adding new model duplicates.

Thanks again @Muennighoff for providing the API key!

Checklist

Run tests locally with `make test` to make sure nothing is broken.
Run the results files checker with `make pre-push`.

KennethEnevoldsen · 2025-02-05T08:07:16Z

Thanks for your work on this @dbuades. I believe we are currently working on transitioning to v2, where I don't think this feature is planned. So I will close this for now, but I would love to add it in the future.

Again, thanks for the PR. I simply think there is no one who has time to finalize this feature (@sam-hey might be interested; if you are, let me know).

sam-hey · 2025-02-05T12:02:18Z

@KennethEnevoldsen A bit short on time this month, but I'll take a look next month

Muennighoff · 2025-02-05T16:59:48Z

I think we can still merge this, given the folders for those models already exist? I think they were also displayed on the previous leaderboard, so maybe we could just display them in the same way on the new leaderboard 🤔

Samoed · 2025-02-05T17:08:18Z

CC @x-tabdeveloping

dbuades · 2025-02-05T17:25:41Z

No worries, I understand! If merging them as-is is not appropriate and there’s any way I can help with the implementation, please let me know. I think these results are interesting since they highlight the impact of the dimensions. If you have any other preliminary designs, I have bandwidth next week to work on them.

dbuades added 10 commits December 20, 2024 14:26

fix: set kg_co2_emissions to null

2015806

feat: text-embeddings-3-small-256

ba96aee

feat: text-embeddings-3-small-512

6d16120

feat: text-embeddings-3-small-768

d55be50

feat: text-embeddings-3-small-1024

1acc6d9

feat: text-embeddings-3-large-256

f496129

feat: text-embeddings-3-large-512

0803e50

feat: text-embeddings-3-large-768

2503a2a

feat: text-embeddings-3-large-1024

41e7949

feat: text-embeddings-3-large-1536

282b1cc

dbuades marked this pull request as draft December 20, 2024 13:39

KennethEnevoldsen closed this Feb 5, 2025

dbuades wants to merge 10 commits into embeddings-benchmark:main from clinia:exp/openai-dimensions-medical-mteb
