Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
438 commits
Select commit Hold shift + click to select a range
042d6e7
Update tasks table
github-actions[bot] Mar 18, 2025
b3a9191
1.36.26
invalid-email-address Mar 18, 2025
5ebee24
Pass task name to all evaluators (#2389)
Samoed Mar 18, 2025
e7b04a6
fix: renaming Zeroshot -> ZeroShot (#2395)
KennethEnevoldsen Mar 20, 2025
349d5a8
1.36.27
invalid-email-address Mar 20, 2025
cf84a79
fix: Update AmazonPolarityClassification license (#2402)
KennethEnevoldsen Mar 20, 2025
a0990cb
fix b1ade name (#2403)
Samoed Mar 20, 2025
d2dc2f6
1.36.28
invalid-email-address Mar 20, 2025
8be95b7
Minor style changes (#2396)
KennethEnevoldsen Mar 21, 2025
e2476d2
Added new dataset and tasks - ClusTREC-covid , clustering of thematic…
katzurik Mar 21, 2025
5b0bd56
Update tasks table
github-actions[bot] Mar 21, 2025
cae1575
fix: Major updates to docs + make mieb dep optional (#2397)
KennethEnevoldsen Mar 22, 2025
9c459a8
1.36.29
invalid-email-address Mar 22, 2025
811dbf6
remove Arabic_Triplet_Matryoshka_V2.py (#2405)
Samoed Mar 22, 2025
146a893
Min torchvision>0.2.1 (#2410)
isaac-chung Mar 22, 2025
095851f
fix: Add validation to model_name in `ModelMeta` (#2404)
Samoed Mar 22, 2025
a934610
1.36.30
invalid-email-address Mar 22, 2025
2833138
[MIEB] "capability measured"-Abstask 1-1 matching refactor [1/3]: rei…
gowitheflow-1998 Mar 22, 2025
065159d
Update tasks table
github-actions[bot] Mar 22, 2025
e8faf3f
fix: Add option to remove benchmark from leaderboard (#2417)
KennethEnevoldsen Mar 23, 2025
a25dadb
1.36.31
invalid-email-address Mar 23, 2025
9d9b0b4
fix: Add VDR Multilingual Dataset (#2408)
ayush1298 Mar 23, 2025
34edcd5
Update tasks table
github-actions[bot] Mar 23, 2025
0cdf2e0
1.36.32
invalid-email-address Mar 23, 2025
071741d
HOTFIX: pin setuptools (#2423)
Samoed Mar 24, 2025
39cee62
add __init__.py Clustering > kor folder, And edit __init__.py in C…
OnAnd0n Mar 25, 2025
55c542b
Update tasks table
github-actions[bot] Mar 25, 2025
731c4fc
Update speed dependencies with new setuptools release (#2429)
Samoed Mar 25, 2025
98ab0ef
add richinfoai models (#2427)
richinfo-ai Mar 25, 2025
d3eab6f
Added Memory Usage column on leaderboard (#2428)
ayush1298 Mar 25, 2025
0db0a20
docs: typos; Standardize spacing; Chronological order (#2436)
Muennighoff Mar 26, 2025
8a024be
fix: Add model specific dependencies in pyproject.toml (#2424)
ayush1298 Mar 26, 2025
6ae420d
1.36.33
invalid-email-address Mar 26, 2025
65446e5
[MIEB] "capability measured"-Abstask 1-1 matching refactor [2/3]: rei…
gowitheflow-1998 Mar 26, 2025
19dc625
Update tasks table
github-actions[bot] Mar 26, 2025
dadafbe
Error while evaluating MIRACLRetrievalHardNegatives: 'trust_remote_co…
KennethEnevoldsen Mar 27, 2025
43adb0c
Feat/searchmap preview (#2420)
Free-tek Mar 28, 2025
5af5547
Add Background Gradients in Summary and Task Table (#2392)
ayush1298 Mar 29, 2025
61d3c6c
add ops_moa_models (#2439)
ahxgw Mar 29, 2025
35a8a5b
leaderboard fix (#2456)
ayush1298 Mar 29, 2025
d11934f
ci: cache `~/.cache/huggingface` (#2464)
sam-hey Mar 31, 2025
8799126
[MIEB] "capability measured"-Abstask 1-1 matching refactor [3/3]: rei…
gowitheflow-1998 Apr 1, 2025
5b567bf
Update tasks table
github-actions[bot] Apr 1, 2025
f293d8b
fix: Adds family of NeuML/pubmedbert-base-embedding models (#2443)
nadshe Apr 1, 2025
c617598
fix: add nb_sbert model (#2339)
theatollersrud Apr 1, 2025
42068c6
1.36.34
invalid-email-address Apr 1, 2025
e837b09
suppress logging warnings on leaderboard (#2406)
Samoed Apr 2, 2025
6c8c8d2
fix: E5 instruct now listed as sbert compatible (#2475)
KennethEnevoldsen Apr 2, 2025
eef52be
1.36.35
invalid-email-address Apr 2, 2025
295ad0a
[MIEB] rename VisionCentric to VisionCentricQA (#2479)
isaac-chung Apr 2, 2025
17b53b4
ci: Run dataset loading only when pushing to main (#2480)
isaac-chung Apr 2, 2025
f5881b0
fix table in tasks.md (#2483)
ayush1298 Apr 3, 2025
9117c2f
Update tasks table
github-actions[bot] Apr 3, 2025
7d4302e
fix: add prompt to NanoDBPedia (#2486)
seongtaehong Apr 4, 2025
f23efd9
1.36.36
invalid-email-address Apr 4, 2025
c2cbdac
Fix Task Lang Table (#2487)
ayush1298 Apr 4, 2025
8d87f41
fix: Ignore datasets not available in tests (#2484)
KennethEnevoldsen Apr 4, 2025
09e763d
1.36.37
invalid-email-address Apr 4, 2025
cc3ad3b
[MIEB] align main metrics with leaderboard (#2489)
Samoed Apr 4, 2025
944fed7
typo in model name (#2491)
ayush1298 Apr 5, 2025
ef59031
SpeedTask add deprecated warning (#2493)
Samoed Apr 5, 2025
315522c
Docs: Update README.md (#2494)
isaac-chung Apr 5, 2025
deb4766
fix transformers version for now (#2504)
isaac-chung Apr 6, 2025
77bef06
Fix typos (#2509)
Muennighoff Apr 6, 2025
cb2825c
ci: refactor TaskMetadata eval langs test (#2501)
isaac-chung Apr 7, 2025
e7d67c5
rename to ImageClustering folder (#2516)
isaac-chung Apr 7, 2025
2e612e4
Clean up trailing spaces citation (#2518)
isaac-chung Apr 7, 2025
2356e49
[mieb] Memotion preprocessing code made more robust and readable (#2519)
gowitheflow-1998 Apr 7, 2025
2d15895
fix: validate lang code in ModelMeta (#2499)
ayush1298 Apr 8, 2025
efcbbe1
Update pyproject.toml (#2522)
Samoed Apr 8, 2025
aceb995
1.36.38
invalid-email-address Apr 8, 2025
d53e585
Fix leaderboard version (#2524)
Samoed Apr 8, 2025
d7a70fc
Fix gte-multilingual-base embed_dim (#2526)
tolgayan Apr 9, 2025
fc6ee95
[MIEB] Specify only the multilingual AggTask for MIEB-lite (#2539)
isaac-chung Apr 12, 2025
06da74e
[mieb] fix hatefulmemes (#2531)
gowitheflow-1998 Apr 12, 2025
7fcb582
Model conan (#2534)
lllsy12138 Apr 13, 2025
c52690d
fix: Update mteb.get_tasks with an exclude_aggregate parameter to exc…
Sid-MB Apr 14, 2025
81bccef
1.36.39
invalid-email-address Apr 14, 2025
99c22b5
docs: Add MIEB citation in benchmarks (#2544)
isaac-chung Apr 15, 2025
f2f37f8
Add 2 new Vietnamese Retrieval Datasets (#2393)
BaoLocPham Apr 15, 2025
b7e447a
Update tasks table
github-actions[bot] Apr 15, 2025
67881c4
fix: CacheWrapper per task (#2467)
flogrammer Apr 15, 2025
e6b1949
1.36.40
invalid-email-address Apr 15, 2025
58769c4
misc: move MMTEB scripts and notebooks to separate repo (#2546)
isaac-chung Apr 15, 2025
caa6e70
fix: Update requirements in JinaWrapper (#2548)
AlexeyVatolin Apr 15, 2025
cb86939
1.36.41
invalid-email-address Apr 15, 2025
75d3597
Docs: Add MIEB to README (#2550)
isaac-chung Apr 15, 2025
3ff993d
Add xlm_roberta_ua_distilled (#2547)
panalexeu Apr 15, 2025
1f82b59
fix me5 trainind data config to include xquad dataset (#2552)
torchtorchkimtorch Apr 16, 2025
8fe5742
feat: Added dataframe utilities to BenchmarkResults (#2542)
KennethEnevoldsen Apr 16, 2025
cab4687
1.37.0
invalid-email-address Apr 16, 2025
4a6e539
fix e5_R_mistral_7b (#2490)
ayush1298 Apr 16, 2025
50d7e9e
fix unintentional working of filters on leaderboard (#2535)
ayush1298 Apr 16, 2025
0ab947b
feat: UI Overhaul (#2549)
x-tabdeveloping Apr 17, 2025
5f42ce4
1.38.0
invalid-email-address Apr 17, 2025
5ed6773
add USER2 (#2560)
Samoed Apr 19, 2025
4b755a3
Fix leaderboard entry for BuiltBench (#2563)
mehrzadshm Apr 19, 2025
f7072d5
fix: jasper models embeddings having nan values (#2481)
yjoonjang Apr 20, 2025
91c31d1
1.38.1
invalid-email-address Apr 20, 2025
d475c7e
fix frida datasets (#2565)
Samoed Apr 20, 2025
f11ac2a
Add relle (#2564)
24September Apr 20, 2025
c0d3ca0
Backfill task metadata for metadata for GermanDPR and GermanQuAD (#2566)
KTFish Apr 21, 2025
e56aab5
Update tasks table
github-actions[bot] Apr 21, 2025
713635a
Add ModelMeta for CodeSearch-ModernBERT-Crow-Plus (#2570)
Shun0212 Apr 22, 2025
adfd92a
Docs: Improve MIEB docs (#2569)
isaac-chung Apr 23, 2025
235906b
Add missing annotations (#2498)
ayush1298 Apr 24, 2025
e03333f
Update tasks table
github-actions[bot] Apr 24, 2025
fa5f034
move icon & name to benchmark dataclass (#2573)
Samoed Apr 24, 2025
951bae3
Remove the comments from ImageEncoder (#2579)
KennethEnevoldsen Apr 26, 2025
0737e78
fix: Add Encodechka benchmark (#2561)
Samoed Apr 27, 2025
4f23d62
Update tasks table
github-actions[bot] Apr 27, 2025
821fbb0
1.38.2
invalid-email-address Apr 27, 2025
b1606ff
fix FlagEmbedding package name (#2588)
Samoed Apr 28, 2025
ca10bac
fix codecarbon version (#2587)
Samoed Apr 28, 2025
7b6d9d7
Add MIEB image only benchmark (#2590)
isaac-chung Apr 28, 2025
039a965
Add image only MIEB benchmark to LB left panel (#2596)
isaac-chung Apr 30, 2025
0bda363
update Doubao-1.5-Embedding (#2575)
namespace-Pt Apr 30, 2025
afb72ac
fix: Add WebSSL models (#2604)
isaac-chung May 1, 2025
5a74754
fix mieb citation (#2606)
Samoed May 1, 2025
9da45fb
1.38.3
invalid-email-address May 1, 2025
7eba525
Update Doubao-1.5-Embedding (#2611)
namespace-Pt May 1, 2025
c020ebb
CI: update benchmark table (#2609)
Samoed May 1, 2025
7bc22e2
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
3a7b723
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
bcf532e
Update Doubao-1.5-Embedding revision (#2613)
namespace-Pt May 1, 2025
114c273
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
7060607
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
37f86e2
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
75db6fb
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
ad232aa
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
61c611f
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
f9b747f
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
8914793
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
0665cd2
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
5b34e6a
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
3703f11
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
c54e88f
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
72eea70
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
edb9c78
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
edbf218
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
296c1ee
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
f17902a
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
f4d72bc
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
607eb6f
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
bd9bb89
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
0eec584
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
90cd48a
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
046ecf0
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
2942557
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
9e5ce29
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
3fd7bec
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
b65f0ec
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
54b863e
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
cd83936
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
cd4670c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
2c2ed55
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
86069c7
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
952070e
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
917263c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
6432ea8
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
f1f09f8
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
2168d9c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
350181f
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
23b999d
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
0c24f8d
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
6ed9b90
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
6ae644c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
a2e14ae
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
c32f3a9
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
5aafe93
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
5d9332c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
18faed2
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
17ae76e
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
4dcffb9
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
fccc9b7
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
a2427d3
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
b78ea7d
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
49c33e2
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
da5bf31
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
1ae5750
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
d03650c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
630c5bb
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
e9b5706
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
5f1b3d0
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
936dafb
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
4e42192
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
e2d43cf
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
2dd457c
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
437b5e6
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
3e41806
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
daa7807
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
258dd4e
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
758be74
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
e731eaa
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
8e7d4f4
Update tasks & benchmarks tables
github-actions[bot] May 1, 2025
422fca2
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
ca5c3ad
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
bd7e85a
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
ab42110
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
000d5bf
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
e4935e2
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
5694e30
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
5f4daf5
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
b720cfd
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
62a967b
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
4b97a83
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
b59392d
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
309c51f
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
ee272f2
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
5b65218
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
36e4172
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
11b5b33
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
4fd00cc
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
480ba52
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
44cec12
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
b620a12
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
4584831
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
df48ec9
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
e47b902
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
d92e507
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
43b364f
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
4844ab5
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
17120b2
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
1c1e179
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
e228e94
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
26cb06c
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
eb30080
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
979716d
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
60e1d2e
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
b0c9e63
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
73afd47
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
94e7585
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
b2bfa6b
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
69937da
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
20baefb
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
4d09a1a
CI: fix table (#2615)
Samoed May 2, 2025
603aa5b
Update tasks & benchmarks tables
github-actions[bot] May 2, 2025
eabd9a5
Update gradio version (#2558)
Samoed May 2, 2025
f063638
fix: Removed missing dataset for MTEB(Multilingual) and bumped version
KennethEnevoldsen May 2, 2025
82dcb3d
CI: fix infinitely committing issue (#2616)
Samoed May 2, 2025
cb57999
Add ScandiSent dataset (#2620)
isaac-chung May 2, 2025
485941b
Merge branch 'main' of https://github.com/embeddings-benchmark/mteb
KennethEnevoldsen May 2, 2025
2ecd7ad
lint
KennethEnevoldsen May 2, 2025
9cfa2e8
1.38.4
invalid-email-address May 2, 2025
e0c2dc9
Format all citations (#2614)
AlexeyVatolin May 2, 2025
54eb70e
fix citations (#2628)
Samoed May 2, 2025
a52ea2f
Add Talemaader pair classification task (#2621)
imenelydiaker May 3, 2025
9711091
Merge remote-tracking branch 'origin/main' into maeb-merge-in-main-20…
isaac-chung May 3, 2025
8182f92
fix citations
isaac-chung May 3, 2025
02cb120
fix citations
isaac-chung May 3, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
10 changes: 7 additions & 3 deletions .github/workflows/docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,9 @@ on:
branches: [main]
pull_request:

permissions:
contents: write

jobs:
create-table-on-pr:
if: github.event_name == 'pull_request'
Expand All @@ -32,8 +35,6 @@ jobs:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
with:
token: ${{ secrets.RELEASE }}

- uses: actions/setup-python@v4
with:
Expand All @@ -49,6 +50,8 @@ jobs:
make build-docs

- name: Push table
env:
GITHUB_TOKEN: ${{ github.token }}
run: |
git config --global user.email "github-actions[bot]@users.noreply.github.com"
git config --global user.name "github-actions[bot]"
Expand All @@ -57,6 +60,7 @@ jobs:
echo "No changes detected"
else
git add docs/tasks.md
git commit -m "Update tasks table"
git add docs/benchmarks.md
git commit -m "Update tasks & benchmarks tables"
git push
fi
1 change: 1 addition & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -38,6 +38,7 @@ build-docs:
@echo "--- 📚 Building documentation ---"
# since we do not have a documentation site, this just build tables for the .md files
python docs/create_tasks_table.py
python docs/create_benchmarks_table.py


model-load-test:
Expand Down
11 changes: 7 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -78,8 +78,9 @@ The following links to the main sections in the usage documentation.
| **General** | |
| [Evaluating a Model](docs/usage/usage.md#evaluating-a-model) | How to evaluate a model |
| [Evaluating on different Modalities](docs/usage/usage.md#evaluating-on-different-modalities) | How to evaluate image and image-text tasks |
| [MIEB](docs/mieb/readme.md) | How to run the Massive Image Embedding Benchmark |
| **Selecting Tasks** | |
| [Selecting a benchmark](docs/usage/usage.md#selecting-a-benchmark) | How to select and filter tasks |
| [Selecting a benchmark](docs/usage/usage.md#selecting-a-benchmark) | How to select benchmarks |
| [Task selection](docs/usage/usage.md#task-selection) | How to select and filter tasks |
| [Selecting Split and Subsets](docs/usage/usage.md#selecting-evaluation-split-or-subsets) | How to select evaluation splits or subsets |
| [Using a Custom Task](docs/usage/usage.md#using-a-custom-task) | How to evaluate on a custom task |
Expand All @@ -96,7 +97,8 @@ The following links to the main sections in the usage documentation.
| **Leaderboard** | |
| [Running the Leaderboard Locally](docs/usage/usage.md#running-the-leaderboard-locally) | How to run the leaderboard locally |
| [Report Data Contamination](docs/usage/usage.md#annotate-contamination) | How to report data contamination for a model |
| [Fetching Result from the Leaderboard](docs/usage/usage.md#fetching-results-from-the-leaderboard) | How to fetch the raw results from the leaderboard |
| [Loading and working with Results](docs/usage/results.md) | How to load and working with the raw results from the leaderboard, including making result dataframes |



## Overview
Expand All @@ -107,8 +109,8 @@ The following links to the main sections in the usage documentation.
| 📋 [Tasks] | Overview of available tasks |
| 📐 [Benchmarks] | Overview of available benchmarks |
| **Contributing** | |
| 🤖 [Adding a model] | Information related to how to submit a model to MTEB and to the leaderboard |
| 👩‍🔬 [Reproducible workflows] | Information related to how to create reproducible workflows with MTEB |
| 🤖 [Adding a model] | How to submit a model to MTEB and to the leaderboard |
| 👩‍🔬 [Reproducible workflows] | How to create reproducible workflows with MTEB |
| 👩‍💻 [Adding a dataset] | How to add a new task/dataset to MTEB |
| 👩‍💻 [Adding a benchmark] | How to add a new benchmark to MTEB and to the leaderboard |
| 🤝 [Contributing] | How to contribute to MTEB and set it up for development |
Expand Down Expand Up @@ -172,3 +174,4 @@ Some of these amazing publications include (ordered chronologically):
- Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li. "[LongEmbed: Extending Embedding Models for Long Context Retrieval](https://arxiv.org/abs/2404.12096)" arXiv 2024
- Kenneth Enevoldsen, Márton Kardos, Niklas Muennighoff, Kristoffer Laigaard Nielbo. "[The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding](https://arxiv.org/abs/2406.02396)" arXiv 2024
- Ali Shiraee Kasmaee, Mohammad Khodadad, Mohammad Arshi Saloot, Nick Sherck, Stephen Dokas, Hamidreza Mahyar, Soheila Samiee. "[ChemTEB: Chemical Text Embedding Benchmark, an Overview of Embedding Models Performance & Efficiency on a Specific Domain](https://arxiv.org/abs/2412.00532)" arXiv 2024
- Chenghao Xiao, Isaac Chung, Imene Kerboua, Jamie Stirling, Xin Zhang, Márton Kardos, Roman Solomatin, Noura Al Moubayed, Kenneth Enevoldsen, Niklas Muennighoff. "[MIEB: Massive Image Embedding Benchmark](https://arxiv.org/abs/2504.10471)" arXiv 2025
Loading