feat: Introduce MAEB #3470
Merged
235 commits
32b7af8
Started the following:
sufen-f 8eff2c6
Minor changes and linted files. #2093
sufen-f 53a2e36
Minor changes and linted files. #2093
sufen-f ed93f2b
Minor changes and linted files. #2093
sufen-f fbab033
Refs #2068: Initial Implementation of audio-text retrieval abstask an…
imadtyx d39e187
Added MockAudioClustering task + MockAudioEncoder for testcase
alisartazkhan bcca37f
MockAudioClustering + MockAudioEncoder (#2093)
2a238ed
Added wav2vec model wrapper
alisartazkhan 7816974
Added subTask with small sample of dataset for testing
07f53b1
Added four w2v variants
alisartazkhan 882af38
Update wav2vec_models.py
alisartazkhan daeada0
Added wav2vec (5), wavlm (7), and whisper (5) models
alisartazkhan c1ebf2a
Added revisions from HF to wav2vec models, added silhouette score, DB…
sufen-f 716deed
Update mteb/models/wavlm_models.py
alisartazkhan ce1bee9
setting up colab
sufen-f 4cf7e6f
Merge remote-tracking branch 'origin/maeb' into maeb
sufen-f 545b938
added a2a
ed978fa
PCA + hidden layer + shuffling
1616ba9
New task: emotion clustering
ac14d16
Added qwen2 model
alisartazkhan 1302477
Added Wav2Vec model, voice clustering task, VoxCeleb dataset subset (…
sufen-f 4f23fdf
Merge branch 'maeb' into maeb
sufen-f ee10191
Revert "Maeb - added voice clustering task, wav2vec model and VoxCele…
sufen-f f1449c0
Revert "Revert "Maeb - added voice clustering task, wav2vec model and…
sufen-f d731d40
Revert "Revert "Revert "Maeb - added voice clustering task, wav2vec m…
sufen-f a0de4fc
Add Audio (Multi Label) Classification Abstask, Baseline Audio model,…
anime-sh 0620c58
Add ESC50 and zero-shot classification (#2133)
RahulSChand 6d9eca3
Add unfused clap model for zero-shot (#2269)
RahulSChand 2188585
Add new and complete version of FSD50K multi-label audio classificati…
RahulSChand bdefb14
added large, music and speech clap models (#2284)
RahulSChand 2e5dc67
add AbsTaskAudioClassification, ESC50 & GunshotTriangulation datasets…
silky1708 bf9fe16
Add NSynth dataset (#2306)
silky1708 a94ea50
Add urbansound8k for zero-shot (#2292)
RahulSChand 52a88ae
Add Emotion classification Ravdess dataset (#2320)
RahulSChand cd07f24
[MAEB] main merge (#2341)
isaac-chung ef30e3d
adding GTZAN Genre dataset (#2307)
silky1708 5cf3840
Adding Beijing Opera dataset (#2356)
silky1708 368e720
update TaskMetadata from mteb:maeb
silky1708 25136ba
make pr
silky1708 79e06fe
update ruff to 0.9.7; make lint
silky1708 f85627f
update TaskMetadata from mteb:maeb
silky1708 0cf07f4
update TaskMetadata
silky1708 7460a13
add Mridingham datasets
silky1708 d5caae6
rm comment
silky1708 187d7bc
Adding Libricount dataset (#2361)
silky1708 3bae6b6
Adding Crema-D Dataset for emotion classification [HEAR] (#2368)
silky1708 307aa57
Adding FSDD dataset (Free Spoken Digit Dataset) (#2371)
silky1708 6ad0bc2
Add VoxCelebSA, SpokenQAforIC, VehicleSoundClustering from Dynamic-SU…
diffunity 230064a
fix FSD-50K Task Metadata, Label handling and add stratified subsampl…
anime-sh 89ab596
Add music clustering dataset (#2232)
mina-parham f3a0403
[MAEB] merge main -> maeb (#2471)
isaac-chung 5af86e5
Create AbsTask and Evaluator for audio pair classification task (#2457)
switchpiggy 01c462d
Add Language, Gender, and Age classifcation tasks based on common-la…
anime-sh 5acab7f
Merge main into MAEB (#2488)
isaac-chung 31925c5
added wavlm models (#2472)
alisartazkhan 7e57e9d
Adding SIB-FLEURS (#2357)
diffunity 991a0fc
update wavlm models
alisartazkhan 5fc6e4d
update wavlm models
alisartazkhan 14f6b41
Add files via upload
mnasser3 9eaca21
Update whisper_models.py license format
mnasser3 040d5c6
Updated wavlm and whisper models to fit maeb structure (#2572)
alisartazkhan aba957c
Delete mteb/abstasks/Image/AbsTaskZeroshotClassification.py
isaac-chung 2fada5b
[MAEB] Merge in main 20250503 (#2635)
isaac-chung 4c53823
Added SpeechCommands Dataset (Subset) (#2645)
AdnanElAssadi56 804be31
Added ESC50 Clustering Dataset (#2652)
AdnanElAssadi56 e1bc62f
Added Qwen2-7b (#2660)
alisartazkhan 41b4c45
Added the IEMOCAP Datasets (#2640)
AdnanElAssadi56 4cd81ce
Add sew-d and unispeech models
sufen-f 1163e62
Add sew-d and unispeech models
sufen-f cef8d57
Merge branch 'model_development' into maeb
sufen-f 2d25266
Revert "Merge branch 'model_development' into maeb"
sufen-f 0fb74db
Reapply "Merge branch 'model_development' into maeb"
sufen-f a2e6cf2
Revert to 41b4c451d48ca1234b508a5972662dc0c25573fa
sufen-f 390b867
Add sew-d and unispeech models #2693 #2694 (#2701)
sufen-f 6f15209
Added Minds14 Dataset (#2644)
AdnanElAssadi56 17197e0
Added Hubert Models (#2689)
AdnanElAssadi56 ee8e26f
Added AST Model (#2691)
AdnanElAssadi56 95a03f7
Added Data2Vec Models (#2690)
AdnanElAssadi56 645255b
Adding BirdSet dataset
imadtyx e067d88
Update __init__.py to include BirdSet dataset(s)
imadtyx 1afb4ac
MAEB: Encodec Model (#2754)
AdnanElAssadi56 d4b9abd
MAEB: MMS Models (#2750)
AdnanElAssadi56 cf51d8f
MAEB: Seamlessm4t Model (V2) (#2751)
AdnanElAssadi56 439ee37
[MAEB] CNN14 Model (PANNs) (#2757)
AdnanElAssadi56 6e434aa
Added TutAcoustic Scenes Dataset (#2647)
AdnanElAssadi56 88436e3
MAEB: M-CTC-T Model (#2753)
AdnanElAssadi56 c5d8484
Added GTZAN Clustering Dataset (#2653)
AdnanElAssadi56 1af8eb1
Added AmbientAcousticContext Dataset (#2642)
AdnanElAssadi56 69d67e4
Added Crema_d Dataset (#2651)
AdnanElAssadi56 cd7c6e9
Added VoxCeleb Clustering Dataset (#2654)
AdnanElAssadi56 eb173b9
Audio Reranking Abstask+ Evaluator + Mini/Dummy AudioCaps Subset (#2744)
AdnanElAssadi56 31f38f2
Added 5 datasets for audio pair classification (#2463)
kkaitlyn111 ece46da
Adds SpokeN-100-English (#2342)
mina-parham 89563e1
Adds VocalSound dataset (#2337)
mina-parham 9114dc6
Added Birdclef Subset Dataset (#2641)
AdnanElAssadi56 c383316
Merge branch 'maeb' of github.com:embeddings-benchmark/mteb into maeb
isaac-chung a81eec3
lint
isaac-chung e990850
Added VoxPopuli Datasets (#2648)
AdnanElAssadi56 6bc4c5a
added SpeechCommand dataset and Keyword spotting task (#2329)
RahulSChand bdbe51f
[MAEB] Merge from main up to 1.38.30 (#2840)
isaac-chung 5510897
Added Yamnet and VGGish models (#2687)
ayush1298 3c464f9
Add urbansound 8k linear probing (#2845)
isaac-chung a4842d5
add stratified_subsampling to Audio clustering datasets (#2854)
isaac-chung 1453ad6
Audio Reranking Eval Update + 5 Reranking Datasets (#2849)
AdnanElAssadi56 73c9d2c
[MAEB] Sync with 1.38.33 (#2883)
isaac-chung 8a8a101
MAEB Classification Datasets Downsampling/Formatting + MTEB UPLOAD (#…
AdnanElAssadi56 c7b8542
Merge main maeb 07 10 (#2894)
Samoed 74bdc03
merge main
Samoed 8f8577f
SibFluers Dataset Multilingual Extention (#2890)
AdnanElAssadi56 f1eb63c
Implemented Audio Any2AnyRetrieval + 3 Datasets for A2A, A2T, T2A (#2…
kkaitlyn111 ab0899c
[MAEB] encode() for audio-only models should raise error (#2914)
isaac-chung f619034
fix: add missing clap model handling
isaac-chung 4e79b1a
dataset: add Clotho by creating the datasets on the fly (#2915)
isaac-chung 6b37b71
dataset: Add SoundDescs (#2911)
isaac-chung a19e7b4
Audio Retrieval Dataset: UrbanSound8K (#2920)
AdnanElAssadi56 698500d
Audio Retrieval Dataset: MACS (#2921)
AdnanElAssadi56 ca4b73c
SpeechT5 Model (#2901)
AdnanElAssadi56 6671fcc
MAEB Model MSCLAP (#2902)
AdnanElAssadi56 dd6a76a
MAEB Model Wav2Clip (#2908)
AdnanElAssadi56 7e1fb93
Audio Retrieval Dataset: EmoVDB (#2923)
AdnanElAssadi56 48febd1
MAEB Model MuQ-MuLan (#2909)
AdnanElAssadi56 7801759
fix encode() in audio models (#2926)
isaac-chung 7a4be45
Audio Retrieval Dataset: HiFiTTS (#2924)
AdnanElAssadi56 8a01d4e
Audio Retrieval Dataset: MusicCaps (#2918)
AdnanElAssadi56 53071b3
Audio Retrieval Dataset: CMU-Arctic (#2929)
AdnanElAssadi56 b087dfe
Audio Models Batch Fix (#2932)
AdnanElAssadi56 aadd51e
Add AudioSet and AudioSetMini (#2952)
isaac-chung b875aa2
[MAEB] Fix whisper model audio inference (#2954)
isaac-chung 54561ed
Common voice (#2951)
hepengfe d841b33
fleurs retrieval tasks (#2976)
hepengfe 069b294
MAEB Model Evaluation Fixes (#2956)
AdnanElAssadi56 671be23
Fix ClothoA2T modality (#2988)
isaac-chung 49528b6
Revert "MAEB Model Evaluation Fixes" (#2993)
isaac-chung c7278c9
Audio Retrieval Dataset: AudioSet-Strong (#2931)
AdnanElAssadi56 5b827d9
Audio Retrieval Dataset: GigaSpeech (#2925)
AdnanElAssadi56 017c2be
Audio Retrieval Dataset: LibriTTS (#2917)
AdnanElAssadi56 4b992c9
Maeb main merge 26 08 (#3076)
Samoed 21c2fce
Spoken SQuAD - MAEB (#3074)
arteemg 53b8b62
Main merge for maeb -> 1.38.52 (#3109)
isaac-chung 4c06b59
Fix VocalSound split naming (#3108)
isaac-chung 0f64441
Audio Retrieval Dataset: JLCorpus (#2927)
AdnanElAssadi56 81b621a
MAEB Models Eval Fixes 2 (#3010)
AdnanElAssadi56 7bca59f
MAEB Models Eval Fixes 3 (#3184)
AdnanElAssadi56 9f1c7a6
Maeb merge main v2 (#3447)
Samoed 3f83aed
Merge branch 'main' into maeb
Samoed e80affe
Merge branch 'main' into maeb
Samoed 96e3631
clenup
Samoed dad573a
make maeb tasks importable (#3496)
Samoed 437eb78
Refactor tasks and models to new interface (#3497)
Samoed 93fc653
[MAEB] Merge with `AbsRetrieval` (#3528)
Samoed 603f3e8
Merge branch 'main' into maeb
Samoed a6026d2
[MAEB] merge zeroshot classification (#3580)
Samoed 54c06a2
[MAEB] merge clustering (#3582)
Samoed eea0678
[MAEB] Merge pair classification (#3577)
Samoed 3a79938
[MAEB] Merge `AudioReranking` with `Retrieval` (#3570)
Samoed 3141664
[MAEB] Make `Qwen2-Audio` support text (#3581)
Samoed 9f9f865
[MAEB] Merge classification (#3590)
Samoed 892a6bf
[MAEB] Merge multilabel classification (#3614)
Samoed 21624a6
[MAEB] CLAP Token Length Error from Fleurs (#3710)
AdnanElAssadi56 60fa898
Merge branch 'main' into maeb
Samoed f532b54
fix task
Samoed fc753ff
fix task
Samoed 1c68b37
Add torch.no_grad to speecht5 (#3862)
AdnanElAssadi56 8e22fc3
Add proper text batching to Muq Model (#3856)
AdnanElAssadi56 ae69880
Update music_caps dataset path (#3854)
AdnanElAssadi56 6589273
Fix Sewd Model revision (#3857)
AdnanElAssadi56 0b03b14
Add Safey check for vggish (#3850)
AdnanElAssadi56 a36139f
Fix sound_desc dataset name typo (#3859)
AdnanElAssadi56 5e0318e
[MAEB] Ensure that Yamnet handles empty audio snippets (#3846)
AdnanElAssadi56 a5abf11
[MAEB] Add AST Model minimum length (#3861)
AdnanElAssadi56 033cf3d
fix: Add text modality to audio zeroshot classification tasks (#3878)
isaac-chung f719ed3
[MAEB] merge from main again (#3873)
isaac-chung 13a0f6c
[MAEB] init imports fix (#3845)
AdnanElAssadi56 c057e08
[MAEB] Commonvoice Task Fix (#3847)
AdnanElAssadi56 5ee0efe
[MAEB] fix speechcommands column name (#3853)
AdnanElAssadi56 f00fadf
Fix label column name + Filter out empty audio samples (#3860)
AdnanElAssadi56 83c77bc
Use zxx-Zxxx language code for non-human audio tasks (#3880)
isaac-chung a5c3751
[MAEB] Model Meta Revision Fix (#3868)
AdnanElAssadi56 4f4792d
Merge branch 'main' into maeb
Samoed 9e620b7
[MAEB] Add mctct safety check (#3848)
AdnanElAssadi56 782ce45
[MAEB] Add proper wav2clip text encoder (#3849)
AdnanElAssadi56 2f335aa
[MAEB] Make seemlessm45 model more efficient (#3851)
AdnanElAssadi56 e26db1d
[MAEB] Add monkey_patching for cnn14 to handle torchaudio versions (#…
AdnanElAssadi56 25e6c23
[MAEB] Use new subsampled version of fsd50_mini (#3865)
AdnanElAssadi56 357ac90
[MAEB] Remove AudioCapsMiniReranking Task (#3883)
AdnanElAssadi56 87cf288
fix name of evaluator for multilabel tasks (#3882)
Samoed f634317
Update audio datasets card generation (#3884)
Samoed fcb5e77
[MAEB] Add Msclap Workaround using temp files to match official model…
AdnanElAssadi56 8477591
[MAEB] Add safety checks for encodec model (#3858)
AdnanElAssadi56 3d17dbc
Audio statistics (#3833)
Samoed 96fdf25
Remove maeb results folder (sym link) (#3898)
Samoed 7645df8
[MAEB] Move non-linguistic audio tasks from eng to zxx folders (#3900)
isaac-chung edf141a
Add GoogleSVQ retrieval task with all 26 locales (#3907)
isaac-chung f5856e0
Task type consistency (#3955)
isaac-chung d21d709
Add new dataset Expresso, Globe V2 (age + gender) (#3904)
diffunity 2ab715e
Filter out empty samples in voice_gender_clustering (#3864)
AdnanElAssadi56 cdc1c37
add tau 2022 mobile development set (#3966)
diffunity 4d6d411
fix category
Samoed e71b488
Add LASS and CSTR-VCTK (accents + gender) datasets (#3942)
diffunity 4a1e4c3
Compute statistics (#3903)
Samoed 10d8f37
Add mini to common voice tasks (#3964)
Samoed 2a98d8d
Reupload tasks with trust remote code (#3961)
Samoed d51d19a
[MAEB] Msclap Eval Error Fix (#3943)
AdnanElAssadi56 eca03ed
[MAEB] Filtered VoxPopuliAccentPairCls Dataset (#3976)
AdnanElAssadi56 1e90c1a
[MAEB] Speedup SVQ, Common voice, Clotho, Fleurs datasets loading (#…
Samoed 566a6d9
[MAEB] Find tasks with empty samples (#3978)
Samoed b5f9d25
[MAEB] adding`LCO` models (#3963)
gowitheflow-1998 291eedc
add urban sound
Samoed 2b2a83f
[MAEB] add models citations (#3980)
Samoed f0b2fe2
fix fleurs citation (#4010)
Samoed 0cb5c05
Fix citations (#4012)
Samoed a87dcde
MAEB task selection (#3867)
isaac-chung 25cea64
Merge main 3001 (#4019)
Samoed 7fd8642
Merge branch 'main' into maeb
Samoed d43dcac
move maeb scripts to sep repo (#4028)
isaac-chung 79f9d2b
MAEB reference models (#4026)
isaac-chung 290f7c1
Support datasets v4 (#4004)
Samoed 75609f0
Merge branch 'main' into maeb
Samoed e8532cb
[maeb] Move datasets to mteb org (#4048)
Samoed cea6599
[MAEB] Move tasks to tasks folder (#4057)
Samoed 71d45e3
Merge branch 'main' into maeb
Samoed 5fd436e
Make birdset dataset handling more efficient (#3863)
AdnanElAssadi56 f193eed
skip test audio models
Samoed c0efa80
[MAEB] Use collator for processing (#4069)
Samoed 6120b00
[MAEB] Add GLOBE v3 dataset (#4091)
diffunity 4d54d5a
Add audio task installation instructions to docs (#4097)
isaac-chung b1f55e9
Merge remote-tracking branch 'origin/main' into maeb
isaac-chung 165514f
Update uv.lock
isaac-chung 5299695
Merge branch 'main' into maeb
isaac-chung 29136c1
Fix _update_description call sites to match new signature
isaac-chung 7ac00fa
try install runtime libraries that provide libav*.so
isaac-chung 744bd2f
fix pyproject (#4098)
Samoed 864461a
remove vllm duplicate
Samoed 90afbff
Update .github/workflows/test.yml
isaac-chung 91460f8
fix windows ci (#4100)
Samoed
@@ -163,3 +163,5 @@ powermetrics_log.txt
/docs/overview/available_benchmarks.md

CLAUDE.md

*.tex
We also need to update whats_new.md.