add results for Youtu-Embedding-V1#265
add results for Youtu-Embedding-V1#265KennethEnevoldsen merged 1 commit intoembeddings-benchmark:mainfrom
Conversation
Model Results ComparisonReference models: Results for
|
| task_name | Youtu-RAG/Youtu-Embedding-V1 | google/gemini-embedding-001 | intfloat/multilingual-e5-large | Max result |
|---|---|---|---|---|
| AFQMC | 0.6711 | nan | 0.3301 | 0.7225 |
| ATEC | 0.5989 | nan | 0.3981 | 0.6464 |
| BQ | 0.7401 | nan | 0.485 | 0.8125 |
| CLSClusteringP2P | 0.7580 | nan | nan | 0.8225 |
| CLSClusteringS2S | 0.7131 | nan | nan | 0.7387 |
| CMedQAv1-reranking | 0.9162 | nan | 0.6765 | 0.9434 |
| CMedQAv2-reranking | 0.9211 | nan | 0.6672 | 0.9353 |
| CmedqaRetrieval | 0.5742 | nan | 0.2866 | 0.5658 |
| Cmnli | 0.9015 | nan | nan | 0.9501 |
| CovidRetrieval | 0.9291 | 0.7913 | 0.7561 | 0.9606 |
| DuRetrieval | 0.9107 | nan | 0.853 | 0.9423 |
| EcomRetrieval | 0.7328 | nan | 0.5467 | 0.7764 |
| IFlyTek | 0.5273 | nan | 0.4186 | 0.5799 |
| JDReview | 0.9054 | nan | 0.8054 | 0.9169 |
| LCQMC | 0.7997 | nan | 0.7595 | 0.8240 |
| MMarcoReranking | 0.3890 | nan | 0.2912 | 0.4689 |
| MMarcoRetrieval | 0.8957 | nan | 0.792 | 0.9033 |
| MedicalRetrieval | 0.7324 | nan | 0.5144 | 0.7562 |
| MultilingualSentiment | 0.8089 | nan | 0.709 | 0.8536 |
| Ocnli | 0.8923 | nan | nan | 0.9513 |
| OnlineShopping | 0.9479 | nan | 0.9045 | 0.9716 |
| PAWSX | 0.6782 | nan | 0.1463 | 0.7009 |
| QBQTC | 0.5958 | nan | nan | 0.7145 |
| STSB | 0.8576 | 0.855 | 0.8236 | 0.9199 |
| T2Reranking | 0.7277 | 0.6795 | 0.6632 | 0.7283 |
| T2Retrieval | 0.8902 | nan | 0.7607 | 0.8926 |
| TNews | 0.6010 | nan | 0.488 | 0.6090 |
| ThuNewsClusteringP2P | 0.8698 | nan | nan | 0.8879 |
| ThuNewsClusteringS2S | 0.8459 | nan | nan | 0.8790 |
| VideoRetrieval | 0.8105 | nan | 0.5828 | 0.8384 |
| Waimai | 0.8980 | nan | 0.863 | 0.9231 |
| Average | 0.7755 | 0.7753 | 0.6051 | 0.8108 |
|
The model implementation has been added to |
|
@KennethEnevoldsen Could you please kindly provide an update on the current progress? |
|
Yes indeed! Thanks for the ping I don't see anything too problematic in the scores given that the training data annotations |
Checklist
mteb/models/this can be as an API. Instruction on how to add a model can be found here