Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB #217

YanshekWoo · 2025-06-05T09:20:48Z

Checklist

My model has a model sheet, report or similar: KaLM-Embedding
My model has a reference implementation in mteb/models/ this can be as an API. Instruction on how to add a model can be found here
- No, but there is an existing PR: add kalm_models ModelMeta
The results submitted is obtained using the reference implementation
My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
I solemnly swear that for all results submitted I have not on the evaluation dataset including training splits. If I have I have disclosed it clearly.

Signed-off-by: root <root@TENCENT64.site>

KennethEnevoldsen · 2025-06-05T11:15:10Z

This is dependent on the PR, where an implementation is missing, which @Samoed will add. Once added, we should probably test that the score aligns so I will assign this to @Samoed. @Samoed feel free to request a review at the end if you need a second pair of eyes on it

YanshekWoo · 2025-06-13T12:20:54Z

@Samoed
Excuse me, I'd like to know the progress of this Merge Request.
Are there any difficulties or areas where you need my clarification or assistance?

Samoed · 2025-06-13T12:28:22Z

We need to integrate your model firstly. Can you review metric results from our current implementation embeddings-benchmark/mteb#2478 (comment)
@YanshekWoo

YanshekWoo · 2025-06-23T10:38:36Z

Before merging the results, can we update them first?

By the way, could we raise an issue here regarding the evaluation instructions for MTEB?

Samoed · 2025-06-23T10:48:46Z

By the way, could we raise an issue here regarding the evaluation instructions for MTEB?

I'm not sure what you want. Can you update your implementation in embeddings-benchmark/mteb#2775 using new loader for Kalm models from embeddings-benchmark/mteb#2478?

YanshekWoo · 2025-06-23T11:37:36Z

I'm not sure what you want. Can you update your implementation in embeddings-benchmark/mteb#2775 using new loader for Kalm models from embeddings-benchmark/mteb#2478?

The Instruction we evaluated has been updated, so the results for each dataset will also change accordingly.

We used SentenceTransformer for loading and specifying prompts during our evaluation, but we found that there is an issue with the current mteb code when specifying specific task instructions:
if the task dataset contains a "-", there will be an error in parsing the task_name, such as when specifying "NQ-PL-query".

https://github.com/embeddings-benchmark/mteb/blob/d7ff1ab3168ddf765d497b22e928af355f0cffe6/mteb/models/wrapper.py#L77

Samoed · 2025-06-23T12:02:50Z

The Instruction we evaluated has been updated, so the results for each dataset will also change accordingly.

Can you update them in your implementation?

if the task dataset contains a "-", there will be an error in parsing the task_name, such as when specifying "NQ-PL-query".

I see. I will fix it

YanshekWoo · 2025-06-24T02:05:25Z

Can you update them in your implementation?

Yes. I will update it today.

* update gte-modernbert-base * update CQADupstackRetrieval for gte-modernbert-base * add Qwen3-Embedding results * add results for Qwen3 Embedding * update FloresBitextMining.json results * udpate model revsion

* add geoembedding results * rename geoembedding result file * minor fix --------- Co-authored-by: zhangzeqing <zhangzeqing@zhejianglab.com> Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

* add cadet results * add revision and model metadata

* add xyz model * add xyz model * use current mteb version * use current mteb version

* Update LGAI-Embedding results Submit results * Update model_meta.json updated

* init automate script * bump versions * fix script name * add tabulate * add resutls for test model * handle no result on task * fix function * fix function * remove testuser * fix script help * try to run only one model * install from sources * update script * format * fix typo * fetch main * fetch main in script * remove revision check * fix reference models arg * bump python version

* update script * remove comment

* update results * mv to revision_id * change model_meta * update revision_id * fix meta * update on scores --------- Co-authored-by: Kolodin Egor <eikolodin@sberbank.ru>

* add geoembedding results * rename geoembedding result file * minor fix --------- Co-authored-by: zhangzeqing <zhangzeqing@zhejianglab.com> Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

YanshekWoo · 2025-06-25T07:02:53Z

Can you update them in your implementation?

I have created a new PR of the model card at: embeddings-benchmark/mteb#2850

And the new results with new instruction are updated in new PR: #227

root added 3 commits June 5, 2025 16:07

feat: add KaLM-Team/KaLM_Embedding_X_0605

f54cb2d

Signed-off-by: root <root@TENCENT64.site>

feat: add KaLM-Team/KaLM_Embedding_X_0605

b05f28e

Signed-off-by: root <root@TENCENT64.site>

feat: add KaLM-Team/KaLM-Embedding-X-0605

4086595

Signed-off-by: root <root@TENCENT64.site>

YanshekWoo changed the title ~~feat: add KaLM-Team/KaLM_Embedding_X_0605 model results in MMTEB~~ feat: add KaLM-Team/KaLM_Embedding-X-0605 model results in MMTEB Jun 5, 2025

YanshekWoo changed the title ~~feat: add KaLM-Team/KaLM_Embedding-X-0605 model results in MMTEB~~ Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB Jun 5, 2025

KennethEnevoldsen assigned Samoed Jun 5, 2025

KennethEnevoldsen added the waiting for review of implementation This PR is waiting for an implementation review before merging the results. label Jun 16, 2025

Samoed mentioned this pull request Jun 23, 2025

fix: prompt validation for tasks with - embeddings-benchmark/mteb#2846

Merged

ll0ruc added 14 commits June 25, 2025 12:24

Add files via upload

cd2abb9

Add files via upload

2b3c9b2

Add files via upload

8ccd077

Add files via upload

4b525e1

Add files via upload

8b5588f

Add files via upload

c630143

Add files via upload

b7da47c

Add files via upload

bff6c96

Add files via upload

d7b5fec

Add files via upload

179adf8

Add files via upload

d41d179

Add files via upload

ad687ca

Add files via upload

0f49e40

Add files via upload

26b1ff3

ll0ruc and others added 28 commits June 25, 2025 12:32

Update R2MEDMedicalSciencesRetrieval.json

72f9c83

Update R2MEDMedicalSciencesRetrieval.json

bc87377

Update R2MEDMedicalSciencesRetrieval.json

59491ac

Update R2MEDMedicalSciencesRetrieval.json

f2e3aed

Update R2MEDMedicalSciencesRetrieval.json

6fdabc7

Update R2MEDMedicalSciencesRetrieval.json

40c15c8

Update R2MEDMedicalSciencesRetrieval.json

62cdc4b

Update R2MEDMedicalSciencesRetrieval.json

6f7da65

Update R2MEDMedicalSciencesRetrieval.json

6f76ff8

Update R2MEDMedicalSciencesRetrieval.json

ac05aec

Update R2MEDMedicalSciencesRetrieval.json

1ea14a7

Update R2MEDMedicalSciencesRetrieval.json

d221fcf

Add results for Qwen3-Embedding series models (#214)

d6536f7

* update gte-modernbert-base * update CQADupstackRetrieval for gte-modernbert-base * add Qwen3-Embedding results * add results for Qwen3 Embedding * update FloresBitextMining.json results * udpate model revsion

fixed mistake in create pr results

959f3e3

add geoembedding results (#215)

de51683

* add geoembedding results * rename geoembedding result file * minor fix --------- Co-authored-by: zhangzeqing <zhangzeqing@zhejianglab.com> Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

add cadet results (#218)

60c42cf

* add cadet results * add revision and model metadata

add xyz model (#207)

e93844e

* add xyz model * add xyz model * use current mteb version * use current mteb version

Update LGAI-Embedding results (#219)

bdd314c

* Update LGAI-Embedding results Submit results * Update model_meta.json updated

Added encodechka results (#182)

79c71ca

update script (#222)

341be28

* update script * remove comment

update results for giga-embeddings-instruct (#208)

bc97323

* update results * mv to revision_id * change model_meta * update revision_id * fix meta * update on scores --------- Co-authored-by: Kolodin Egor <eikolodin@sberbank.ru>

add KaLM-Team__KaLM-Embedding-X-0605 with new instruct

2bd2521

add KaLM-Team__KaLM-Embedding-X-0605 with new instruct

cdbf82f

Add files via upload

19a6f35

merge

1601b6a

Add files via upload

47a248d

add geoembedding results (#215)

cf3ebf6

* add geoembedding results * rename geoembedding result file * minor fix --------- Co-authored-by: zhangzeqing <zhangzeqing@zhejianglab.com> Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>

YanshekWoo closed this Jun 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB #217

Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB #217

Uh oh!

YanshekWoo commented Jun 5, 2025

Uh oh!

KennethEnevoldsen commented Jun 5, 2025

Uh oh!

YanshekWoo commented Jun 13, 2025

Uh oh!

Samoed commented Jun 13, 2025 •

edited

Loading

Uh oh!

YanshekWoo commented Jun 23, 2025

Uh oh!

Samoed commented Jun 23, 2025 •

edited

Loading

Uh oh!

YanshekWoo commented Jun 23, 2025

Uh oh!

Samoed commented Jun 23, 2025

Uh oh!

YanshekWoo commented Jun 24, 2025

Uh oh!

YanshekWoo commented Jun 25, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB #217

Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB #217

Uh oh!

Conversation

YanshekWoo commented Jun 5, 2025

Checklist

Uh oh!

KennethEnevoldsen commented Jun 5, 2025

Uh oh!

YanshekWoo commented Jun 13, 2025

Uh oh!

Samoed commented Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

YanshekWoo commented Jun 23, 2025

Uh oh!

Samoed commented Jun 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

YanshekWoo commented Jun 23, 2025

Uh oh!

Samoed commented Jun 23, 2025

Uh oh!

YanshekWoo commented Jun 24, 2025

Uh oh!

YanshekWoo commented Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Samoed commented Jun 13, 2025 •

edited

Loading

Samoed commented Jun 23, 2025 •

edited

Loading

YanshekWoo commented Jun 25, 2025 •

edited

Loading