Skip to content

Conversation

@YanshekWoo
Copy link
Contributor

Checklist

  • My model has a model sheet, report or similar: KaLM-Embedding
  • My model has a reference implementation in mteb/models/ this can be as an API. Instruction on how to add a model can be found here
  • The results submitted is obtained using the reference implementation
  • My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
  • I solemnly swear that for all results submitted I have not on the evaluation dataset including training splits. If I have I have disclosed it clearly.

root added 3 commits June 5, 2025 16:07
Signed-off-by: root <root@TENCENT64.site>
Signed-off-by: root <root@TENCENT64.site>
Signed-off-by: root <root@TENCENT64.site>
@YanshekWoo YanshekWoo changed the title feat: add KaLM-Team/KaLM_Embedding_X_0605 model results in MMTEB feat: add KaLM-Team/KaLM_Embedding-X-0605 model results in MMTEB Jun 5, 2025
@YanshekWoo YanshekWoo changed the title feat: add KaLM-Team/KaLM_Embedding-X-0605 model results in MMTEB Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB Jun 5, 2025
@KennethEnevoldsen
Copy link
Contributor

This is dependent on the PR, where an implementation is missing, which @Samoed will add. Once added, we should probably test that the score aligns so I will assign this to @Samoed. @Samoed feel free to request a review at the end if you need a second pair of eyes on it

@YanshekWoo
Copy link
Contributor Author

@Samoed
Excuse me, I'd like to know the progress of this Merge Request.
Are there any difficulties or areas where you need my clarification or assistance?

@Samoed
Copy link
Member

Samoed commented Jun 13, 2025

We need to integrate your model firstly. Can you review metric results from our current implementation embeddings-benchmark/mteb#2478 (comment)
@YanshekWoo

@KennethEnevoldsen KennethEnevoldsen added the waiting for review of implementation This PR is waiting for an implementation review before merging the results. label Jun 16, 2025
@YanshekWoo
Copy link
Contributor Author

Before merging the results, can we update them first?

By the way, could we raise an issue here regarding the evaluation instructions for MTEB?

@Samoed
Copy link
Member

Samoed commented Jun 23, 2025

By the way, could we raise an issue here regarding the evaluation instructions for MTEB?

I'm not sure what you want. Can you update your implementation in embeddings-benchmark/mteb#2775 using new loader for Kalm models from embeddings-benchmark/mteb#2478?

@YanshekWoo
Copy link
Contributor Author

I'm not sure what you want. Can you update your implementation in embeddings-benchmark/mteb#2775 using new loader for Kalm models from embeddings-benchmark/mteb#2478?

The Instruction we evaluated has been updated, so the results for each dataset will also change accordingly.

We used SentenceTransformer for loading and specifying prompts during our evaluation, but we found that there is an issue with the current mteb code when specifying specific task instructions:
if the task dataset contains a "-", there will be an error in parsing the task_name, such as when specifying "NQ-PL-query".

https://github.com/embeddings-benchmark/mteb/blob/d7ff1ab3168ddf765d497b22e928af355f0cffe6/mteb/models/wrapper.py#L77

@Samoed
Copy link
Member

Samoed commented Jun 23, 2025

The Instruction we evaluated has been updated, so the results for each dataset will also change accordingly.

Can you update them in your implementation?

if the task dataset contains a "-", there will be an error in parsing the task_name, such as when specifying "NQ-PL-query".

I see. I will fix it

@YanshekWoo
Copy link
Contributor Author

Can you update them in your implementation?

Yes. I will update it today.

ll0ruc and others added 28 commits June 25, 2025 12:32
* update gte-modernbert-base

* update CQADupstackRetrieval for gte-modernbert-base

* add Qwen3-Embedding results

* add results for Qwen3 Embedding

* update FloresBitextMining.json results

* udpate model revsion
* add geoembedding results

* rename geoembedding result file

* minor fix

---------

Co-authored-by: zhangzeqing <zhangzeqing@zhejianglab.com>
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>
* add cadet results

* add revision and model metadata
* add xyz model

* add xyz model

* use current mteb version

* use current mteb version
* Update LGAI-Embedding results

Submit results

* Update model_meta.json

updated
* init automate script

* bump versions

* fix script name

* add tabulate

* add resutls for test model

* handle no result on task

* fix function

* fix function

* remove testuser

* fix script help

* try to run only one model

* install from sources

* update script

* format

* fix typo

* fetch main

* fetch main in script

* remove revision check

* fix reference models arg

* bump python version
* update script

* remove comment
* update results

* mv to revision_id

* change model_meta

* update revision_id

* fix meta

* update on scores

---------

Co-authored-by: Kolodin Egor <eikolodin@sberbank.ru>
* add geoembedding results

* rename geoembedding result file

* minor fix

---------

Co-authored-by: zhangzeqing <zhangzeqing@zhejianglab.com>
Co-authored-by: Kenneth Enevoldsen <kennethcenevoldsen@gmail.com>
@YanshekWoo
Copy link
Contributor Author

YanshekWoo commented Jun 25, 2025

Can you update them in your implementation?

I have created a new PR of the model card at: embeddings-benchmark/mteb#2850

And the new results with new instruction are updated in new PR: #227

@YanshekWoo YanshekWoo closed this Jun 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

waiting for review of implementation This PR is waiting for an implementation review before merging the results.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants