model: add kalm_models ModelMeta #2775

YanshekWoo · 2025-06-05T07:35:37Z

Checklist

I did not add a dataset, or if I did, I added the dataset checklist to the PR and completed it.
I did not add a model, or if I did, I added the model checklist to the PR and completed it.

Model Checklist

I have filled out the ModelMeta object to the extent possible
I have ensured that my model can be loaded using
- mteb.get_model(model_name, revision) and
- mteb.get_model_meta(model_name, revision)
I have tested the implementation works on a representative set of tasks.
The model is public, i.e., it is available either as an API or the weights are publicly available to download.

Signed-off-by: xinshuohu <xinshuohu@tencent.com>

Samoed · 2025-06-05T08:18:45Z

mteb/models/kalm_models.py

+    "MTOPIntentClassification": ["train"],
+}
+
+KaLM_Embedding_X_0605 = ModelMeta(


Can you add the implementation of your model? If it is similar to the original KALM, I can push the work on that PR.

If it is similar to the original KALM, I can push the work on that PR.

Yes, it is almost the same implementation as HIT_TMG__KaLM_embedding_multilingual_mini_instruct_v1.

Perhaps the entire set of models related to KaLM should be moved to kalm_models.py.

Yes, I will try to finish the work on #2478 on the weekend, then.

@YanshekWoo can you try running your models with the implementation from #2478? It has been merged into main.

@Samoed OK, I will try to test it. Thanks.

@Samoed
I have tested the latest version of MTEB (1.38.30), and I believe its results are completely fine now.

Some of the results (from different task types) for HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5 are as follows:

| Task | Reported | Reproduced |
| --- | --- | --- |
| EmotionClassification | 0.86900 | 0.86885 |
| FiQA2018 | 0.44741 | 0.44072 |
| SprintDuplicateQuestions | 0.93057 | 0.930568 |
| STS12 | 0.80167 | 0.801666 |
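For readers who want to quantify how close the reproduced scores are to the reported ones, the comparison can be sketched in plain Python. This is only an illustration: the scores are copied from the results listed above, while the helper names (`score_gaps`, `within_tolerance`) and the `TOLERANCE` value are hypothetical and are not part of MTEB or of this PR.

```python
# Scores copied from the results above: task -> (reported, reproduced).
SCORES = {
    "EmotionClassification": (0.86900, 0.86885),
    "FiQA2018": (0.44741, 0.44072),
    "SprintDuplicateQuestions": (0.93057, 0.930568),
    "STS12": (0.80167, 0.801666),
}

# Hypothetical threshold for calling a score "reproduced".
# This is an assumption for illustration, not an MTEB convention.
TOLERANCE = 0.01


def score_gaps(scores):
    """Return {task: absolute difference between reported and reproduced}."""
    return {
        task: abs(reported - reproduced)
        for task, (reported, reproduced) in scores.items()
    }


def within_tolerance(scores, tolerance=TOLERANCE):
    """Return True if every task's gap is at most `tolerance`."""
    return all(gap <= tolerance for gap in score_gaps(scores).values())


if __name__ == "__main__":
    for task, gap in score_gaps(SCORES).items():
        print(f"{task}: gap = {gap:.6f}")
    print("all within tolerance:", within_tolerance(SCORES))
```

With these numbers the largest gap is on FiQA2018 (about 0.0067); the other three tasks differ by less than 0.0002.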

The evaluation code is as follows:

    import mteb

    # Specify the model that we want to evaluate
    model = mteb.get_model("HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1.5")

    # Specify what you want to evaluate it on
    tasks = mteb.get_tasks(
        tasks=["EmotionClassification", "FiQA2018", "STS12", "SprintDuplicateQuestions"]
    )

    # Run the evaluation
    evaluation = mteb.MTEB(tasks=tasks)
    results = evaluation.run(model, encode_kwargs={"batch_size": 256}, verbosity=2)

YanshekWoo · 2025-06-25T04:49:58Z

I have created a new PR based on the latest mteb.
Please refer to the new one: #2850

xinshuohu added 4 commits June 5, 2025 12:56

feat: add kalm_models

1ae0f37

fix: change adapted_from in KaLM models

72a55fa

Signed-off-by: xinshuohu <xinshuohu@tencent.com>

feat: add kalm_models

ce13d89

Signed-off-by: xinshuohu <xinshuohu@tencent.com>

feat: add kalm_models

b70d5ef

Signed-off-by: xinshuohu <xinshuohu@tencent.com>

YanshekWoo changed the title ~~Dev kalm~~ add kalm_models ModelMeta Jun 5, 2025

YanshekWoo mentioned this pull request Jun 5, 2025

feat: add KaLM-Team/KaLM_Embedding_X_0605 model results in MMTEB embeddings-benchmark/results#216

Closed

6 tasks

Samoed reviewed Jun 5, 2025

YanshekWoo mentioned this pull request Jun 5, 2025

Add results for KaLM-Team/KaLM_Embedding-X-0605 in MMTEB embeddings-benchmark/results#217

Closed

6 tasks

Samoed changed the title ~~add kalm_models ModelMeta~~ model: add kalm_models ModelMeta Jun 6, 2025

KennethEnevoldsen assigned Samoed Jun 16, 2025

KennethEnevoldsen added the new model label Jun 24, 2025

KennethEnevoldsen assigned YanshekWoo Jun 24, 2025

YanshekWoo closed this Jun 25, 2025

YanshekWoo deleted the dev_kalm branch June 25, 2025 06:48
