Skip to content

update results for giga-embeddings-instruct#208

Merged
Samoed merged 6 commits intoembeddings-benchmark:mainfrom
ekolodin:main
Jun 22, 2025
Merged

update results for giga-embeddings-instruct#208
Samoed merged 6 commits intoembeddings-benchmark:mainfrom
ekolodin:main

Conversation

@ekolodin
Copy link
Contributor

@ekolodin ekolodin commented May 29, 2025

Checklist

  • My model has a model sheet, report or similar
  • My model has a reference implementation in mteb/models/ this can be as an API. Instruction on how to add a model can be found here
    • No, but there is an existing PR ___
  • The results submitted is obtained using the reference implementation
  • My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
  • I solemnly swear that for all results submitted I have not on the evaluation dataset including training splits. If I have I have disclosed it clearly.

Copy link
Member

@Samoed Samoed left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You need to put your new results in dir witn model revision

@ekolodin
Copy link
Contributor Author

You need to put your new results in dir witn model revision

In what dir?

@Samoed
Copy link
Member

Samoed commented May 29, 2025

In your case you should put in /results/ai-sage__Giga-Embeddings-instruct/646f5ff3587e74a18141c8d6b60d1cffd5897b92/

@ekolodin
Copy link
Contributor Author

where did you get this string 646f5ff3587e74a18141c8d6b60d1cffd5897b92?

@ekolodin
Copy link
Contributor Author

but i changed the model weights, should this revision id be changed?

@ekolodin
Copy link
Contributor Author

ok, i've moved to revision_id dir

@Samoed
Copy link
Member

Samoed commented May 29, 2025

Yes, you should change revision of your model in main repo too

@ekolodin
Copy link
Contributor Author

ekolodin commented Jun 3, 2025

So, I've updated the revision_id on the latest one

@ekolodin ekolodin requested a review from Samoed June 3, 2025 13:24
@Samoed
Copy link
Member

Samoed commented Jun 3, 2025

You should update revision in main repo too, and after that run your model with mteb implementation @ekolodin

@ekolodin
Copy link
Contributor Author

ekolodin commented Jun 3, 2025

what main repo? this is not main repo? i do not understand you @Samoed

@Samoed
Copy link
Member

Samoed commented Jun 3, 2025

@ekolodin
Copy link
Contributor Author

ekolodin commented Jun 5, 2025

I've made it @Samoed

embeddings-benchmark/mteb#2774

@ekolodin
Copy link
Contributor Author

ekolodin commented Jun 6, 2025

Any updates?

@Samoed
Copy link
Member

Samoed commented Jun 6, 2025

No, you need to run your model with implementation from mteb.

@ekolodin
Copy link
Contributor Author

ekolodin commented Jun 6, 2025

I've already run it, and the push the results.

@ekolodin ekolodin requested a review from Samoed June 10, 2025 11:52
@ekolodin
Copy link
Contributor Author

@Samoed could you please merge?

@Samoed
Copy link
Member

Samoed commented Jun 10, 2025

As I said I don't think you've correctly run your model. I've got

Task MTEB results Your PR
TERRa 0.638273 0.583729
CEDRClassification 0.6247 0.684857

Please, rerun your model with latest mteb.

https://www.kaggle.com/code/samoed/notebook908a227956/notebook?scriptVersionId=244761810

@ekolodin
Copy link
Contributor Author

I think you use wrong instructions for benchmarks

@ekolodin
Copy link
Contributor Author

Where do you set up instructions in your example? I can't see it, we have custom instructions for each task

@Samoed
Copy link
Member

Samoed commented Jun 10, 2025

we have custom instructions for each task

If you have custom instructions, then you should add them for your model. We're integrating KALM that using custom instructions for tasks embeddings-benchmark/mteb#2478, too. I'll try to merge this PR soon

@ekolodin
Copy link
Contributor Author

Where can I get your default instructions?

@Samoed
Copy link
Member

Samoed commented Jun 10, 2025

They are defined in task Metadata, for example for TERRa or if instruction is not defined in task then default instruction for task type will be used, e.g. PairClassification. Instructions are selected by this function https://github.com/embeddings-benchmark/mteb/blob/e6238f2305be79f7b32d934d85ef4557fb89cb22/mteb/models/wrapper.py#L91

@ekolodin
Copy link
Contributor Author

I have added custom instructions code: embeddings-benchmark/mteb#2836

Copy link
Member

@Samoed Samoed left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Now I verify scores

@Samoed Samoed merged commit ec3c9cb into embeddings-benchmark:main Jun 22, 2025
2 of 3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants