@bschifferer
Contributor

@bschifferer bschifferer commented Jun 27, 2025

Checklist

  • My model has a model sheet, report or similar
  • My model has a reference implementation in
  • The results submitted are obtained using the reference implementation (script is provided in the HuggingFace checkpoints)
  • My model is available, either as a publicly accessible API or publicly on e.g., Huggingface
  • I solemnly swear that for all results submitted I have not trained on the evaluation dataset, including training splits. If I have, I have disclosed it clearly.

Models are available on HF (will be published soon):
https://huggingface.co/nvidia/llama-nemoretriever-colembed-1b-v1
https://huggingface.co/nvidia/llama-nemoretriever-colembed-3b-v1

They contain a description of the training datasets and a script to run MTEB VisualDocumentRetrieval

@bschifferer
Contributor Author

@KennethEnevoldsen, @Samoed, @isaac-chung - Hello, could you help me understand what the error is?

KeyError: 'intfloat/multilingual-e5-large'

I didn't change anything related to the intfloat model. Why is there a KeyError? Can I somehow trigger a rerun of the tests?

@Samoed
Member

Samoed commented Jun 27, 2025

I think you've just run a task that intfloat/multilingual-e5-large hasn't been run on. I fixed this issue in #229.

@bschifferer
Contributor Author

@Samoed - How did I run the task? I just opened a PR to submit our results.

Do I need to rebase or change something in my PR?

@Samoed
Member

Samoed commented Jun 27, 2025

No, you don't need to do anything. It is fine that this CI check failed.

@rnyak
Contributor

rnyak commented Jun 27, 2025

@Samoed Hello. Any ETA for merging your PR? Thanks.

@Samoed
Member

Samoed commented Jun 28, 2025

This PR is not blocking merging your results

Contributor

@isaac-chung isaac-chung left a comment

The model PR only has minor comments. This is good to go.

@bschifferer
Contributor Author

Thanks a lot for the quick feedback and review. I thought the checks had to pass, but it seems they are optional.

@isaac-chung @Samoed: Do you think it is possible to merge the PR by Monday afternoon? We want to share/promote our models, but we are waiting until the models are available on the leaderboards of MTEB VisualDocumentRetrieval and ViDoRe V1+V2 (both now use the results from the MTEB repository). That would be great.

I wish you a great weekend.

@Samoed
Member

Samoed commented Jun 28, 2025

I think it will be possible, but you should fix up your model metadata a bit.

@bschifferer
Contributor Author

I think it will be possible, but you should fix up your model metadata a bit.

Thanks @Samoed - I just updated my PR in the MTEB library for model_meta_data and updated the .json files in this PR.

@isaac-chung isaac-chung merged commit 08fa208 into embeddings-benchmark:main Jun 28, 2025
2 of 3 checks passed