Skip to content

Conversation

@orionw
Copy link
Contributor

@orionw orionw commented Aug 28, 2025

This adds a new dataset, LIMIT, that is available today from my internship with GDM:

  • Paper: https://arxiv.org/abs/2508.21038

  • Code: https://github.com/google-deepmind/limit

  • I have run the following models on the task (adding the results to the pr). These can be run using the mteb run -m {model_name} -t {task_name} command.

    • sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
    • intfloat/multilingual-e5-small
  • I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).

  • I have considered the size of the dataset and reduced it if it is too big (2048 examples is typically large enough for most tasks)

Copy link
Collaborator

@isaac-chung isaac-chung left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Congrats on the paper!

@orionw orionw enabled auto-merge (squash) August 30, 2025 18:54
@orionw
Copy link
Contributor Author

orionw commented Aug 30, 2025

Thanks @isaac-chung! It's still failing the tests which is a bit weird.

@Samoed is this cuz of my PR or is this good to merge? I see you've been doing a lot of work to fix the branches CI, thank you!

@isaac-chung
Copy link
Collaborator

isaac-chung commented Aug 30, 2025

I think the missing piece here is to merge #3098 into the v2 branch. @Samoed has a PR open now: #3102

Copy link
Contributor

@KennethEnevoldsen KennethEnevoldsen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Once tests pass this is good to merge - congrats on the paper, very happy to see it!

@orionw orionw merged commit b7d0bec into v2.0.0 Sep 1, 2025
7 of 8 checks passed
@orionw orionw deleted the add_limit branch September 1, 2025 16:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants