Skip to content

Merge main 11 10#3321

Merged
Samoed merged 21 commits intov2.0.0from
merge_main_11_10
Oct 12, 2025
Merged

Merge main 11 10#3321
Samoed merged 21 commits intov2.0.0from
merge_main_11_10

Conversation

@Samoed
Copy link
Member

@Samoed Samoed commented Oct 11, 2025

If you add a model or a dataset, please add the corresponding checklist:

q275343119 and others added 19 commits October 6, 2025 14:06
* Refactor: Move zero-shot percentage calculation to the end of summary table creation which only apply to RTEB table.

* Update RTEB benchmark name from "RTEB(beta)" to "RTEB" for consistency in display.

* feat - RTEB(beta)

* feat - remove Zero-shot

---------

Co-authored-by: ethan <smiletoye@gmail.com>
* model: Add ReasonIR

* Update mteb/models/reasonir_model.py

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

* Update mteb/models/reasonir_model.py

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

* update n_parameters of ReasonIR

Co-authored-by: Niklas <n.muennighoff@gmail.com>

---------

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>
Co-authored-by: Niklas <n.muennighoff@gmail.com>
Currently we pin 3 columns, this makes it hard or impossible to view on phones. The 3rd column is also no longer garuanteed as RTEB leaderboard does not use the zero-shot column
Automatically generated by python-semantic-release
* fix: Only pin model name and rank

Currently we pin 3 columns, this makes it hard or impossible to view on phones. The 3rd column is also no longer garuanteed as RTEB leaderboard does not use the zero-shot column

* fix: resolve flash-attention dependency issue

This has been tested and works.

fixed Resolve flash-attention dependency issues
Fixes #3240
Automatically generated by python-semantic-release
* Retry and token counting in Cohere models

* Retry and token counting in Cohere models

* Retry and token counting in Cohere models

---------

Co-authored-by: Roman Solomatin <36135455+Samoed@users.noreply.github.com>
Automatically generated by python-semantic-release
* sort by mean task type and use pure rank for MIEB LBs

* lint

* rename task type column for readability
* add prompt for MIRACLRetrievalHardNegatives

* add `MIRACLRetrievalHardNegatives.v2`

* Update mteb/tasks/Retrieval/multilingual/MIRACLRetrieval.py

Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>

* move common metadata to dict

---------

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>
Co-authored-by: Kenneth Enevoldsen <kenevoldsen@pm.me>
Automatically generated by python-semantic-release
Automatically generated by python-semantic-release
* Add en code retriever model

* fix model_name

* Update mteb/models/en_code_retriever.py

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>

* correct lint

---------

Co-authored-by: Roman Solomatin <samoed.roman@gmail.com>
# Conflicts:
#	mteb/benchmarks/benchmarks/benchmarks.py
#	mteb/evaluation/evaluators/RegressionEvaluator.py
#	mteb/leaderboard/app.py
#	mteb/models/model_implementations/cohere_models.py
#	mteb/models/model_implementations/cohere_v.py
#	mteb/models/overview.py
#	mteb/tasks/Classification/__init__.py
#	mteb/tasks/Classification/svk/__init__.py
#	mteb/tasks/Retrieval/multilingual/MIRACLRetrieval.py
#	pyproject.toml
#	tests/test_benchmark/mock_tasks.py
@Samoed Samoed added the v2 label Oct 11, 2025
@Samoed Samoed merged commit a08e6a6 into v2.0.0 Oct 12, 2025
12 checks passed
@Samoed Samoed deleted the merge_main_11_10 branch October 12, 2025 06:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

10 participants