add model memory usage by Samoed · Pull Request #1934 · embeddings-benchmark/mteb

Samoed · 2025-02-02T17:24:32Z

To make fully compatible old leaderboard with new, we can add model memory usage. Add property to ModelMeta.

Code Quality

Code Formatted: Format the code using make lint to maintain consistent style.

Documentation

Updated Documentation: Add or update documentation to reflect the changes introduced in this PR.

Testing

New Tests Added: Write tests to cover new functionality. Validate with make test-with-coverage.
Tests Passed: Run tests locally using make test or make test-with-coverage to ensure no existing functionality is broken.

isaac-chung · 2025-02-03T04:43:15Z

Have you already discussed with @KennethEnevoldsen or @x-tabdeveloping ? I'd prefer not to have such back-and-forth: to add then remove then add back things. #1729

Samoed · 2025-02-03T05:35:22Z

Opened issue for this #1935. Previously it was removed, because it was not filled and should be autocalculated based on the number of parameters of the model

isaac-chung

I understand now (we're not adding this back to be filled in - we are auto-calculating it here. Great stuff!). Thanks for the initiative! Just 2 clarifications then I think we're ready.

mteb/model_meta.py

isaac-chung

Sweet!

x-tabdeveloping

Left a couple of minor comments. Thanks for adding this

mteb/model_meta.py

x-tabdeveloping · 2025-02-04T08:27:03Z

mteb/model_meta.py

+        if self.n_parameters is None:
+            return None
+        # Model memory in bytes. For FP32 each parameter is 4 bytes.
+        model_memory_bytes = self.num_params * 4


Is this a good assumption to make? Do all models have FP32 parameters?

Large models (>1B) params usally loaded with fp16/bp16, but I don't know how to handle this automatically

I suppose you could get this information using huggingface_hub.hf_api.get_safetensors_metadata

And then do a cached_property so that it doesn't have to be fetched every time

I think integrating this could slow down leaderboard building. Maybe we could manually set memory usage instead or add information about the number of parameters for each model weight?

I agree. We could fetch all of these in a script and manually keep count of them in ModelMeta perhaps?

Calculated them

# Conflicts: # mteb/models/bge_models.py # mteb/models/promptriever_models.py

# Conflicts: # mteb/models/ru_sentence_models.py

Samoed · 2025-02-06T19:54:29Z

@x-tabdeveloping Can this PR be merged?

KennethEnevoldsen

Looks good on my end - great addition. Do we want to create a column for it on the leaderboard (if so let us make an issue on that)

Samoed · 2025-02-07T10:21:35Z

@KennethEnevoldsen I've already created issue #1935

# Conflicts: # mteb/models/gritlm_models.py

add model memory usage

b7704fc

Samoed requested a review from x-tabdeveloping February 2, 2025 17:24

lint

e6fa5f5

Samoed mentioned this pull request Feb 2, 2025

[Leaderboard] Autocalculate memory usage in model meta #1935

Closed

isaac-chung reviewed Feb 3, 2025

View reviewed changes

mteb/model_meta.py Outdated Show resolved Hide resolved

mteb/model_meta.py Outdated Show resolved Hide resolved

update

40fa6c3

isaac-chung approved these changes Feb 3, 2025

View reviewed changes

x-tabdeveloping reviewed Feb 4, 2025

View reviewed changes

Samoed added 7 commits February 4, 2025 18:21

calculate memory usage based on file size

a992de8

calculate memory usage

46d1779

Merge branch 'refs/heads/main' into add_memory_usage

95e6939

# Conflicts: # mteb/models/bge_models.py # mteb/models/promptriever_models.py

add memory usage for MIEB models

00c164d

add last model usage

7c50c9a

add memory_usage_mb to overview

aaae8c4

fix rerank

8919302

Samoed requested a review from x-tabdeveloping February 5, 2025 08:46

Samoed and others added 3 commits February 5, 2025 12:20

Merge branch 'main' into add_memory_usage

609b7e9

Merge branch 'refs/heads/main' into add_memory_usage

c0fdad5

# Conflicts: # mteb/models/ru_sentence_models.py

update memory usage

2095a7e

Samoed changed the title ~~feat: add model memory usage~~ add model memory usage Feb 7, 2025

KennethEnevoldsen approved these changes Feb 7, 2025

View reviewed changes

Merge branch 'refs/heads/main' into add_memory_usage

b96d59e

# Conflicts: # mteb/models/gritlm_models.py

Samoed enabled auto-merge (squash) February 7, 2025 16:15

update memory usage

f3f87b1

Samoed merged commit e46539a into main Feb 7, 2025
9 checks passed

Samoed deleted the add_memory_usage branch February 7, 2025 16:26

Comments

Conversation

Samoed commented Feb 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Quality

Documentation

Testing

Uh oh!

isaac-chung commented Feb 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Samoed commented Feb 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

isaac-chung left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

isaac-chung left a comment

Choose a reason for hiding this comment

Uh oh!

x-tabdeveloping left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

x-tabdeveloping Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

Samoed Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

x-tabdeveloping Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

x-tabdeveloping Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

Samoed Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

x-tabdeveloping Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

Samoed Feb 4, 2025

Choose a reason for hiding this comment

Uh oh!

Samoed commented Feb 6, 2025

Uh oh!

KennethEnevoldsen left a comment

Choose a reason for hiding this comment

Uh oh!

Samoed commented Feb 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Samoed commented Feb 2, 2025 •

edited

Loading

isaac-chung commented Feb 3, 2025 •

edited

Loading

Samoed commented Feb 3, 2025 •

edited

Loading

isaac-chung left a comment •

edited

Loading

Samoed commented Feb 7, 2025 •

edited

Loading