Additional Trainer Argument for features of different modalities #3225

mengerj · 2025-02-11T08:36:47Z

Hi,

I have been working on multi-modal embedding models based on the sentence-transformers framework and found that I need to hard code keys into the "collect_features" method within the trainer. I simply added an argument to allow passing a list of additional keys that might be present in your features (similar to "pixel_values" for CLIP). I would appreciate a merge so I don't need to rely on my fork for the multi-modal embedding model.

Best,
Jonatan

…y in features

…mers

…he previous float32

…feauture keys (e.g for other modalities) to the collect features function.

tomaarsen · 2025-03-21T11:06:51Z

Hello!

I think this is indeed quite valuable to support, so I'd like to help you out here. However, I think we might be better off going in a different direction that should work out of the box without the user having to ever specify anything: #3276

Feel free to give me some feedback there!

Also, apologies for the delays. I've been focusing on the huge #3222, which is now nearing completion.

Tom Aarsen

menger and others added 13 commits January 22, 2025 14:02

modify trainer collect_features method to handle "omics_embedding" ke…

ad3c50e

…y in features

Merge branch 'UKPLab:master' into master

fa9bf35

renamed from omics_embedding to omics_representation for clarity

73ae44f

Merge branch 'master' of https://github.com/mengerj/sentence-transfor…

18300cd

…mers

actually changing the trainer collect_features method correctly

5f7adaa

cast acc threshold and f1 threshold to float64 due to error parsing t…

c10212d

…he previous float32

Merge branch 'UKPLab:master' into master

a73d27a

Include a parameter into the SentenceTransfomerTrainer to pass extra …

c3c9f9d

…feauture keys (e.g for other modalities) to the collect features function.

Merge branch 'UKPLab:master' into master

c56eff6

correctly handle suffixes with "_" before the input key

4b36100

Merge branch 'UKPLab:master' into master

93cafb6

fixed formatting issue

49e59ca

Merge branch 'UKPLab:master' into master

c2b4cf9

tomaarsen mentioned this pull request Mar 21, 2025

Update collect_features to allow different modalities more easily in the Trainer #3276

Draft

Merge branch 'UKPLab:master' into master

cae9549

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Additional Trainer Argument for features of different modalities #3225

Additional Trainer Argument for features of different modalities #3225

Uh oh!

mengerj commented Feb 11, 2025

Uh oh!

tomaarsen commented Mar 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Additional Trainer Argument for features of different modalities #3225

Are you sure you want to change the base?

Additional Trainer Argument for features of different modalities #3225

Uh oh!

Conversation

mengerj commented Feb 11, 2025

Uh oh!

tomaarsen commented Mar 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants