@mengerj commented Feb 11, 2025

Hi,

I have been working on multi-modal embedding models based on the sentence-transformers framework, and found that supporting a new modality currently requires hard-coding its feature keys into the "collect_features" method of the trainer. I simply added an argument that lets you pass a list of additional keys that might be present in your features (similar to "pixel_values" for CLIP). I would appreciate a merge so that I no longer need to rely on my fork for the multi-modal embedding model.

Best,
Jonatan
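
For readers following the thread, the idea described above can be sketched roughly as follows. This is a minimal, standalone illustration and not the code from this PR or from sentence-transformers: the function name `collect_features_sketch`, the `extra_keys` parameter, and the `audio_values` key are hypothetical, and the default suffixes and the prefix-based grouping are assumptions about how such a method might behave.

```python
from __future__ import annotations

from typing import Any

# Suffixes assumed to mark a feature group in this sketch (an assumption,
# not taken from the library source).
DEFAULT_KEYS = ("input_ids", "sentence_embedding", "pixel_values")


def collect_features_sketch(
    inputs: dict[str, Any],
    extra_keys: tuple[str, ...] = (),
) -> tuple[list[dict[str, Any]], Any]:
    """Group flat, prefixed batch columns into one feature dict per prefix.

    `extra_keys` is the hypothetical argument: additional suffixes that
    should also be recognised as the start of a feature group.
    """
    marker_keys = DEFAULT_KEYS + tuple(extra_keys)
    features: list[dict[str, Any]] = []
    seen_prefixes: set[str] = set()

    for column in inputs:
        # Find the first marker suffix this column name ends with, if any.
        suffix = next((k for k in marker_keys if column.endswith("_" + k)), None)
        if suffix is None:
            continue
        prefix = column[: -len(suffix)]  # e.g. "anchor_"
        if prefix in seen_prefixes:
            continue  # this group was already collected via another marker
        seen_prefixes.add(prefix)
        # Collect every column sharing the prefix, with the prefix stripped.
        features.append(
            {name[len(prefix):]: value for name, value in inputs.items() if name.startswith(prefix)}
        )

    # Labels are assumed to live under a plain "label" key.
    return features, inputs.get("label")


# Hypothetical batch: a text column plus a custom "audio_values" modality.
batch = {
    "anchor_input_ids": [[101, 2023, 102]],
    "anchor_attention_mask": [[1, 1, 1]],
    "positive_audio_values": [[0.1, 0.7]],
    "label": [1.0],
}
features, labels = collect_features_sketch(batch, extra_keys=("audio_values",))
```

With the call above, `features` holds two dicts, one for the `anchor_` columns and one for the `positive_` columns. Without `extra_keys`, the `positive_` group would be skipped in this sketch, which is the kind of limitation the hard-coded keys impose.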

@tomaarsen
Member

Hello!

I think this is indeed quite valuable to support, so I'd like to help you out here. However, I think we might be better off going in a different direction that should work out of the box without the user having to ever specify anything: #3276

Feel free to give me some feedback there!

Also, apologies for the delays. I've been focusing on the huge #3222, which is now nearing completion.

- Tom Aarsen
