Skip to content

MAEB Model MuQ-MuLan#2909

Merged
isaac-chung merged 6 commits intoembeddings-benchmark:maebfrom
AdnanElAssadi56:maeb-model-muq_mulan
Jul 21, 2025
Merged

MAEB Model MuQ-MuLan#2909
isaac-chung merged 6 commits intoembeddings-benchmark:maebfrom
AdnanElAssadi56:maeb-model-muq_mulan

Conversation

@AdnanElAssadi56
Copy link
Contributor

Added Audio Retrieval Model: OpenMuQ/MuQ-MuLan-large

Results:
JamAltArtistA2ARetrieval.json
JamAltLyricA2TRetrieval.json
JamAltLyricT2ARetrieval.json

…o input types. Updated tensor conversion logic for numpy arrays and lists, ensuring compatibility with existing torch tensor formats. Improved resampling handling for audio inputs.
requires_package(self, "muq", "pip install 'mteb[muq]'")
from muq import MuQMuLan

self.model_name = model_name
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this used anywhere else?

Comment on lines +47 to +50
def _handle_batch(
self, batch: AudioData | Iterable[tuple[AudioData, str]]
) -> list[torch.Tensor]:
"""Handle a single batch of audio data."""
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seeing a lot of repeated code with the other PRs. Maybe we can extract and reuse this in mteb/models/audio_utils.py or something.


return text_embeds.cpu().numpy()

def encode(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

encode is really just for text embeddings. It has been confused and we should also fix the rest of the audio models separately.

loader=partial(MuQMuLanWrapper, model_name="OpenMuQ/MuQ-MuLan-large"),
name="OpenMuQ/MuQ-MuLan-large",
languages=["eng-Latn", "zho-Hans"], # English and Chinese support
revision="8a081dbcf84edd47ea7db3c4ecb8fd1ec1ddacfe8a081dbcf84edd47ea7db3c4ecb8fd1ec1ddacfe",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this a GH commit hash?

@isaac-chung isaac-chung merged commit 48febd1 into embeddings-benchmark:maeb Jul 21, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants