model : add ASR support for LFM2-Audio-1.5B #17694

tdakhran · 2025-12-02T14:28:51Z

LFM2-Audio-1.5B supports audio input and audio output.

PR adds only ASR support. To perform ASR invoke CLI with

bin/llama-mtmd-cli -m LFM2-Audio-1.5B-F32.gguf --mmproj mmproj-LFM2-Audio-1.5b-F32.gguf -n 30 --audio input.wav -sys "Perform ASR." -p "<__media__>"

Changes to existing code:

model requires system prompt, -sys enabled for llama-mtmd-cli
mel bins generation reworked, now it is generated dynamically and supports different n_fft values
OP_SSM_CONV for CUDA backend is extended to support kernel size 9

cc: @ngxson

tdakhran · 2025-12-02T14:42:14Z

tested that llama-server works as intended with input

[
        {"role": "system", "content": "Perform ASR."},
        {
            "role": "user",
            "content": [
                {
                    "type": "input_audio",
                    "input_audio": {
                        "format": "wav",
                        "data": base64.b64encode(pathlib.Path("/data/playground/issue_400/10.wav").read_bytes()).decode(
                            "utf-8"
                        ),
                    },
                },
            ],
        },
    ]

tdakhran added 4 commits December 2, 2025 15:18

convert backbone to gguf

84da624

convert mmproj to gguf

9f1d9e4

ASR works

b224f39

Refactor and enable cuda convs

6d54ddc

tdakhran changed the title ~~model : add LFM2-Audio-1.5B support~~ model : add ASR support for LFM2-Audio-1.5B Dec 2, 2025

github-actions bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs examples python python script changes ggml changes relating to the ggml tensor library for machine learning labels Dec 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

model : add ASR support for LFM2-Audio-1.5B #17694

model : add ASR support for LFM2-Audio-1.5B #17694

tdakhran commented Dec 2, 2025

Uh oh!

tdakhran commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

model : add ASR support for LFM2-Audio-1.5B #17694

Are you sure you want to change the base?

model : add ASR support for LFM2-Audio-1.5B #17694

Conversation

tdakhran commented Dec 2, 2025

Uh oh!

tdakhran commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant