[Refactor] Remove dependency on librosa#2273
Conversation
lishunyang12
left a comment
There was a problem hiding this comment.
looks good overall, couple small things
1180487 to
4ffbb8f
Compare
|
Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits. |
|
main branch is now targeting vllm 0.19.0, this PR can be merged. |
|
Ok it seems the remainder of the librosa functions also have counterparts in torchaudio, I'm expanding the scope of this PR to completely drop the dependency on librosa. |
4ffbb8f to
008c0fc
Compare
|
Done, the dependency on librosa is now fully dropped. See also vllm-project/vllm#39079 |
2d7454e to
c1a598e
Compare
tzhouam
left a comment
There was a problem hiding this comment.
Please also update the Dockerfiles, thanks.
df97f2b to
8204fe1
Compare
Done, manually inspected the wheels (for x86_64 linux), they do contain the native libraries. |
|
@gcanlin Please have a look. The CI failure seems to be caused by the main branch instead of this PR. |
|
It seems the failure is related to this PR? |
@gcanlin |
…_audio Signed-off-by: Nick Cao <ncao@redhat.com>
…le_audio_resampy Signed-off-by: Nick Cao <ncao@redhat.com>
…scale_fbanks Signed-off-by: Nick Cao <ncao@redhat.com>
Co-authored-by: Claude <noreply@anthropic.com> Signed-off-by: Nick Cao <ncao@redhat.com>
Uses of librosa has been replaced with wrapper functions from vllm (using soundfile and pyav internally). Both soundfile and pyav wheels have the native libraries bundled (libsndfile and ffmpeg), thus no additional installation is required on the host, making these docs outdated and misleading. Signed-off-by: Nick Cao <ncao@redhat.com>
Signed-off-by: Nick Cao <ncao@redhat.com>
These libraries are already bundled in the pyav and soundfile wheels Signed-off-by: Nick Cao <ncao@redhat.com>
Signed-off-by: Nick Cao <ncao@redhat.com>
f2b6b6b to
2560d54
Compare
| apt-get install -y --no-install-recommends --fix-missing \ | ||
| curl \ | ||
| espeak-ng \ | ||
| ffmpeg \ |
There was a problem hiding this comment.
@xuechendi Could you check whether XPU have the same issue #2708?
Signed-off-by: Nick Cao <ncao@redhat.com> Co-authored-by: Claude <noreply@anthropic.com>

Purpose
Replace librosa.load() with vllm.multimodal.media.audio.load_audio() and librosa.resample() with vllm.multimodal.audio.resample_audio_resampy(), reusing the functions introduced in vllm#37058.
See also: #1725
NOTE: there are still a few other references to librosa to be removed in followup PRs.
NOTE: Do not merge until the release of vllm 0.18.1 where these helper functions are introduced.
Test Plan
Test Result
PASS
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model. Please runmkdocs serveto sync the documentation editions to./docs.BEFORE SUBMITTING, PLEASE READ https://github.com/vllm-project/vllm-omni/blob/main/CONTRIBUTING.md (anything written below this line will be removed by GitHub Actions)