Whisper OpenVINO

This repo is a fork of OpenAI's Whisper ASR models with an OpenVINO backend. Currently, transcription is supported for all models except large.

To install, set up the environment described in the original repo (https://github.com/openai/whisper.git), then run the following command:

pip install git+https://github.com/zhuzilin/whisper-openvino.git

You can use this modified version of Whisper in the same way as the original. For example, to measure the performance gain, I transcribed John Carmack's amazing 92 min talk about rendering at QuakeCon 2013 (the recording is available on YouTube) on a 2019 MacBook Pro (Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz) with:

whisper carmack.mp3 --model tiny.en --beam_size 3
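One way to reproduce an end-to-end timing for the command above is to wrap the CLI call in a small script. This is a minimal sketch, not necessarily how the numbers in this README were measured; the helper names `whisper_argv` and `time_command` are hypothetical, and it assumes the `whisper` executable is on your PATH.

```python
import subprocess
import sys
import time


def whisper_argv(audio_path, model="tiny.en", beam_size=3):
    """Build the argument list for the CLI call shown above."""
    return ["whisper", audio_path, "--model", model, "--beam_size", str(beam_size)]


def time_command(argv):
    """Run argv to completion and return the wall-clock time in minutes."""
    start = time.perf_counter()
    subprocess.run(argv, check=True)  # raises CalledProcessError on a non-zero exit
    return (time.perf_counter() - start) / 60.0


if __name__ == "__main__":
    # Smoke check with a trivial command; for a real measurement,
    # pass whisper_argv("carmack.mp3") instead.
    elapsed = time_command([sys.executable, "-c", "pass"])
    print(f"{elapsed:.4f} min")
```

Running the same script once with the original package installed and once with this fork gives two comparable wall-clock figures.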

And the end-to-end time is shown below:

audio length	original whisper	whisper openvino
92 min	67.57 min	39.16 min
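To put the table in perspective, the speedup and the real-time factor (processing time divided by audio length) follow directly from the three figures. The sketch below only does that arithmetic; the function names are illustrative, and the inputs are the values reported above.

```python
# Values taken from the table above, in minutes.
AUDIO_MIN = 92.0
ORIGINAL_MIN = 67.57
OPENVINO_MIN = 39.16


def speedup(baseline_min, optimized_min):
    """How many times faster the optimized run is than the baseline."""
    return baseline_min / optimized_min


def real_time_factor(processing_min, audio_min):
    """Processing time per minute of audio; below 1.0 is faster than real time."""
    return processing_min / audio_min


print(f"speedup: {speedup(ORIGINAL_MIN, OPENVINO_MIN):.2f}x")                 # speedup: 1.73x
print(f"RTF original: {real_time_factor(ORIGINAL_MIN, AUDIO_MIN):.2f}")       # RTF original: 0.73
print(f"RTF openvino: {real_time_factor(OPENVINO_MIN, AUDIO_MIN):.2f}")       # RTF openvino: 0.43
```

On this single measurement, the OpenVINO backend is roughly 1.7 times faster end to end.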

You can check the resulting transcript in carmack.mp3.txt.

All weights and models, including the intermediate ONNX files, are uploaded to the Hugging Face model hub.
