eddy

In fluid dynamics, an eddy is the swirling of a fluid and the reverse current created when the fluid is in a turbulent flow regime.

C++ inference library for multi-vendor edge NPUs. Current focus: OpenVINO 2025.x backend for Parakeet-TDT and Whisper models. Additional runtimes (Qualcomm QNN, AMD Ryzen AI Software) coming soon.

For Apple platforms (macOS/iOS), use FluidAudio

Model Cards:

Building

Prerequisites

1. Install OpenVINO 2025.x

Windows: Download and install from OpenVINO Toolkit Downloads

Default location: C:\Program Files (x86)\Intel\openvino_2025.0.0\

Add OpenVINO to PATH (required for runtime):

# Add to your system PATH or run before using executables:
set PATH=%PATH%;C:\Program Files (x86)\Intel\openvino_2025.0.0\runtime\bin\intel64\Release

Linux:

# Download and install from intel.com/openvino or use APT
wget https://storage.openvinotoolkit.org/repositories/openvino/packages/2025.0/linux/l_openvino_toolkit_ubuntu22_2025.0.0.tar.gz
tar -xvzf l_openvino_toolkit_ubuntu22_2025.0.0.tar.gz
cd l_openvino_toolkit_ubuntu22_2025.0.0
sudo ./install_openvino_dependencies.sh
source setupvars.sh

2. Install Build Tools

Windows:

CMake 3.16+
Visual Studio 2019/2022 with C++ Desktop Development workload
Git (for vcpkg)

Linux:

sudo apt install cmake build-essential git

3. Install vcpkg

Windows:

git clone https://github.com/microsoft/vcpkg.git C:\vcpkg
cd C:\vcpkg
.\bootstrap-vcpkg.bat

Linux/macOS:

git clone https://github.com/microsoft/vcpkg.git ~/vcpkg
cd ~/vcpkg
./bootstrap-vcpkg.sh

Build with vcpkg

# Configure with vcpkg toolchain
cmake -S . -B build -DCMAKE_TOOLCHAIN_FILE=C:\vcpkg\scripts\buildsystems\vcpkg.cmake

# Or specify OpenVINO manually if not using vcpkg
cmake -S . -B build -DOpenVINO_DIR=/opt/intel/openvino/runtime/cmake

# Build (Release mode recommended)
cmake --build build --config Release

The build produces:

Static library: eddy (linkable C++ library)
CLI tools: parakeet_cli.exe, hf_fetch_models.exe (examples)
Benchmarks: benchmark_librispeech.exe, benchmark_fleurs.exe

Quick Start

Download models (first time only):

# Windows (with OpenVINO in PATH)
build\examples\cpp\Release\hf_fetch_models.exe --model parakeet-v2

# Linux
build/examples/cpp/hf_fetch_models --model parakeet-v2

Test transcription:

# Create test audio or use your own 16kHz WAV file
build\examples\cpp\Release\parakeet_cli.exe audio.wav --model parakeet-v2 --device CPU

Models auto-download on first inference if not manually fetched. Cached in:

Windows: %LOCALAPPDATA%\eddy\models\parakeet-v2\files
Linux: ~/.cache/eddy/models/parakeet-v2/files

Optional: Whisper Support

Whisper requires OpenVINO GenAI (not included by default):

cmake -S . -B build -DEDDY_ENABLE_WHISPER=ON -DOpenVINOGenAI_DIR="<path-to-genai-cmake>"

Usage

Basic Transcription

Parakeet V2 (English only, 600MB):

# NPU (Intel Core Ultra)
build/examples/cpp/Release/parakeet_cli.exe audio.wav --model parakeet-v2 --device NPU

# CPU (any x86_64)
build/examples/cpp/Release/parakeet_cli.exe audio.wav --model parakeet-v2 --device CPU

Parakeet V3 (Multilingual - 24 languages, 1.1GB):

# English
build/examples/cpp/Release/parakeet_cli.exe audio.wav --model parakeet-v3 --device CPU

# Note: V3 currently only tested with English. Multilingual support coming soon.

Whisper (if built with EDDY_ENABLE_WHISPER=ON):

build/examples/cpp/Release/whisper_example.exe path/to/whisper-model audio.wav NPU

Device Selection

Device	Best For	Performance
NPU	Intel Core Ultra (Meteor Lake+)	38-41× RTFx
CPU	Any x86_64 processor	8× RTFx
AUTO	Let OpenVINO choose	Varies

Audio Requirements:

Format: WAV (mono or stereo)
Sample Rate: 16kHz
Bit Depth: 16-bit PCM

See C++ API documentation for library integration.

Models & Performance

Benchmarked on Intel Core Ultra 7 155H (Meteor Lake) with Intel AI Boost NPU. RTFx values are averaged across LibriSpeech test-clean dataset.

Model	Languages	NPU Speed (avg)	CPU Speed (avg)	Size
Parakeet V2	English	38× RTFx	8× RTFx	600MB
Parakeet V3	24 languages	41× RTFx	8× RTFx	1.1GB
Whisper large-v3-turbo	99 languages	16× RTFx	0.44× RTFx	1.6GB

RTFx = Real-Time Factor (higher is faster). 38× means processing is 38× faster than real-time playback - 10 minutes of audio transcribed in ~16 seconds.

Performance Comparison: eddy (OpenVINO) vs PyTorch

Benchmarked on Intel Core Ultra 7 155H (Meteor Lake):

Model	eddy NPU	eddy CPU	PyTorch GPU (Arc 140V)	PyTorch CPU	eddy NPU Speedup (vs PyTorch GPU)
Parakeet V2	38× RTFx	8× RTFx	8.4× RTFx¹	2× RTFx²	4.5× faster
Parakeet V3	41× RTFx	8× RTFx	8.4× RTFx¹	2.5× RTFx²	4.9× faster
Whisper large-v3-turbo	16× RTFx	0.44× RTFx	5.5× RTFx	0.90× RTFx	2.9× faster

¹ Benchmarked using NeMo parakeet-tdt_ctc-110m as proxy (similar architecture) ² Estimated based on NeMo reference implementations

eddy's NPU implementation provides 3-5× acceleration over PyTorch GPU and 8-20× over PyTorch CPU on Intel Core Ultra 7 155H.

Parakeet V3 Languages: English, Spanish, Italian, French, German, Dutch, Russian, Polish, Ukrainian, Slovak, Bulgarian, Finnish, Romanian, Croatian, Czech, Swedish, Estonian, Hungarian, Lithuanian, Danish, Maltese, Slovenian, Latvian, Greek

Benchmarks: See BENCHMARK_RESULTS.md for detailed results.

Roadmap

Voice Activity Detection (VAD)
C# bindings for .NET applications
Qualcomm QNN backend (Snapdragon NPU)
AMD Ryzen AI Software backend
Additional audio model support

Support & Resources

Troubleshooting

NPU Not Detected

Windows: Check for Intel Core Ultra (Meteor Lake or newer):

build/examples/cpp/Release/parakeet_cli.exe --list-devices

Linux: NPU support requires the Intel NPU driver.

Note: Linux NPU support has not been tested yet.

Requirements:

Ubuntu 22.04+ with kernel 6.6+
Intel Core Ultra (Meteor Lake) or newer processor

For installation instructions, see the official Intel NPU driver documentation:

Installation Guide: github.com/intel/linux-npu-driver
Latest Releases: github.com/intel/linux-npu-driver/releases

Slow Performance

Ensure OpenVINO 2025.x is installed
Try --device NPU for NPU acceleration (optimized for Intel Core Ultra)
See the Performance section above for expected speed on each device

Model Configuration Issues

Ensure you're using the correct model configuration:

V2: blank_token_id = 1024
V3: blank_token_id = 8192

# Verify model version
build/examples/cpp/Release/parakeet_cli.exe --version

Citation

@misc{eddy-2025,
  title={eddy: High-Performance ASR with OpenVINO and Parakeet TDT},
  author={FluidInference Team},
  year={2025},
  url={https://github.com/FluidInference/eddy}
}

@inproceedings{nvidia-parakeet-tdt,
  title={Parakeet-TDT: Token Duration Transducer for ASR},
  author={NVIDIA NeMo Team},
  year={2024},
  url={https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2}
}

License

Apache 2.0 - See LICENSE for details.

Third-party model licenses may vary. See ThirdPartyLicenses/ for details on Parakeet TDT models (CC-BY-4.0) and other dependencies.

Acknowledgments

NVIDIA NeMo Team: Parakeet TDT architecture and base models
Intel OpenVINO: Cross-platform inference runtime and NPU support
Benchmark Datasets: LibriSpeech (OpenSLR), FLEURS (Google Research)

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
ThirdPartyLicenses		ThirdPartyLicenses
assets/audio		assets/audio
benchmarks		benchmarks
bindings/csharp		bindings/csharp
cmake		cmake
docs		docs
documentation		documentation
examples		examples
include/eddy		include/eddy
src		src
third_party		third_party
.gitignore		.gitignore
AGENTS.md		AGENTS.md
BENCHMARK.md		BENCHMARK.md
BENCHMARK_RESULTS.md		BENCHMARK_RESULTS.md
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
banner.jpg		banner.jpg
benchmark_fleurs.py		benchmark_fleurs.py
claude.md		claude.md
vcpkg.json		vcpkg.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

eddy

Building

Prerequisites

1. Install OpenVINO 2025.x

2. Install Build Tools

3. Install vcpkg

Build with vcpkg

Quick Start

Optional: Whisper Support

Usage

Basic Transcription

Device Selection

Models & Performance

Performance Comparison: eddy (OpenVINO) vs PyTorch

Roadmap

Support & Resources

Troubleshooting

NPU Not Detected

Slow Performance

Model Configuration Issues

Citation

License

Acknowledgments

About

Uh oh!

Releases 1

Packages

Contributors 2

Uh oh!

Languages

License

FluidInference/eddy-audio

Folders and files

Latest commit

History

Repository files navigation

eddy

Building

Prerequisites

1. Install OpenVINO 2025.x

2. Install Build Tools

3. Install vcpkg

Build with vcpkg

Quick Start

Optional: Whisper Support

Usage

Basic Transcription

Device Selection

Models & Performance

Performance Comparison: eddy (OpenVINO) vs PyTorch

Roadmap

Support & Resources

Troubleshooting

NPU Not Detected

Slow Performance

Model Configuration Issues

Citation

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages