GitHub - aj3423/vosk-sound-test

A Go demo showing how to use:

VOSK for speech recognition.
miniaudio for capturing audio input from the microphone.

What it does:

Captures audio input and sends it to a local speech recognition engine
Plays back the captured sound
Records the voice to a ".wav" file
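As a rough illustration of the ".wav" recording step, the sketch below builds the 44-byte canonical header that precedes raw PCM samples in a WAV file. It is not taken from main.go; the function name wavHeader and the capture settings (16 kHz, mono, 16-bit) are assumptions chosen for the example.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

// wavHeader returns the 44-byte canonical header for uncompressed PCM audio.
// dataLen is the size of the raw sample data in bytes.
// (Hypothetical helper, not part of the original program.)
func wavHeader(sampleRate, channels, bitsPerSample, dataLen uint32) []byte {
	blockAlign := channels * bitsPerSample / 8 // bytes per sample frame
	byteRate := sampleRate * blockAlign        // bytes per second of audio

	var buf bytes.Buffer
	le := binary.LittleEndian
	buf.WriteString("RIFF")
	binary.Write(&buf, le, uint32(36+dataLen)) // size of everything after this field
	buf.WriteString("WAVE")
	buf.WriteString("fmt ")
	binary.Write(&buf, le, uint32(16)) // size of the fmt chunk for PCM
	binary.Write(&buf, le, uint16(1))  // audio format 1 = PCM
	binary.Write(&buf, le, uint16(channels))
	binary.Write(&buf, le, sampleRate)
	binary.Write(&buf, le, byteRate)
	binary.Write(&buf, le, uint16(blockAlign))
	binary.Write(&buf, le, uint16(bitsPerSample))
	buf.WriteString("data")
	binary.Write(&buf, le, dataLen)
	return buf.Bytes()
}

func main() {
	// One second of 16 kHz, mono, 16-bit silence (assumed capture settings).
	pcm := make([]byte, 16000*2)
	hdr := wavHeader(16000, 1, 16, uint32(len(pcm)))
	fmt.Println(len(hdr), len(hdr)+len(pcm)) // prints "44 32044"
}
```

Writing the header followed by the captured samples to a file yields something most audio players can open. A real recorder usually writes a placeholder header first and patches the two size fields once capturing stops.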

Usage:

Start the official VOSK server Docker image: docker run -d -p 2700:2700 alphacep/kaldi-en:latest
Run ./vosk-sound-test. It starts capturing, displays the words you say, and saves the audio to the file out.wav.
Press <Enter> to stop capturing and play back the recording.
Press <Enter> again to exit.
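For readers curious about what travels between the program and the server on port 2700, here is a minimal sketch of the JSON messages involved. The message shapes (a "config" object carrying "sample_rate", replies carrying "partial" or "text", and a closing "eof" message) are assumptions based on the example clients shipped with vosk-server, not on main.go. The WebSocket transport itself is left out so that the sketch needs only the standard library, and all Go names are hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// voskConfig is the assumed shape of the first text message sent to the server.
type voskConfig struct {
	Config struct {
		SampleRate int `json:"sample_rate"`
	} `json:"config"`
}

// eofMessage is the assumed text message that ends the audio stream.
const eofMessage = `{"eof": 1}`

// configMessage builds the JSON that announces the audio sample rate.
func configMessage(sampleRate int) ([]byte, error) {
	var m voskConfig
	m.Config.SampleRate = sampleRate
	return json.Marshal(m)
}

// reply holds the two reply fields this sketch cares about.
type reply struct {
	Partial string `json:"partial"` // words recognized so far in the current utterance
	Text    string `json:"text"`    // final text once the utterance is complete
}

// parseReply extracts the recognized words and reports whether they are final.
func parseReply(raw []byte) (words string, final bool, err error) {
	var r reply
	if err = json.Unmarshal(raw, &r); err != nil {
		return "", false, err
	}
	if r.Text != "" {
		return r.Text, true, nil
	}
	return r.Partial, false, nil
}

func main() {
	cfg, _ := configMessage(16000)
	fmt.Println(string(cfg)) // prints {"config":{"sample_rate":16000}}

	words, final, _ := parseReply([]byte(`{"partial": "hello wor"}`))
	fmt.Println(words, final) // prints "hello wor false"

	words, final, _ = parseReply([]byte(`{"text": "hello world"}`))
	fmt.Println(words, final) // prints "hello world true"

	fmt.Println(eofMessage)
}
```

In a complete client, the config message would be sent once, followed by binary frames of raw PCM audio, with a reply read after each frame.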

Troubleshooting

Low sound quality

You may be using Bluetooth earbuds such as AirPods. On systems such as Linux, the Bluetooth stack limits the input sample rate to 8000 Hz, while VOSK needs at least 16000 Hz to work well. A dedicated wired or wireless microphone should work.
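One rough way to inspect a recording is to read the sample rate stored in the header of out.wav and compare it with the 16000 minimum mentioned above. The sketch below assumes the file starts with a canonical RIFF/WAVE header whose "fmt " chunk comes first; the helper names are hypothetical. Note that it reports only the rate written into the file. If the capture library resamples, the file can claim 16000 even when the microphone delivered less, so treat the result as a hint.

```go
package main

import (
	"encoding/binary"
	"errors"
	"fmt"
	"os"
)

// minVoskRate is the minimum sample rate from the troubleshooting note, in Hz.
const minVoskRate = 16000

// wavSampleRate reads the sample rate from the start of a canonical WAV file.
// It expects at least the first 28 bytes of the file.
func wavSampleRate(header []byte) (uint32, error) {
	if len(header) < 28 {
		return 0, errors.New("header too short")
	}
	if string(header[0:4]) != "RIFF" ||
		string(header[8:12]) != "WAVE" ||
		string(header[12:16]) != "fmt " {
		return 0, errors.New("not a canonical WAV header")
	}
	// The sample rate is a little-endian uint32 at byte offset 24.
	return binary.LittleEndian.Uint32(header[24:28]), nil
}

// rateIsAdequate reports whether the rate meets the assumed VOSK minimum.
func rateIsAdequate(rate uint32) bool {
	return rate >= minVoskRate
}

func main() {
	data, err := os.ReadFile("out.wav")
	if err != nil {
		fmt.Println("cannot read out.wav:", err)
		return
	}
	rate, err := wavSampleRate(data)
	if err != nil {
		fmt.Println("cannot parse out.wav:", err)
		return
	}
	fmt.Printf("sample rate: %d Hz, adequate for VOSK: %v\n", rate, rateIsAdequate(rate))
}
```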

It doesn't work at all

Open the system sound manager and check which recording device is in use while this program is capturing. Sometimes the wrong device is chosen by default.

Build from source

Install Golang
git clone https://github.com/aj3423/vosk-sound-test
cd vosk-sound-test
go build .

License

MIT
