ONNX model detect soft hum as speech #164
wciurzynski asked this question in Q&A. Answered by snakers4.
🐛 Bug: Using the function get_speech_timestamps (from silero-vad) on the attached .wav, I got the result [{'start': 4128, 'end': 29664}], so the model detects it as speech.
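To make the reported result easier to read, the sample indices can be converted to seconds. This is only a small sketch: the helper name samples_to_seconds is hypothetical, and the 16000 Hz sampling rate is an assumption, since the post does not state the rate of the attached file.

```python
from typing import Dict, List


def samples_to_seconds(timestamps: List[Dict[str, int]],
                       sampling_rate: int) -> List[Dict[str, float]]:
    """Convert start/end sample indices into seconds (hypothetical helper)."""
    return [
        {"start": t["start"] / sampling_rate, "end": t["end"] / sampling_rate}
        for t in timestamps
    ]


# Result quoted in the question.
result = [{"start": 4128, "end": 29664}]

# ASSUMPTION: the post does not give the sampling rate; 16 kHz is a guess.
ASSUMED_SAMPLING_RATE = 16000

in_seconds = samples_to_seconds(result, ASSUMED_SAMPLING_RATE)
print(in_seconds)
```

Under that assumption, the detected segment runs from roughly 0.26 s to 1.85 s of the recording.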
Answered by snakers4, Jan 26, 2022
If you renormalize this audio (the model does this internally), you get this:
I can hear some microphone / wind (?) artefacts, which is probably why the network gets triggered. But during the white noise it gets un-triggered.
The probability chart looks like this:
For noisier data it is generally a good idea to set …
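The sentence above is cut off, so the exact setting being recommended is not known here. As an illustration only: get_speech_timestamps accepts a threshold argument on the speech probability, and the sketch below shows how a stricter threshold can suppress low-confidence triggers such as microphone artefacts. The probabilities are made up, and segments_from_probs is a hypothetical, simplified stand-in, not silero-vad's actual logic.

```python
from typing import List, Tuple


def segments_from_probs(probs: List[float],
                        threshold: float) -> List[Tuple[int, int]]:
    """Group consecutive frames with prob >= threshold into (start, end) pairs.

    The end index is exclusive. Simplified on purpose: no padding,
    no minimum speech or silence duration.
    """
    segments = []
    start = None
    for i, p in enumerate(probs):
        if p >= threshold and start is None:
            start = i                      # speech begins
        elif p < threshold and start is not None:
            segments.append((start, i))    # speech ends
            start = None
    if start is not None:                  # still triggered at the end
        segments.append((start, len(probs)))
    return segments


# Made-up per-frame probabilities: a weak artefact (~0.6), then clear speech (~0.95).
probs = [0.1, 0.6, 0.65, 0.55, 0.2, 0.1, 0.9, 0.95, 0.97, 0.3]

default_segments = segments_from_probs(probs, threshold=0.5)
strict_segments = segments_from_probs(probs, threshold=0.8)
print(default_segments)
print(strict_segments)
```

With these invented numbers, the lower threshold reports both the artefact and the speech, while the stricter one keeps only the speech. On real audio the right value depends on the data, so it is worth checking against the probability chart.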