Is it normal that saying "hello" isn't detected well? #585

qwbarch · 2024-12-09T03:11:31Z

qwbarch
Dec 9, 2024

I noticed that when I say "hello" the probability is always very low. If I say "hello, test test", the part where i say "test test" is completely fine, whereas the "hello" has a low probability as usual.

This is on both the v4 and v5 model. Is this expected behavior?

Answered by snakers4

Dec 10, 2024

Loading different versions like this:

  model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad:v3.1',
                                model='silero_vad',
                                force_reload=True,
                                onnx=USE_ONNX)

  (get_speech_timestamps,
  save_audio,
  read_audio,
  VADIterator,
  collect_chunks) = utils

v3.1

v4.0

v5.1

So, in your particular case, I would suggest these params:

speech_timestamps = get_speech_timestamps(wav, model, sampling_rate=SAMPLING_RATE, visualize_probs=True, threshold=0.15)

View full answer

snakers4 · 2024-12-09T05:07:44Z

snakers4
Dec 9, 2024
Maintainer

Can you send an audio example?

2 replies

qwbarch Dec 10, 2024
Author

Whoops, sorry for not providing an example. Here is an audio clip:
https://drive.google.com/file/d/1MvX4tMAhDjtzq3mI32JJgt2r5-VzPiaP/view?usp=sharing

On the v4 model, this will still have a low probability (around 0.1f-0.3f if I remember right), but on the v5 model it seems to think there's no speech detected at all

snakers4 Dec 10, 2024
Maintainer

Loading different versions like this:

  model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad:v3.1',
                                model='silero_vad',
                                force_reload=True,
                                onnx=USE_ONNX)

  (get_speech_timestamps,
  save_audio,
  read_audio,
  VADIterator,
  collect_chunks) = utils

v3.1

v4.0

v5.1

So, in your particular case, I would suggest these params:

speech_timestamps = get_speech_timestamps(wav, model, sampling_rate=SAMPLING_RATE, visualize_probs=True, threshold=0.15)

Answer selected by snakers4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it normal that saying "hello" isn't detected well? #585

{{title}}

Replies: 1 comment 2 replies

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Is it normal that saying "hello" isn't detected well? #585

qwbarch Dec 9, 2024

Replies: 1 comment · 2 replies

snakers4 Dec 9, 2024 Maintainer

qwbarch Dec 10, 2024 Author

snakers4 Dec 10, 2024 Maintainer

qwbarch
Dec 9, 2024

Replies: 1 comment 2 replies

snakers4
Dec 9, 2024
Maintainer

qwbarch Dec 10, 2024
Author

snakers4 Dec 10, 2024
Maintainer