Skip to content

Is it normal that saying "hello" isn't detected well? #585

Answered by snakers4
qwbarch asked this question in Q&A
Discussion options

You must be logged in to vote

Loading different versions like this:

  model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad:v3.1',
                                model='silero_vad',
                                force_reload=True,
                                onnx=USE_ONNX)

  (get_speech_timestamps,
  save_audio,
  read_audio,
  VADIterator,
  collect_chunks) = utils
v3.1

v4.0

v5.1

So, in your particular case, I would suggest these params:

speech_timestamps = get_speech_timestamps(wav, model, sampling_rate=SAMPLING_RATE, visualize_probs=True, threshold=0.15)

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@qwbarch
Comment options

@snakers4
Comment options

Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants