Skip to content

Tuning the v4 model hyper-params #268

Answered by adamnsandle
JJun-Guo asked this question in Q&A
Discussion options

You must be logged in to vote

v4 was trained not to respond to background voice

As i can see, v4 finds speech in you example:

you may need to tune min_speech_duration_ms parameter in get_speech_timestamps (default value is 250ms):

speech_timestamps = get_speech_timestamps(wav, model, sampling_rate=SAMPLING_RATE, min_speech_duration_ms=50)
print(speech_timestamps)  # [{'start': 3616, 'end': 4951}]

Replies: 5 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by snakers4
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
help wanted Extra attention is needed
3 participants
Converted from issue

This discussion was converted from issue #266 on November 09, 2022 09:27.