Tuning the v4 model hyper-params #268
-
Is the v4 model more muted? The same speech segment (which includes a few vocals) produces output through the v3 model, but through the v4 model there is no output: `speech_timestamps` is an empty list.
Replies: 5 comments
-
Please provide your audio files and probability graphs.
-
The audio path is https://github.com/JJ-Guo1996/AMR-code/blob/main/audio.wav
-
What is your test result?
-
Why, with the same parameter settings (v3 with `min_speech_duration_ms=250`), does v3 produce output while v4 produces none? Did v4 introduce any optimizations?
v4 was trained not to respond to background voices.
As I can see, v4 does find speech in your example; you may need to tune the `min_speech_duration_ms` parameter of `get_speech_timestamps` (the default value is 250 ms):
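As a hedged illustration of why lowering `min_speech_duration_ms` can recover segments that are otherwise dropped: the sketch below filters a list of hypothetical speech timestamps (made-up values, in samples at an assumed 16 kHz sampling rate) by a minimum duration, mimicking the effect of that parameter inside `get_speech_timestamps`. This is not the library's internal code, just a self-contained demonstration of the duration cutoff:

```python
SR = 16_000  # assumed sampling rate in Hz

def filter_by_min_duration(timestamps, min_speech_duration_ms=250, sampling_rate=SR):
    """Keep only segments at least min_speech_duration_ms long.

    `timestamps` follows the {"start": ..., "end": ...} sample-index format
    that get_speech_timestamps returns.
    """
    min_samples = sampling_rate * min_speech_duration_ms / 1000
    return [t for t in timestamps if t["end"] - t["start"] >= min_samples]

# Hypothetical segments for illustration only:
segments = [
    {"start": 0, "end": 2400},      # 150 ms: shorter than the 250 ms default
    {"start": 8000, "end": 16000},  # 500 ms: kept at either setting
]

print(filter_by_min_duration(segments))
# → [{'start': 8000, 'end': 16000}]  (the 150 ms segment is dropped)
print(filter_by_min_duration(segments, min_speech_duration_ms=100))
# → both segments survive the lowered 100 ms cutoff
```

So if v4's probabilities cross the threshold only briefly on your clip, a 250 ms minimum can leave `speech_timestamps` empty while a smaller value (e.g. 100 ms) still yields segments.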