Feature request - [Print "Speech" while talking] #548

Answered by snakers4
RoboEvangelist asked this question in Q&A

The VAD outputs a speech probability for each ~30 ms audio chunk, so you are free to apply any post-processing or aggregation you like.

A good starting point is to modify the logic behind the `get_speech_timestamps` function.
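
As a rough illustration of that kind of aggregation, here is a minimal sketch that turns a stream of per-chunk probabilities into start/end events using two thresholds and a short silence counter. The function name `speech_events` and the defaults (`threshold`, `neg_threshold`, `min_silence_chunks`) are assumptions made for this example, not the library's own code or settings; how you obtain the probability for each chunk depends on your setup.

```python
from typing import Iterable, Iterator, Tuple


def speech_events(
    probs: Iterable[float],
    threshold: float = 0.5,        # assumed: probability needed to enter speech
    neg_threshold: float = 0.35,   # assumed: probability below which a chunk counts as silence
    min_silence_chunks: int = 3,   # assumed: consecutive silent chunks needed to end speech
) -> Iterator[Tuple[int, str]]:
    """Yield (chunk_index, event) pairs, where event is "speech_start" or "speech_end"."""
    speaking = False
    silent_run = 0
    for i, p in enumerate(probs):
        if not speaking:
            if p >= threshold:
                speaking = True
                silent_run = 0
                yield i, "speech_start"
        elif p < neg_threshold:
            silent_run += 1
            if silent_run >= min_silence_chunks:
                speaking = False
                silent_run = 0
                yield i, "speech_end"
        else:
            # Probability recovered, so the pause was too short to end speech.
            silent_run = 0


if __name__ == "__main__":
    # Hypothetical probabilities, one per ~30 ms chunk.
    demo = [0.1, 0.2, 0.9, 0.8, 0.3, 0.9, 0.2, 0.1, 0.1, 0.05]
    for idx, event in speech_events(demo):
        if event == "speech_start":
            print(f"chunk {idx}: Speech")
        else:
            print(f"chunk {idx}: Silence")
```

To print "Speech" while talking, feed each new probability into the generator as it arrives and react to the `speech_start` event. The gap between the two thresholds keeps the state from flickering when the probability hovers near a single cutoff.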

Category: Q&A
Labels: enhancement (New feature or request)

This discussion was converted from issue #547 on October 06, 2024 07:48.