Skip to content

Feature request - API to return the raw speech probabilities #274

Answered by adamnsandle
ryanheise asked this question in Q&A
Discussion options

You must be logged in to vote

You can use .audio_forward method for that purpose. Both models got this method in the latest v4 release.

wav = read_audio('files/en.wav')
raw_probs = model.audio_forward(wav, sr=16000, num_samples=512)
# tensor([[0.0948, 0.1472, 0.1674,  ..., 0.9034, 0.9971, 0.9988]])

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
enhancement New feature or request
3 participants
Converted from issue

This discussion was converted from issue #273 on November 21, 2022 11:01.