-
Hello! I'm sorry this will probably be a really dumb question, but I'm afraid I don't know where else to investigate the answer: What would the optimal sample rate be for input to whisper? Seems too high will slow it down with too much data, and too low may cause lower quality. I'm not expert so I'm sure it will seem like I have no idea what I'm talking about! Anyway, I'm sure you're all super busy, so no worries if you can't reply--just thank you for reading this far! :) Have a good one! 😍 |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 7 replies
-
Hi! Regardless of the sampling rate used in the original audio file, the audio signal gets resampled to 16kHz (via |
Beta Was this translation helpful? Give feedback.
-
Also, in case it's useful to anyone, the module's audio.py handles the conversion. Here's a chopped up snippet so you can see the format, sample rate, etc.:
|
Beta Was this translation helpful? Give feedback.
Hi!
Regardless of the sampling rate used in the original audio file, the audio signal gets resampled to 16kHz (via
ffmpeg
). So it should work with the recordings you have (likely 44.1 or 48 kHz). If you're creating new recordings and have an option to record in 16 kHz, it may become marginally faster since it can skip resampling and use less space than using a higher sample rate. Although, you'd probably not want to do this for the sake of keeping the recording in a higher audio quality.