We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The text was updated successfully, but these errors were encountered:
你好,@youzeliang , 程序报错的原因是有些音频无法转换,这可能是speech_recognition模组的问题,至少我没有发现该代码仓中在此方面有其他问题。合并后的txt文件没有东西是因为:在有些音频无法转换为txt的情况下,这些txt文件就不存在了(没有创建),所以无法将其合并。 解决方法:将file_utils.py文件中的combine_text函数的:
if os.path.exists(text_file): with open(text_file) as f: k.write(f.read() + "\r\n") else: break
中的break换成continue。 这样程序就会跳过不存在的txt文件,只合并存在的。 另外你还可以更改convert_by_google函数,为转化失败的音频文件创建空txt文件,亦可。
又: 我最近在做视频台词提取,暂时没有找到好的语音转文字api,想问问大家是否有好的推荐,谢谢。 针对目前代码仓中的处理来说,恰如作者所言,切分30秒来提取台词语义的效果并不好,想请教大家是否有做图片字幕识别的项目(如使用ocr, opencv)可供学习推荐,感谢。
Sorry, something went wrong.
No branches or pull requests
The text was updated successfully, but these errors were encountered: