-
Notifications
You must be signed in to change notification settings - Fork 378
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] 语音输入和输出支持 #208
Comments
Title: [Feature]
**What features do you want or have any suggestions? ** **Are there any similar competing products that we can refer to? ** |
点击语音报错:json f "stack": "Error: Failed to execut |
Click the voice error report: json f "stack": "Error: Failed to execute |
确定一下是否可以使用 openai 的 tts 模型 |
Determine whether you can use openai’s tts model |
手动调用api发现无法正常使用tts模型,感谢大佬回复,谢谢。 |
I manually called the API and found that the tts model could not be used normally. Thank you for your reply. Thank you. |
@Hk-Gosuto 大佬,语音输入的最终的效果演示图是什么样的?需要key支持什么模型?readme上写的需要https访问是指网址需要https域名吗,这个的原因是? |
@Hk-Gosuto Sir, what is the final effect of voice input? What model does the key need to support? The readme that requires https access means that the website requires an https domain name. What is the reason for this? |
设置里开启后,发送按钮会变成语音输入,点击后开始说话,说完再点停止就行。 |
这个为啥不是通过调用openai的api实现的? |
Why is this not achieved by calling OpenAI’s API? |
这个不收费,识别效果也挺好的,为啥要用 wishper? |
There is no charge for this, and the recognition effect is pretty good. Why use wishper? |
噢噢好的 |
Oh ok |
可以多试一些场景,如果复杂场景效果不好的话,后面会考虑增加 wishper 适配。 |
You can try more scenes. If the effect of complex scenes is not good, we will consider adding wishper adaptation later. |
当使用openai tts时,每一次让它说,都会重新申请一次tts请求,能不能第一次就把语音下载到本地,过后重新听就不浪费请求了 |
When using openai tts, every time it is asked to speak, it will re-apply for a tts request. Can the voice be downloaded to the local for the first time, so that it can be listened to again later without wasting the request? |
我看看能不能把音频丢 indexedDB 里,可以先切换到 edge tts 那个不产生费用。 |
I'll see if I can throw the audio into indexedDB. I can switch to edge tts first which doesn't incur any charges. |
你想要什么功能或者有什么建议?
支持语音转文字和文字转语音的功能。
有没有可以参考的同类竞品?

类似下面这个人的项目里的功能:https://github.com/vual/ChatGPT-Next-Web-Pro
The text was updated successfully, but these errors were encountered: