Speech recognition and text-to-speech using OpenAI and gTTS

This script allows you to use speech recognition to input a prompt, send the prompt to OpenAI to generate a response, and then use gTTS to convert the response to an audio file and play the audio file.

Prerequisites

You need to have a valid OpenAI API key. You can sign up for a free API key at https://beta.openai.com/.
You need to install the following packages: openai, gTTS, pyaudio, SpeechRecognition, playsound. You can install these packages using pip install openai gTTS pyaudio SpeechRecognition playsound or use pipenv if you wish to contain a virtual environment.

Usage

Replace YOUR_API_KEY_HERE in the following line with your actual OpenAI API key: openai.api_key = "YOUR_API_KEY_HERE"
Run the script using python smart_speaker.py.
The script will prompt you to say something. Speak a sentence into your microphone. You may need to allow the program permission to access your microphone on a Mac, a prompt should appear when running the program.
The script will send the spoken sentence to OpenAI, generate a response using the text-to-speech model, and play the response as an audio file.

Customization

You can change the OpenAI model engine by modifying the value of model_engine. For example, to use the "davinci" engine, set model_engine = "davinci".
You can change the language of the generated audio file by modifying the value of language. For example, to generate audio in French, set language = 'fr'.
You can adjust the temperature parameter in the following line to control the randomness of the generated response:

response = openai.Completion.create(
engine=model_engine,
prompt=prompt,
max_tokens=1024,
n=1,
temperature=0.7,
)

Higher values of temperature will result in more diverse and random responses, while lower values will result in more deterministic responses.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
smart_speaker.py		smart_speaker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech recognition and text-to-speech using OpenAI and gTTS

Prerequisites

Usage

Customization

About

Releases

Packages

Languages

License

BlockOne987/ChatGPT-OpenAI-Smart-Speaker

Folders and files

Latest commit

History

Repository files navigation

Speech recognition and text-to-speech using OpenAI and gTTS

Prerequisites

Usage

Customization

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages