Speech-to-Text Service

Overview

Speech-to-Text service using gRPC and the Whisper model from Hugging Face's Transformers library. The service allows users to send audio data and receive transcribed text in response.

Features

gRPC-based communication for efficient data transfer.
Utilizes the Whisper model for high-quality speech recognition.
Includes a client for sending audio data and receiving transcriptions.
Unit tests to ensure the functionality of the service.

Requirements

Python 3.10 or higher

Setup Instructions

Clone the Repository

git clone https://github.com/your-username/speech-to-text-service.git
cd speech-to-text-service

Install Dependencies Make sure to install the required packages as mentioned above:
```
pip install -r requirements.txt
pip install -r requirements-dev.txt
```
Install Pre-commit To install pre-commit, run:
```
pip install pre-commit
```
Environment Variables Create a .env file in the root directory and set the following variables for local environment:
```
ENVIRONMENT=local
PORT=50051
```
For non-local environments (development or production), set the following variables:
```
ENVIRONMENT=development  # or production
SSL_PRIVATE_KEY_PATH=<path-to-private-key>
SSL_CERTIFICATE_CHAIN_PATH=<path-to-certificate-chain>
```
Run the gRPC Server Start the server by running:
```
python main.py
```
Run the Client In a separate terminal, run the client to send audio data:
```
python src/client/client.py
```
Run Pre-commit Hooks To ensure code quality, install and run pre-commit hooks:
```
pre-commit run --all-files
```
Code Formatting and Linting Use ruff for linting and black for code formatting:
```
ruff check .
black .
```

Testing

To run the unit tests, execute the following command:

python -m unittest discover -s src/tests

Audio Data

Place your test audio files in the data directory. The client will look for test_audio.wav by default.

Contributing

Feel free to submit issues or pull requests. Contributions are welcome!

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
protos		protos
src		src
.blackignore		.blackignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
client.py		client.py
main.py		main.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
stt_service_pb2.py		stt_service_pb2.py
stt_service_pb2_grpc.py		stt_service_pb2_grpc.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech-to-Text Service

Overview

Features

Requirements

Setup Instructions

Testing

Audio Data

Contributing

License

About

Releases

Packages

Languages

License

shivadharmi/stt-grpc-service

Folders and files

Latest commit

History

Repository files navigation

Speech-to-Text Service

Overview

Features

Requirements

Setup Instructions

Testing

Audio Data

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages