A Python GUI application for speech-to-text transcription using OpenAI's Whisper model.
- Record audio from microphone
- Transcribe speech to text using Whisper models
- Automatic clipboard copying of transcribed text
- Audio device selection
- Configurable global hotkeys for system-wide recording control
- Local and global keyboard shortcuts
- Automatic temporary file cleanup
- Modern state-driven UI with progress indicators
- Python 3.9+
- Audio input device (microphone)
- CUDA 12.8 Toolkit
- ffmpeg installation
-
Install UV (if not already installed):
pip install uv
-
Clone or download the project:
git clone <repository-url> cd SimpleWhisper
-
Install dependencies:
cd src uv sync
-
Run the application:
cd src uv run python main.py
Ctrl+R
- Start recordingCtrl+S
- Stop recordingCtrl+T
- Open settings
Ctrl+Shift+R
- Toggle recording (default)Ctrl+Shift+S
- Stop recording (default)
Configure global hotkeys in Settings > Global Hotkeys tab.
- Audio Device: Select microphone input device
- Global Hotkeys: Configure system-wide keyboard shortcuts
- Whisper Model: Choose transcription model for accuracy vs speed
Settings are automatically saved to settings.json
.