English | 中文
Video Subtitle Master is a powerful desktop application for batch generating subtitles for videos and translating them into other languages. This project is an enhanced version of the open-source VideoSubtitleGenerator, redesigned as a user-friendly client tool.
Note
The current release has been tested on macOS. Windows testing was done in a virtual environment. If you encounter any issues, please feel free to open an Issue for feedback.
This application retains all the features of the original VideoSubtitleGenerator command-line tool, with the following enhancements:
- Graphical user interface for easier operation
- Source and target language subtitle files are saved in the same directory as the video for convenient subtitle attachment during playback
- Batch processing of video/audio/subtitle files
- Support for generating subtitles from video or audio files
- Ability to translate generated or imported subtitles
- Multiple translation services supported:
- Volcano Engine Translation
- Baidu Translation
- DeepLX Translation (Note: Batch translation may be rate-limited)
- Local Ollama model translation
- Support for OpenAI-style API translations (e.g., DeepSpeed)
- Customizable subtitle file naming for compatibility with various media players
- Flexible translated subtitle content: choose between pure translation or original + translated subtitles
- Integrated
whisper.cpp
with optimization for Apple Silicon, offering faster generation speeds - Built-in
fluent-ffmpeg
, eliminating the need for separateffmpeg
installation - Support for running locally installed
whisper
command - Option to choose model download source (domestic mirror or official source)
- Customizable number of concurrent tasks
Starting from version 1.20.0, Core ML is supported on Apple Silicon, providing faster speech recognition. For users who have previously installed older versions, please uninstall the old version first and then reinstall the new version. Additionally, select "Reinstall Whisper" in the settings interface to enable Core ML acceleration.
This project supports various translation services, including Baidu Translation, Volcano Engine Translation, DeepLX, local Ollama models, and OpenAI-style APIs. Using these services requires the appropriate API keys or configurations.
For information on obtaining API keys for services like Baidu Translation and Volcano Engine, please refer to https://bobtranslate.com/service/. We appreciate the information provided by Bob, an excellent software tool.
- Go to the releases page and download the appropriate package for your operating system
- Install and run the program
- Configure the desired translation services within the application
- Select the video or subtitle files you want to process
- Set relevant parameters (e.g., source language, target language, model)
- Start the processing task
1️⃣ Clone the project locally
git clone https://github.com/buxuku/video-subtitle-master.git
2️⃣ Install dependencies using yarn install
or npm install
cd video-subtitle-master
yarn install
3️⃣ After installing dependencies, run yarn start
or npm start
to launch the project
yarn start
Due to the large size of model files, downloading them through the software may be challenging. You can manually download models and import them into the application. Here are two links for downloading models:
-
Domestic mirror (faster download speeds): https://hf-mirror.com/ggerganov/whisper.cpp/tree/main
-
Hugging Face official source: https://huggingface.co/ggerganov/whisper.cpp/tree/main
After downloading, you can import the model files into the application using the "Import Model" feature on the "Model Management" page.
Import steps:
- On the "Model Management" page, click the "Import Model" button.
- In the file selector that appears, choose your downloaded model file.
- After confirming the import, the model will be added to your list of installed models.
Execute the following command in the terminal:
sudo xattr -dr com.apple.quarantine /Applications/Video\ Subtitle\ Master.app
Then try running the application again.
Issues and Pull Requests are welcome to help improve this project!
This project is licensed under the MIT License. See the LICENSE file for details.