-
Notifications
You must be signed in to change notification settings - Fork 5
feat: add hyprwhspr speech-to-text subagent for Linux (t027) #575
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
WalkthroughRegisters hyprwhspr, a native Linux speech-to-text tool, within the subagent infrastructure by updating the voice tools entry, introducing comprehensive documentation covering installation, CLI commands, transcription backends, and GPU acceleration options, and marking related implementation tasks as complete. Changes
Estimated Code Review Effort🎯 2 (Simple) | ⏱️ ~8 minutes Possibly Related PRs
Poem
🚥 Pre-merge checks | ✅ 4 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
🔍 Code Quality Report�[0;35m[MONITOR]�[0m Code Review Monitoring Report �[0;34m[INFO]�[0m Latest Quality Status: �[0;34m[INFO]�[0m Recent monitoring activity: 📈 Current Quality Metrics
Generated on: Sun Feb 8 05:12:42 UTC 2026 Generated by AI DevOps Framework Code Review Monitoring |
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@TODO.md`:
- Line 267: The Notes entry for t071 contains a duplicated blocker string; open
TODO.md, find the t071 Notes line containing `BLOCKED: Max retries exceeded:
clean_exit_no_signal` and remove the accidental second occurrence so the blocker
appears only once (ensure the line reads with a single `BLOCKED: Max retries
exceeded: clean_exit_no_signal` and clean up any extra whitespace/newline
introduced).
🧹 Nitpick comments (2)
.agents/tools/voice/hyprwhspr.md (2)
43-57:curl | bashinstall pattern contradicts project security stance (t104).The Debian/Ubuntu and Fedora/openSUSE sections both recommend
curl -fsSL ... | bash, the exact pipe-to-shell anti-pattern that t104 hardened against across the codebase. While these are upstream's instructions, the subagent doc is the project's own recommendation surface — consider noting the safer alternative (download-then-inspect-then-execute) consistent with theverified_install()pattern established in setup.sh.Also, the Debian and Fedora install blocks are identical — if the upstream commands are truly the same for both, consider consolidating or noting that explicitly.
Proposed safer install pattern
### Debian / Ubuntu ```bash -curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh | bash +# Download and review before executing +curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh -o /tmp/install-deps.sh +bash /tmp/install-deps.sh git clone https://github.com/goodroot/hyprwhspr.git ~/hyprwhspr cd ~/hyprwhspr && ./bin/hyprwhspr setupFedora / openSUSE
-curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh | bash +# Download and review before executing +curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh -o /tmp/install-deps.sh +bash /tmp/install-deps.sh git clone https://github.com/goodroot/hyprwhspr.git ~/hyprwhspr cd ~/hyprwhspr && ./bin/hyprwhspr setup</details> --- `19-19`: **Star count will go stale.** `790+ stars` is a point-in-time snapshot. This is a minor nit — consider either dropping the count or noting it's approximate/as-of-date. Other subagents in the repo appear to include star counts too, so this is consistent at least. </blockquote></details> </blockquote></details> <!-- This is an auto-generated comment by CodeRabbit for review status -->
| - Notes: seo/backlink-checker.md created (106 lines). Covers Ahrefs/DataForSEO backlink APIs, WHOIS expired domain detection, reclamation workflow, and integration with existing SEO subagents. | ||
| - [ ] t071 Voice AI models for speech generation and transcription #tools #voice #ai ~30m (ai:20m test:5m read:5m) ref:GH#502 logged:2026-01-24 related:t027 | ||
| - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal | ||
| - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal BLOCKED: Max retries exceeded: clean_exit_no_signal |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Duplicate BLOCKED note on t071.
The Notes field for t071 now contains BLOCKED: Max retries exceeded: clean_exit_no_signal twice in succession. This looks like an accidental double-append from a failed supervisor retry cycle.
Proposed fix — remove the duplicate
- - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal BLOCKED: Max retries exceeded: clean_exit_no_signal
+ - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
| - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal BLOCKED: Max retries exceeded: clean_exit_no_signal | |
| - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal |
🤖 Prompt for AI Agents
In `@TODO.md` at line 267, The Notes entry for t071 contains a duplicated blocker
string; open TODO.md, find the t071 Notes line containing `BLOCKED: Max retries
exceeded: clean_exit_no_signal` and remove the accidental second occurrence so
the blocker appears only once (ensure the line reads with a single `BLOCKED: Max
retries exceeded: clean_exit_no_signal` and clean up any extra
whitespace/newline introduced).



Summary
.agents/tools/voice/hyprwhspr.mdcovering installation (Arch AUR, Debian/Ubuntu, Fedora/openSUSE), CLI commands, transcription backends (Parakeet, Whisper, onnx-asr, REST API), GPU acceleration, configuration, and troubleshootingsubagent-index.toonundertools/voice/Closes #511
Summary by CodeRabbit
New Features
Documentation
Chores