@marcusquinn
Owner

@marcusquinn marcusquinn commented Feb 8, 2026

Summary

  • Add hyprwhspr subagent documentation at .agents/tools/voice/hyprwhspr.md covering installation (Arch AUR, Debian/Ubuntu, Fedora/openSUSE), CLI commands, transcription backends (Parakeet, Whisper, onnx-asr, REST API), GPU acceleration, configuration, and troubleshooting
  • Register hyprwhspr in subagent-index.toon under tools/voice/
  • Mark t027 complete in TODO.md

Closes #511

Summary by CodeRabbit

  • New Features

    • Integrated hyprwhspr as a native speech-to-text tool for Linux Wayland systems, expanding Voice AI capabilities with local dictation support.
  • Documentation

    • Added comprehensive guide covering installation, usage patterns, configuration options, and troubleshooting for the new speech-to-text tool.
  • Chores

    • Updated project task completion statuses.

@gemini-code-assist

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@coderabbitai
Contributor

coderabbitai bot commented Feb 8, 2026

Walkthrough

Registers hyprwhspr, a native Linux speech-to-text tool, within the subagent infrastructure by updating the voice tools entry, introducing comprehensive documentation covering installation, CLI commands, transcription backends, and GPU acceleration options, and marking related implementation tasks as complete.

Changes

| Cohort / File(s) | Summary |
|---|---|
| Subagent Registration (.agents/subagent-index.toon) | Added hyprwhspr tag to tools/voice entry dependencies, registering the speech-to-text tool as an available component alongside existing voice bridge infrastructure. |
| Tool Documentation (.agents/tools/voice/hyprwhspr.md) | New documentation file providing comprehensive reference for hyprwhspr, including installation steps (Arch/Debian/Fedora/openSUSE), CLI command reference, supported transcription backends, GPU acceleration guidance, configuration options, and troubleshooting. |
| Project Tracking (TODO.md) | Marked t027 (hyprwhspr support), t111 (objective runner with safety), and t133 (cloud GPU deployment) as completed; added duplicate BLOCKED note to t071. |

Estimated Code Review Effort

🎯 2 (Simple) | ⏱️ ~8 minutes

Possibly Related PRs

Poem

🎙️ Whispers caught on Linux winds,
hyprwhspr's magic now begins,
From Wayland's realm to text so clear,
Speech-to-text brought ever near! ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1
❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Out of Scope Changes check | ⚠️ Warning | Minor scope creep detected: TODO.md changes include unrelated task completions (t133, t111) and duplicate BLOCKED note for t071, beyond the primary t027 objective. | Consider isolating t027 implementation in a focused commit; defer unrelated TODO updates (t133, t111, t071) to separate maintenance PR for cleaner scope and easier rollback if needed. |

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately describes the primary change: adding hyprwhspr speech-to-text support with explicit task reference (t027), directly matching the changeset's main feature addition. |
| Linked Issues check | ✅ Passed | The PR fully implements issue #511 (t027) requirements: adds hyprwhspr documentation covering installation, configuration, and supported backends for Arch/Omarchy Linux, registers it in subagent-index.toon, and marks the task complete. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |

@github-actions

github-actions bot commented Feb 8, 2026

🔍 Code Quality Report

[MONITOR] Code Review Monitoring Report

[INFO] Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 36 code smells

[INFO] Recent monitoring activity:
Sun Feb 8 05:12:39 UTC 2026: Code review monitoring started
Sun Feb 8 05:12:40 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 36

📈 Current Quality Metrics

  • BUGS: 0
  • CODE SMELLS: 36
  • VULNERABILITIES: 0

Generated on: Sun Feb 8 05:12:42 UTC 2026


Generated by AI DevOps Framework Code Review Monitoring

@sonarqubecloud

sonarqubecloud bot commented Feb 8, 2026

@marcusquinn marcusquinn merged commit f83bb65 into main Feb 8, 2026
10 of 11 checks passed
Contributor

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

🤖 Fix all issues with AI agents
In `@TODO.md`:
- Line 267: The Notes entry for t071 contains a duplicated blocker string; open
TODO.md, find the t071 Notes line containing `BLOCKED: Max retries exceeded:
clean_exit_no_signal` and remove the accidental second occurrence so the blocker
appears only once (ensure the line reads with a single `BLOCKED: Max retries
exceeded: clean_exit_no_signal` and clean up any extra whitespace/newline
introduced).
🧹 Nitpick comments (2)
.agents/tools/voice/hyprwhspr.md (2)

43-57: curl | bash install pattern contradicts project security stance (t104).

The Debian/Ubuntu and Fedora/openSUSE sections both recommend curl -fsSL ... | bash, the exact pipe-to-shell anti-pattern that t104 hardened against across the codebase. While these are upstream's instructions, the subagent doc is the project's own recommendation surface — consider noting the safer alternative (download-then-inspect-then-execute) consistent with the verified_install() pattern established in setup.sh.

Also, the Debian and Fedora install blocks are identical — if the upstream commands are truly the same for both, consider consolidating or noting that explicitly.

Proposed safer install pattern

````diff
 ### Debian / Ubuntu

 ```bash
-curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh | bash
+# Download and review before executing
+curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh -o /tmp/install-deps.sh
+bash /tmp/install-deps.sh
 git clone https://github.com/goodroot/hyprwhspr.git ~/hyprwhspr
 cd ~/hyprwhspr && ./bin/hyprwhspr setup
 ```

 ### Fedora / openSUSE

 ```bash
-curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh | bash
+# Download and review before executing
+curl -fsSL https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh -o /tmp/install-deps.sh
+bash /tmp/install-deps.sh
 git clone https://github.com/goodroot/hyprwhspr.git ~/hyprwhspr
 cd ~/hyprwhspr && ./bin/hyprwhspr setup
 ```
````
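
The download-then-inspect-then-execute idea raised in this comment can be taken one step further by pinning a checksum, so the script that runs is the one that was reviewed. The following is only a minimal sketch under stated assumptions: `verify_sha256` and `fetch_verify_run` are hypothetical names, this is not the project's actual `verified_install()` from setup.sh, and the expected digest is assumed to be one you recorded yourself after reading the script (no upstream-published checksum is assumed to exist).

```shell
# Hypothetical helpers for illustration; NOT the project's verified_install().

# Return 0 if the file's SHA-256 digest equals the expected hex string.
verify_sha256() {
  local file="$1" expected="$2" actual
  actual="$(sha256sum "$file" | awk '{print $1}')" || return 1
  [ "$actual" = "$expected" ]
}

# Download a script to a temp file, verify its digest, and only then run it.
fetch_verify_run() {
  local url="$1" expected="$2" tmp status
  tmp="$(mktemp)" || return 1
  if ! curl -fsSL "$url" -o "$tmp"; then
    echo "download failed: $url" >&2
    rm -f "$tmp"
    return 1
  fi
  if ! verify_sha256 "$tmp" "$expected"; then
    echo "checksum mismatch for $url; refusing to execute" >&2
    rm -f "$tmp"
    return 1
  fi
  bash "$tmp"
  status=$?
  rm -f "$tmp"
  return "$status"
}
```

A call would look like `fetch_verify_run "https://raw.githubusercontent.com/goodroot/hyprwhspr/main/scripts/install-deps.sh" "<digest recorded at review time>"`. The trade-off is that the pinned digest has to be refreshed whenever upstream changes the script, which is arguably the point: a changed script gets reviewed again before it runs.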

19-19: Star count will go stale.

`790+ stars` is a point-in-time snapshot. This is a minor nit; consider either dropping the count or noting it's approximate/as-of-date. Other subagents in the repo appear to include star counts too, so this is consistent at least.

- Notes: seo/backlink-checker.md created (106 lines). Covers Ahrefs/DataForSEO backlink APIs, WHOIS expired domain detection, reclamation workflow, and integration with existing SEO subagents.
- [ ] t071 Voice AI models for speech generation and transcription #tools #voice #ai ~30m (ai:20m test:5m read:5m) ref:GH#502 logged:2026-01-24 related:t027
- Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal
- Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal BLOCKED: Max retries exceeded: clean_exit_no_signal

⚠️ Potential issue | 🟡 Minor

Duplicate BLOCKED note on t071.

The Notes field for t071 now contains BLOCKED: Max retries exceeded: clean_exit_no_signal twice in succession. This looks like an accidental double-append from a failed supervisor retry cycle.

Proposed fix — remove the duplicate
-  - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal BLOCKED: Max retries exceeded: clean_exit_no_signal
+  - Notes: Add support for voice AI models covering both TTS (speech generation) and STT (transcription). API providers: Hugging Face Inference API (TTS/STT endpoints), ElevenLabs, OpenAI TTS/Whisper. Local models: Qwen3-TTS (0.6B/1.7B, Apache-2.0, 10 languages, voice clone/design/custom, streaming, vLLM support - https://github.com/QwenLM/Qwen3-TTS), Whisper (transcription), Bark, Coqui TTS. Create subagent at tools/voice/ or tools/ai/voice.md covering: model selection (local vs API, quality vs speed), installation (pip install qwen-tts, HF download), usage patterns (TTS generation, voice cloning, voice design, transcription), streaming support, GPU requirements. Related to t027 (hyprwhspr speech-to-text). BLOCKED: Max retries exceeded: clean_exit_no_signal
🤖 Prompt for AI Agents
In `@TODO.md` at line 267, The Notes entry for t071 contains a duplicated blocker
string; open TODO.md, find the t071 Notes line containing `BLOCKED: Max retries
exceeded: clean_exit_no_signal` and remove the accidental second occurrence so
the blocker appears only once (ensure the line reads with a single `BLOCKED: Max
retries exceeded: clean_exit_no_signal` and clean up any extra
whitespace/newline introduced).
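
For a one-off fix, editing the line by hand is enough. If the double-append recurs, the cleanup described above could be scripted. The sketch below assumes a hypothetical helper name, `dedupe_marker`, and assumes the marker string contains no regular-expression metacharacters (true for the blocker string quoted in this comment). It collapses consecutive space-separated repeats of the marker on each line of standard input.

```shell
# Hypothetical helper: collapse consecutive repeats of a marker string
# ("M M M" becomes "M") on every line read from stdin.
# Assumes the marker has no regex metacharacters.
dedupe_marker() {
  local marker="$1"
  sed -E "s/(${marker})( ${marker})+/\1/g"
}
```

For example, `dedupe_marker 'BLOCKED: Max retries exceeded: clean_exit_no_signal' < TODO.md` prints the cleaned file to standard output, leaving TODO.md itself untouched so the result can be diffed before it is written back.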

Development

Successfully merging this pull request may close these issues.

t027: Add hyprwhspr speech-to-text support (Arch/Omarchy Linux only)
