Skip to content

feat: add cloud voice agents and S2S model docs (t080)#713

Merged
marcusquinn merged 1 commit intomainfrom
feature/t080
Feb 9, 2026
Merged

feat: add cloud voice agents and S2S model docs (t080)#713
marcusquinn merged 1 commit intomainfrom
feature/t080

Conversation

@marcusquinn
Copy link
Owner

Summary

  • New: cloud-voice-agents.md — comprehensive guide for deploying production voice agents using GPT-4o Realtime (OpenAI GA), MiniCPM-o 2.6 (open-weight Apache-2.0), and NVIDIA Nemotron Speech (Riva NIM composable pipeline)
  • Updated: voice-ai-models.md with expanded S2S section, NVIDIA Magpie TTS and Parakeet ASR in cloud/enterprise tables, new enterprise priority row
  • Updated: Cross-references in speech-to-speech.md, pipecat-opencode.md, and subagent-index.toon

What's Covered

cloud-voice-agents.md (new, 468 lines)

  • Architecture overview (native S2S vs cascaded pipeline)
  • Model comparison table (GPT-4o Realtime, MiniCPM-o 2.6, NVIDIA Nemotron, Gemini 2.0 Live, AWS Nova Sonic)
  • GPT-4o Realtime: setup via OpenAI Agents SDK, Pipecat, and direct WebSocket; voices, pricing, GA migration notes
  • MiniCPM-o 2.6: basic speech conversation, streaming mode, deployment options (HF, vLLM, llama.cpp, Ollama, int4, GGUF)
  • NVIDIA Nemotron Speech: Parakeet ASR models (TDT v2 🤖 CodeRabbit AI Integration - Comprehensive Code Review Setup #1 on HF leaderboard), Magpie TTS, StudioVoice, NIM API and self-hosted setup
  • Deployment patterns: browser WebRTC, phone SIP/Twilio, self-hosted privacy, hybrid cloud+local
  • Cost comparison and framework selection guide
  • Monitoring and observability recommendations

voice-ai-models.md (updated)

  • S2S section expanded from 4 to 5 native models + new NVIDIA composable pipeline table
  • Added NVIDIA Magpie TTS to cloud TTS table
  • Added NVIDIA Riva Parakeet to cloud STT table
  • New "Enterprise" row in priority selection guide
  • Updated decision flow with enterprise/on-prem path
  • Fixed MiniCPM-o version reference (4.5 → 2.6)

Task

Resolves t080: Set up cloud voice agents and S2S models (GPT-4o-Realtime, MiniCPM-o, Nemotron)

Add comprehensive cloud-voice-agents.md covering GPT-4o Realtime (GA),
MiniCPM-o 2.6 (open-weight omni-modal), and NVIDIA Nemotron Speech
(Riva NIM composable pipeline) for deploying production voice agents.

- New: cloud-voice-agents.md with deployment patterns (browser WebRTC,
  phone SIP/Twilio, self-hosted, hybrid), cost comparison, framework
  selection guide, and code examples for all three model families
- Updated: voice-ai-models.md with expanded S2S section, NVIDIA Magpie
  TTS and Parakeet ASR in cloud tables, enterprise priority row
- Updated: speech-to-speech.md and pipecat-opencode.md with cross-refs
- Updated: subagent-index.toon with cloud-voice-agents and voice-ai-models

Decision: chose to create a single comprehensive cloud-voice-agents.md
rather than separate per-model docs — matches existing voice/ pattern
where each file covers a capability area, not a single model.
@gemini-code-assist
Copy link

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@coderabbitai
Copy link
Contributor

coderabbitai bot commented Feb 9, 2026

Warning

Rate limit exceeded

@marcusquinn has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 1 minutes and 24 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

✨ Finishing touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch feature/t080

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@github-actions
Copy link

github-actions bot commented Feb 9, 2026

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 45 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Mon Feb 9 02:26:25 UTC 2026: Code review monitoring started
Mon Feb 9 02:26:25 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 45

📈 Current Quality Metrics

  • BUGS: 0
  • CODE SMELLS: 45
  • VULNERABILITIES: 0

Generated on: Mon Feb 9 02:26:28 UTC 2026


Generated by AI DevOps Framework Code Review Monitoring

@sonarqubecloud
Copy link

sonarqubecloud bot commented Feb 9, 2026

@marcusquinn marcusquinn merged commit 8916049 into main Feb 9, 2026
11 checks passed
marcusquinn added a commit that referenced this pull request Feb 9, 2026
@marcusquinn marcusquinn deleted the feature/t080 branch February 21, 2026 01:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant