Add Whisper Audio Transcription Support #406

Trynax · 2025-09-16T04:50:04Z

Add Whisper Audio Transcription Support

Partially addresses #376

Overview

This PR implements the Whisper audio transcription portion of issue #376, providing audio-to-text capabilities across the Echo platform.

What's Implemented

SDK: Audio models (whisper-1, whisper-large-v3) with TypeScript client integration
Server: OpenAI Whisper API integration with provider pattern architecture
Examples: Complete UI components for audio recording, upload, and transcription
Testing: Production-ready smoke tests with sample audio files

Features

Multi-model support: Fast (whisper-1) and high-accuracy (whisper-large-v3) options
Dual functionality: Audio transcription and language translation
UI: File upload, audio recording, playback controls, progress indicators
Cost tracking: $0.006/minute pricing integration as per OpenAI pricing
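
As a rough illustration of the cost-tracking arithmetic, here is a minimal sketch that converts an audio duration into an estimated charge at the $0.006/minute rate quoted above. The function name `estimateTranscriptionCostUsd` and the constant name are hypothetical and not taken from this PR; the sketch also assumes simple proportional billing with no rounding or per-request minimum, which may differ from how the server actually bills.

```typescript
// Rate quoted in the PR description: $0.006 per minute of audio.
// The constant name is illustrative, not from the Echo codebase.
const WHISPER_USD_PER_MINUTE = 0.006;

// Hypothetical helper: estimate the cost of transcribing a clip.
// Assumes cost scales linearly with duration (no rounding, no minimum charge).
function estimateTranscriptionCostUsd(durationSeconds: number): number {
  if (!Number.isFinite(durationSeconds) || durationSeconds < 0) {
    throw new RangeError("durationSeconds must be a non-negative finite number");
  }
  const minutes = durationSeconds / 60;
  return minutes * WHISPER_USD_PER_MINUTE;
}

// A 90-second clip is 1.5 minutes, so the estimate is roughly $0.009.
const ninetySecondCost = estimateTranscriptionCostUsd(90);
console.log(ninetySecondCost.toFixed(4));
```

Because the result is a floating-point number, callers comparing or displaying it should round to a fixed number of decimal places rather than test for exact equality.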

Notes

Current state: Implementation complete; the server temporarily uses a hard-coded model mapping
Next steps: Publish SDK 1.0.15+ → uncomment server validation → deploy
Testing: Local tests pass; production tests fail, which is expected until the models are published
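
For readers unfamiliar with the idea, a temporary hard-coded model mapping might look something like the sketch below. This is not the code in this PR: the type names, the `tier` field, and `resolveAudioModel` are all hypothetical, and only the two model IDs and their fast/high-accuracy roles come from the description above.

```typescript
// The two model IDs named in this PR.
type AudioModelId = "whisper-1" | "whisper-large-v3";

// Hypothetical metadata shape, for illustration only.
interface AudioModelInfo {
  id: AudioModelId;
  tier: "fast" | "high-accuracy";
}

// Hard-coded table standing in for validation against the published SDK model list.
const AUDIO_MODELS: Record<AudioModelId, AudioModelInfo> = {
  "whisper-1": { id: "whisper-1", tier: "fast" },
  "whisper-large-v3": { id: "whisper-large-v3", tier: "high-accuracy" },
};

// Returns the model entry, or undefined when the name is not a known audio model.
function resolveAudioModel(name: string): AudioModelInfo | undefined {
  return Object.prototype.hasOwnProperty.call(AUDIO_MODELS, name)
    ? AUDIO_MODELS[name as AudioModelId]
    : undefined;
}

console.log(resolveAudioModel("whisper-1")?.tier);
console.log(resolveAudioModel("not-a-model"));
```

Once the SDK release that includes the audio models is published, a table like this would presumably be replaced by the server-side validation mentioned in the next steps.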

Issue #376 Status

This PR provides the audio input foundation for voice interactions in Echo apps. Users can now:

Record or upload audio files
Get accurate transcriptions via Whisper models
Build audio-enabled applications

The complete voice conversation experience can be extended in future work as needed.

vercel · 2025-09-16T04:50:11Z

@Trynax is attempting to deploy a commit to the Merit Systems Team on Vercel.

A member of the Team first needs to authorize it.

Trynax added 4 commits September 12, 2025 23:22

Add initial Whisper audio transcription support: types, models, provi…

f2e720f

…ders

Add Whisper audio transcription support for whisper-1 and whisper-lar…

64b38fd

…ge-v3

Add audio transcription examples using useEchoOpenAI

ed5a461

feat: add server-side audio transcription support and tests

af75472

resolve conflict

d78ce55

Trynax mentioned this pull request Oct 18, 2025

Add OpenAI Whisper Audio Transcription Support #584

Open
