Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
30 changes: 30 additions & 0 deletions documentation/docs/guides/managing-goose-sessions.md
Original file line number Diff line number Diff line change
Expand Up @@ -263,6 +263,36 @@ Export sessions to Markdown to share with your team, create documentation, archi

For more details on export options, available flags, and output formats, see the [CLI commands documentation](/docs/guides/goose-cli-commands#session-export-options).

## Voice Dictation
Speak to Goose directly instead of typing your prompts.

<Tabs groupId="interface">
<TabItem value="ui" label="Goose Desktop" default>
To enable voice dictation:
1. Click `⚙️` in the upper right corner
2. Click **Advanced Settings**
3. Scroll to **Voice Dictation**
4. Toggle **Enable Voice Dictation** on
5. Choose between **OpenAI Whisper** or **ElevenLabs** as your dictation provider
6. Enter your API key for the provider you chose
7. Click the back button at the top of the settings screen to return to the chat. A microphone icon appears to the right of the text input field.

**If you don't see the microphone icon**, check the [models you have configured](/docs/getting-started/providers.md). ElevenLabs can be used as a dictation provider alongside any LLM, but OpenAI Whisper requires that you have an OpenAI model configured in Goose, even if using another LLM provider for chat.

To start using voice dictation, click the microphone icon and begin speaking. The first time, Goose will request access to your microphone. While recording, you'll see a live waveform of your audio in the input field, a timer, and the current size of your recording. Click the microphone button again to finish recording.

#### Important Notes
* You can record up to 10 minutes or 25MB of audio.
* The audio is processed by your chosen provider (OpenAI or ElevenLabs).
* Voice input is appended to any existing text in the text input field, so you can combine typing and speaking your prompts.
* Recordings are not stored locally after transcription.

</TabItem>
<TabItem value="cli" label="Goose CLI">
Voice dictation is not available in the Goose CLI.
</TabItem>
</Tabs>

## Search Within Sessions

Search allows you to find specific content within your current session. The search functionality is available in both CLI and Desktop interfaces.
Expand Down
Loading