[Cartesia] Use up-to-date opts in _sentence_stream_task #3500

seanmuirhead · 2025-09-25T01:00:29Z

Desired Behavior

The agent takes in user input, and is able to update the voice before responding via TTS.update_options()
Every utterance afterwards will be in the updated voice, effective immediately

Actual Behavior

The agent takes in user input and is able to update the voice
However, that voice is only reflected in the next turn, not the current one

Approach

I am open to other approaches. This seemed like the easiest one

Cartesia Docs:
https://docs.cartesia.ai/api-reference/tts/tts

longcw · 2025-09-25T01:14:15Z

livekit-plugins/livekit-plugins-cartesia/livekit/plugins/cartesia/tts.py

-                token_pkt = base_pkt.copy()
+                # The opts may have changed between the time this class was instantiated and the time we start receiving
+                # sentences to synthesize. We use the latest options here by doing self._tts._opts instead of self._opts.
+                token_pkt = _to_cartesia_options(self._tts._opts, streaming=True)


could you explain in what case you want to update the options after the tts_node started?

Series of Events:

User Speaks ("I want to talk to Katie")

llm_node_1 starts, calls update_options(voice=KATIE)

tts_node_1 starts with voice=KATIE

User interrupts the agent ("actually I want to speak to Max") -> llm_node_1 cancels, but tts_node continues

llm_node_2 starts, calls update_options(voice=MAX)

tts_node_1 synthesizes the LLM response, but in the KATIE voice instead of the MAX voice

Desired Behavior:

At step 6, we want the TTS to synthesize in the MAX voice, not the KATIE voice

Please let me know if this is reasonable and/or you plan to allow this functionality.
I think it is reasonable to expect the TTS to synthesize with the most up-to-date options.

llm_node_2 starts, calls update_options(voice=MAX)
tts_node_1 synthesizes the LLM response, but in the KATIE voice instead of the MAX voice

does this actually happen? a new generation will create a new tts stream, ideally there should be a tts_node_2 for the llm_node_2.

Perhaps only one LLM node persists.

The behavior can be replicated, though, by doing something like this:

In the llm_node, call update_options with the new voice.

This new voice is NOT reflected by the time we get to synthesizing. Only in the next turn is it updated.

If you make the change in this PR, the new voice will be reflected.
We need this by EOD, so will be hacking a version of the Cartesia.TTS() plugin in the meantime.

I see, it's not applied because the tts_node is created in parallel with llm_node, before the update_options in llm_node is called.

instead of using options from tts instance, we may still want each tts stream has a copy of the options. maybe we should allow to create a new tts_node in the llm_node with the updated options, this will fix the issue for all TTS.

instead of using options from tts instance, we may still want each tts stream has a copy of the options.

I agree with this. It makes sense for stream options to be immutable once instantiated.

maybe we should allow to create a new tts_node in the llm_node with the updated options

What about a tts_node.restart() or tts_node.refresh() of some sort? I can also create new tts_node from within the llm_node but less clear how I would do that. Will take a look later this week

[Cartesia] Use up-to-date opts in _sentence_stream_task

d9707dd

longcw reviewed Sep 25, 2025

View reviewed changes

seanmuirhead requested a review from longcw September 26, 2025 18:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Cartesia] Use up-to-date opts in _sentence_stream_task #3500

[Cartesia] Use up-to-date opts in _sentence_stream_task #3500

Uh oh!

seanmuirhead commented Sep 25, 2025 •

edited

Loading

Uh oh!

longcw Sep 25, 2025

Uh oh!

seanmuirhead Sep 26, 2025

Uh oh!

longcw Sep 29, 2025

Uh oh!

seanmuirhead Sep 29, 2025

Uh oh!

longcw Sep 30, 2025

Uh oh!

seanmuirhead Sep 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Cartesia] Use up-to-date opts in _sentence_stream_task #3500

Are you sure you want to change the base?

[Cartesia] Use up-to-date opts in _sentence_stream_task #3500

Uh oh!

Conversation

seanmuirhead commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Desired Behavior

Actual Behavior

Approach

Uh oh!

longcw Sep 25, 2025

Choose a reason for hiding this comment

Uh oh!

seanmuirhead Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

longcw Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

seanmuirhead Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

longcw Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

seanmuirhead Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

seanmuirhead commented Sep 25, 2025 •

edited

Loading