Skip to content

whisper: Support a variant of the whisper pipeline where encoder / decoder are stateful.#1857

Merged
kunal-vaishnavi merged 5 commits into
microsoft:mainfrom
RyanMetcalfeInt8:whisper_stateful_ov_support
Dec 4, 2025
Merged

whisper: Support a variant of the whisper pipeline where encoder / decoder are stateful.#1857
kunal-vaishnavi merged 5 commits into
microsoft:mainfrom
RyanMetcalfeInt8:whisper_stateful_ov_support

Conversation

@RyanMetcalfeInt8
Copy link
Copy Markdown
Contributor

@RyanMetcalfeInt8 RyanMetcalfeInt8 commented Nov 7, 2025

This will mainly be used to support EPCtx-wrapped OpenVINO IR's -- the same set of models that are compatible with OpenVINO GenAI.

In this case, unlike the default pipeline, the encoder doesn't pull in KV cross projection -- it only outputs encoder_hidden_states.

For the decoder portion of the pipeline, the cross & self KV cache tensors are managed internal to the model (attached to the session).

@RyanMetcalfeInt8 RyanMetcalfeInt8 changed the title whisper: Support a variant of the whisper pipeline where encoder / de… whisper: Support a variant of the whisper pipeline where encoder / decoder are stateful. Nov 7, 2025
Comment thread src/models/whisper.cpp Outdated
@RyanMetcalfeInt8 RyanMetcalfeInt8 force-pushed the whisper_stateful_ov_support branch from 9c5203a to fdb7fae Compare November 25, 2025 20:32
Comment thread src/models/whisper.h Outdated
Comment thread src/models/whisper.cpp Outdated
@RyanMetcalfeInt8
Copy link
Copy Markdown
Contributor Author

Thanks @kunal-vaishnavi -- can you trigger a re-run of the checks? I don't think that this MacOS failure is related to this changeset.

@RyanMetcalfeInt8 RyanMetcalfeInt8 force-pushed the whisper_stateful_ov_support branch from a55d04b to acbbd95 Compare December 2, 2025 13:25
@RyanMetcalfeInt8
Copy link
Copy Markdown
Contributor Author

@kunal-vaishnavi -- any insight into why this MacOS build is failing? I also see similar failures in another PR: #1900

I don't think it's related to these changesets, so perhaps there is some instability in MacOS builds currently?

@kunal-vaishnavi
Copy link
Copy Markdown
Contributor

I am still investigating. The CI failure is in the latest commit on the main branch and in an older commit as well.

@RyanMetcalfeInt8 RyanMetcalfeInt8 force-pushed the whisper_stateful_ov_support branch from acbbd95 to 3496b55 Compare December 4, 2025 15:26
@RyanMetcalfeInt8 RyanMetcalfeInt8 force-pushed the whisper_stateful_ov_support branch from 3496b55 to 7633be2 Compare December 4, 2025 16:17
@RyanMetcalfeInt8
Copy link
Copy Markdown
Contributor Author

@kunal-vaishnavi -- I rebased against latest main that included MacOS fixes, and pipeline is passing now.

@kunal-vaishnavi kunal-vaishnavi merged commit 2e62384 into microsoft:main Dec 4, 2025
15 checks passed
kunal-vaishnavi pushed a commit that referenced this pull request Dec 5, 2025
…coder are stateful. (#1857)

This will mainly be used to support EPCtx-wrapped OpenVINO IR's -- the
same set of models that are compatible with OpenVINO GenAI.

In this case, unlike the default pipeline, the encoder doesn't pull in
KV cross projection -- it only outputs encoder_hidden_states.

For the decoder portion of the pipeline, the cross & self KV cache
tensors are managed internal to the model (attached to the session).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants