Enable stateful decoding of RNNT over multiple transcribe calls #3037

titu1994 · 2021-10-21T23:55:29Z

Changelog

Enable the transcribe method to store hypotheses and reuse them for stateful multi-step predictions.
RNNT transcribe now takes partial_hypothesis as an optional input, supplied from the hypotheses returned by a previous transcribe call.
Add abstract batch_concat_states and batch_copy_states support for RNNT models to abstract away state management.
Force all greedy and beam inference to pack results into Hypothesis objects. By default, they now contain:
- y_sequence: tensor of integer token ids
- dec_state: CPU tuple of decoder states
- score: log-probability score of the current y_sequence
- (optional) alignments: 2D dangling (ragged) matrix of token alignments
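To make the fields above concrete, here is a minimal sketch of a hypothesis container plus two state-batching helpers. It is an illustration only: `SketchHypothesis`, `concat_states`, and `select_state` are hypothetical names, plain Python lists stand in for tensors, and the helpers are not the signatures of `batch_concat_states` / `batch_copy_states` added by this PR.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumption: a decoder state is modelled as one list of floats per layer.
State = Tuple[List[float], ...]


@dataclass
class SketchHypothesis:
    """Illustrative container mirroring the listed fields (not NeMo's class)."""
    score: float                                  # log-probability of y_sequence
    y_sequence: List[int]                         # token ids (a tensor in the real code)
    dec_state: Optional[State] = None             # decoder state, kept off the GPU
    alignments: Optional[List[List[int]]] = None  # optional ragged per-frame token lists


def concat_states(states: List[State]) -> Tuple[List[List[float]], ...]:
    """Stack per-sample states into one batched state, layer by layer."""
    num_layers = len(states[0])
    return tuple([list(s[layer]) for s in states] for layer in range(num_layers))


def select_state(batch_state: Tuple[List[List[float]], ...], idx: int) -> State:
    """Copy the state of sample `idx` out of a batched state."""
    return tuple(list(layer[idx]) for layer in batch_state)


h1 = SketchHypothesis(score=-1.5, y_sequence=[3, 7], dec_state=([0.1, 0.2], [0.3, 0.4]))
h2 = SketchHypothesis(score=-0.5, y_sequence=[9], dec_state=([0.5, 0.6], [0.7, 0.8]))
batched = concat_states([h1.dec_state, h2.dec_state])
```

Storing the state on each hypothesis is what lets a later call rebuild a batched decoder state from whichever hypotheses the caller passes back in.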

Example

from nemo.collections.asr.models import EncDecRNNTBPEModel

model = EncDecRNNTBPEModel.restore_from(...)

# Switch the model to greedy decoding
decoding = model.cfg.decoding
decoding.strategy = 'greedy'
model.change_decoding_strategy(decoding)

# audio_paths and continuation_paths are placeholders for lists of audio file paths

# First transcription, stateless
part_hyp, _ = model.transcribe(audio_paths, batch_size=4, return_hypotheses=True)

# Second transcription, restored state
hyp, _ = model.transcribe(continuation_paths, return_hypotheses=True, partial_hypothesis=part_hyp)
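The effect of passing the previous hypothesis back in can be illustrated with a toy decoder that has nothing to do with NeMo. In this sketch, `toy_decode` is a hypothetical function whose "decoder state" is a running sum; resuming from the stored state reproduces the result of decoding the whole input at once, while a stateless second call does not.

```python
from typing import List, Optional, Tuple


def toy_decode(frames: List[int],
               partial: Optional[Tuple[List[int], int]] = None) -> Tuple[List[int], int]:
    """Toy stateful decoder: emits the state as a token whenever it is even."""
    # Resume from a previous (tokens, state) pair if one is given, else start fresh.
    tokens, state = (list(partial[0]), partial[1]) if partial else ([], 0)
    for f in frames:
        state = (state + f) % 10  # state depends on every frame seen so far
        if state % 2 == 0:
            tokens.append(state)
    return tokens, state


audio = [3, 5, 1, 4, 7, 2]

full, _ = toy_decode(audio)                        # decode everything in one call
part = toy_decode(audio[:3])                       # first call, stateless
resumed, _ = toy_decode(audio[3:], partial=part)   # second call, restored state
stateless, _ = toy_decode(audio[3:])               # second call without the state
```

Here `resumed` matches `full`, whereas `stateless` decodes the continuation as if it were a fresh utterance and produces different tokens.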

Signed-off-by: smajumdar <[email protected]>

lgtm-com · 2021-10-22T00:09:12Z

This pull request introduces 3 alerts when merging d7598fc into 1f36f32 - view on LGTM.com

new alerts:

2 for Unused local variable
1 for Unused import

lgtm-com · 2021-10-22T00:22:04Z

This pull request introduces 3 alerts when merging 6318e3f into 1f36f32 - view on LGTM.com

new alerts:

2 for Unused local variable
1 for Unused import

lgtm-com · 2021-10-24T06:06:49Z

This pull request introduces 3 alerts when merging 4b235eb into 9405273 - view on LGTM.com

new alerts:

2 for Unused local variable
1 for Unused import

jbalam-nv

LGTM

…IA#3037)

* Start on stateful external decoding
  Signed-off-by: smajumdar <[email protected]>
* Prepare connectors
  Signed-off-by: smajumdar <[email protected]>
* Refactor greedy sample decoding to use Hypothesis
  Signed-off-by: smajumdar <[email protected]>
* Refactor greedy batch first mode for Hypothesis
  Signed-off-by: smajumdar <[email protected]>
* Update second case of greedy batch decoding
  Signed-off-by: smajumdar <[email protected]>
* Start stateful decoding
  Signed-off-by: smajumdar <[email protected]>
* Add guards for stateful decoding
  Signed-off-by: smajumdar <[email protected]>
* Fix state management when no states is provided
  Signed-off-by: smajumdar <[email protected]>
* Create
  Signed-off-by: smajumdar <[email protected]>
* Correct logging
  Signed-off-by: smajumdar <[email protected]>
* Begin support for stateful beam decoding
  Signed-off-by: smajumdar <[email protected]>
* Update streaming utils with method 2
  Signed-off-by: smajumdar <[email protected]>
* Initiate stateful beam implementation
  Signed-off-by: smajumdar <[email protected]>
* Reset changes
  Signed-off-by: smajumdar <[email protected]>
* Fix style
  Signed-off-by: smajumdar <[email protected]>

Co-authored-by: Jagadeesh Balam <[email protected]>

* Start on stateful external decoding
  Signed-off-by: smajumdar <[email protected]>
* Prepare connectors
  Signed-off-by: smajumdar <[email protected]>
* Refactor greedy sample decoding to use Hypothesis
  Signed-off-by: smajumdar <[email protected]>
* Refactor greedy batch first mode for Hypothesis
  Signed-off-by: smajumdar <[email protected]>
* Update second case of greedy batch decoding
  Signed-off-by: smajumdar <[email protected]>
* Start stateful decoding
  Signed-off-by: smajumdar <[email protected]>
* Add guards for stateful decoding
  Signed-off-by: smajumdar <[email protected]>
* Fix state management when no states is provided
  Signed-off-by: smajumdar <[email protected]>
* Create
  Signed-off-by: smajumdar <[email protected]>
* Correct logging
  Signed-off-by: smajumdar <[email protected]>
* Begin support for stateful beam decoding
  Signed-off-by: smajumdar <[email protected]>
* Update streaming utils with method 2
  Signed-off-by: smajumdar <[email protected]>
* Initiate stateful beam implementation
  Signed-off-by: smajumdar <[email protected]>
* Reset changes
  Signed-off-by: smajumdar <[email protected]>
* Fix style
  Signed-off-by: smajumdar <[email protected]>

Co-authored-by: Jagadeesh Balam <[email protected]>
Signed-off-by: PeganovAnton <[email protected]>


titu1994 added 15 commits October 21, 2021 16:52

Start on stateful external decoding

1078aef

Signed-off-by: smajumdar <[email protected]>

Prepare connectors

134ede8

Signed-off-by: smajumdar <[email protected]>

Refactor greedy sample decoding to use Hypothesis

74b8437

Signed-off-by: smajumdar <[email protected]>

Refactor greedy batch first mode for Hypothesis

4679446

Signed-off-by: smajumdar <[email protected]>

Update second case of greedy batch decoding

bbae089

Signed-off-by: smajumdar <[email protected]>

Start stateful decoding

ab583b9

Signed-off-by: smajumdar <[email protected]>

Add guards for stateful decoding

0448875

Signed-off-by: smajumdar <[email protected]>

Fix state management when no states is provided

2dab5e6

Signed-off-by: smajumdar <[email protected]>

Create

173a70e

Signed-off-by: smajumdar <[email protected]>

Correct logging

7762277

Signed-off-by: smajumdar <[email protected]>

Begin support for stateful beam decoding

a51550e

Signed-off-by: smajumdar <[email protected]>

Update streaming utils with method 2

085b738

Signed-off-by: smajumdar <[email protected]>

Initiate stateful beam implementation

00b473b

Signed-off-by: smajumdar <[email protected]>

Reset changes

d7598fc

Signed-off-by: smajumdar <[email protected]>

Fix style

6318e3f

Signed-off-by: smajumdar <[email protected]>

titu1994 requested review from VahidooX and jbalam-nv and removed request for VahidooX October 22, 2021 00:10

Merge branch 'main' into rnnt_stateful_decoding

4b235eb

jbalam-nv approved these changes Oct 25, 2021

jbalam-nv merged commit 2439704 into NVIDIA:main Oct 25, 2021

titu1994 deleted the rnnt_stateful_decoding branch October 25, 2021 15:57
