Remove transcription mode #219

Holzhaus · 2014-10-09T14:08:04Z

If you look at jasper.py and client/mic.py, we're using the Mic class like this:

class Mic:
    def__init__(self, speaker, passive_stt_engine, active_stt_engine):

So we pass in two different STT Engine instances:

An STT Engine instance for passive listen
An STT Engine Instance for active listen

But PocketsphinxSTT takes up to 3 lm/dict pairs:

A pair for passive listen
A pair for active listen
3 A pair for active listen (musicmode)

This is a lot duplication, because in jasper.py, we're creating two separate SST Engine Instances for active listen and passive listen, so that the active listen STT Engine instance will only use the second lm/dict pair and the passive listen STT instance will only use the first pair.
In MusicMode, a third STT instance will be created that only uses the third pair.

Given the case that someone wants to write a new module that also has the ability to start a mode like the MusicMode, he'd either have to hijack the music lm/dict pair, or he'd need to add a new mode to STT Engines and change the PocketsphinxSTTEngine code.

Thus, I'd like to simplify the STT engine dramatically by removing two of the three dict pairs (or rather Vocabulary Instances) from the PocketsphinxSTT engine and the mode parameter in transcribe accordingly.

Also, there's no need to case about custom engine-specific settings, because the get_engine() classmethod takes care of that for you.

This vastly simplifies how STT engines are instantiated. Basically, you only need these steps:

import stt

engine = stt.get_engine_by_slug('sphinx')
# Alternative
engine = stt.get_engine_by_slug('google')

# Convenience method: Instance for passive listen
stt_instance_passive = engine.get_passive_instance() 
# Convenience method: Instance for active listen
stt_instance_active = engine.get_active_instance()

# Generic method (e.g. used by MusicMode (MPDControl.py)
stt_instance_custom = engine.get_instance('my_vocab', ['SOME', 'PHRASES', 'TO', 'RECOGNIZE'])

# Now create a mic:
mic1 = Mic(tts_instance, stt_instance_passive, stt_instance_active)

# A module like musicmode now can simply do this:
mic2 = Mic(mic1.speaker, mic.passive_stt_engine, stt_instance_custom)

So what happens inside get_instance() (which is also called by the convenience methods get_passive_instance()/get_active_instance())?

the configuration for this STT engine is retrieved
if this STT engine needs a vocabulary ('sphinx' does, but 'google' does not):
- a vocabulary will be initialized with the name my_vocab.
- if this vocabulary does not exist yet or if it doesn't match all phrases, it'll be (re)compiled
an STT engine instance is created with config (and vocabulary if neccessary) and returned

Because every STT engine only has one vocabulary, the transcribe() method becomes less complex, because it doesn't need the mode argument anymore.

coveralls · 2014-10-13T12:10:49Z

Coverage increased (+2.19%) when pulling 7155c0e on Holzhaus:remove-transcription-mode into 32218ec on jasperproject:master.

Holzhaus · 2014-10-15T16:01:18Z

@crm416 @shbhrsaha Can you review this, please?

shbhrsaha · 2014-10-16T15:07:43Z

Looks slick. I'll give this a test this weekend and report back!

update: haven't forgotten about this! will post soon

shbhrsaha · 2014-11-05T06:30:05Z

This works great! Thank you for putting up with the wait. Should be good for merge.

coveralls · 2014-11-05T13:39:31Z

Coverage increased (+1.93%) when pulling 8bac1e6 on Holzhaus:remove-transcription-mode into c44b772 on jasperproject:master.

Remove transcription mode

Holzhaus added the enhancement label Oct 9, 2014

Holzhaus force-pushed the remove-transcription-mode branch from d39ccbe to 27f2cb5 Compare October 13, 2014 12:06

Holzhaus added the needstesting label Oct 13, 2014

Holzhaus force-pushed the remove-transcription-mode branch from 27f2cb5 to 7155c0e Compare October 13, 2014 12:08

Holzhaus added a commit to Holzhaus/jasper-client that referenced this pull request Oct 20, 2014

Adapt STT engine API from PR jasperproject#219 in tts.py

335cd90

shbhrsaha mentioned this pull request Oct 29, 2014

Add config options for TTS engine (+ MaryTTS support) #229

Merged

Remove TranscriptionMode and improve STT engine initialisation

8bac1e6

Holzhaus force-pushed the remove-transcription-mode branch from 7155c0e to 8bac1e6 Compare November 5, 2014 13:37

Holzhaus added a commit that referenced this pull request Nov 5, 2014

Merge pull request #219 from Holzhaus/remove-transcription-mode

56f4433

Remove transcription mode

Holzhaus merged commit 56f4433 into jasperproject:master Nov 5, 2014

Holzhaus deleted the remove-transcription-mode branch December 2, 2014 13:06

Holzhaus removed the needstesting label Jan 7, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove transcription mode #219

Remove transcription mode #219

Holzhaus commented Oct 9, 2014

coveralls commented Oct 13, 2014

Holzhaus commented Oct 15, 2014

shbhrsaha commented Oct 16, 2014

shbhrsaha commented Nov 5, 2014

coveralls commented Nov 5, 2014

Remove transcription mode #219

Remove transcription mode #219

Conversation

Holzhaus commented Oct 9, 2014

coveralls commented Oct 13, 2014

Holzhaus commented Oct 15, 2014

shbhrsaha commented Oct 16, 2014

shbhrsaha commented Nov 5, 2014

coveralls commented Nov 5, 2014