GitHub - apoorvnandan/speech-recognition-primer: This repository contains code for a tutorial on end to end automatic speech recognition.

This repository contains code for my blog post

Note: This code uses tensorflow 2.0

Files present:
bare_bones_asr.py:

code for the neural network explained in the blog
code for loading sample.wav file and creating its spectrogram
code for training given neural network with sample.wav and its transcript as input using CTC loss
code for using trained neural network to predict on an input spectrogram

prefix_beam_search.py:

code for prefix beam search as explained in the blog
you can import the function from this file directly and use it on your ctc output

from prefix_beam_search import prefix_beam_search
example_ctc_output = None  # get your ctc output from the network
alphabet = list(ascii_lowercase) + [space_token, end_token, blank_token]  # get your character vocab
lm = None  # get your language model function
print(prefix_beam_search(example_ctc, alphabet, blank_token, end_token, space_token, lm=lm))

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
bare_bones_asr.py		bare_bones_asr.py
prefix_beam_search.py		prefix_beam_search.py
sample.wav		sample.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

apoorvnandan/speech-recognition-primer

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages