On data formats and the way to ge them for dataLoader.py #9

konts6102 · 2016-01-21T07:29:19Z

First of all, I am impressed this exciting source and appreciate all contributors.

Question 1. Is it possible for kaldi scripts to create a set of input files with which ctc-stanford training script can run?

I think that all input files for an execution are: key#.txt, feat.bin and alis#.txt. Examples of these files are in the URL: http://deeplearning.stanford.edu/lexfree/timit/

Question 2. If the answer of the previous question is NO, then how can I create these three kinds of files from my wav files and transcripts of them?

I know methods and scripts to extract the mfcc and log mel filter bank features as described in the URL https://github.com/jameslyons/python_speech_features

I think that a set of feature vectors of a wav file can be an input for dataLoarder.py. But I am not sure how the rest of three files can be obtained.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

On data formats and the way to ge them for dataLoader.py #9

On data formats and the way to ge them for dataLoader.py #9

konts6102 commented Jan 21, 2016

On data formats and the way to ge them for dataLoader.py #9

On data formats and the way to ge them for dataLoader.py #9

Comments

konts6102 commented Jan 21, 2016