Voice Identification

This project is to better understand how to identify a user by their speech and further to understand what they are saying. It will be tackled in two parts:

Speaker Identification
Speech to Text Recognition

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Prerequisites

The following dependencies are required for this project to work. The commands given below are how to install each one.

Pip Installation:

$ sudo apt-get install python-pip python-dev --yes
$ pip install --upgrade pip --user
$ pip install --upgrade setuptools --user

SciPy Stack Installation:

$ sudo apt-get install python-numpy python-scipy python-skimage python-matplotlib ipython ipython-notebook python-pandas python-sympy python-nose --yes

TensorFlow Installation (CPU only):

$ pip install tensorflow --user

Seaborn, Pandas Installation:

$ pip install seaborn pandas --user

Jupyter Notebook Installation:

$ pip install jupyter --user

TFLearn Installation:

$ pip install tflearn --user

Librosa Installation:

$ pip install librosa --user

Pydub Installation:

$ pip install pydub --user

pyAudioAnalysis Installation:

$ pip install numpy matplotlib scipy sklearn hmmlearn simplejson eyed3 pydub --user
$ sudo apt-get install python-tk --yes
$ git clone https://github.com/tyiannak/pyAudioAnalysis.git

Python Audio Tools Installation:

$ wget https://downloads.sourceforge.net/project/audiotools/audiotools/2.16/audiotools.2.16.tar.gz?r=https%3A%2F%2Fsourceforge.net%2Fprojects%2Faudiotools%2Ffiles%2F&ts=1490043434&use_mirror=dronedata
$ mv audiotools.2.16.tar.gz\?r\=https%3A%2F%2Fsourceforge.net%2Fprojects%2Faudiotools%2Ffiles%2F audiotools.2.16.tar.gz
$ tar -xzvf audiotools.2.16.tar.gz
$ sudo apt-get install libcdio-dev libcdio-paranoia-dev
$ cd audiotools-2.16/
$ sudo make install

Test Installation with:

$ audiotools-config

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
audio_samples		audio_samples
data_prep		data_prep
python_scripts		python_scripts
DANet.ipynb		DANet.ipynb
IndependentComponentAnalysis.ipynb		IndependentComponentAnalysis.ipynb
KerasSpeakerIdentification.ipynb		KerasSpeakerIdentification.ipynb
MFCC Step By Step.ipynb		MFCC Step By Step.ipynb
MixedAudioAnalysis.ipynb		MixedAudioAnalysis.ipynb
OCISpeakerRecognition.ipynb		OCISpeakerRecognition.ipynb
Pandas Graphing.ipynb		Pandas Graphing.ipynb
README.md		README.md
SimpleSpeechRecognition.ipynb		SimpleSpeechRecognition.ipynb
SpeakerRecognition.ipynb		SpeakerRecognition.ipynb
demo.py		demo.py
intro_training.log		intro_training.log
oci_training.log		oci_training.log
pannous_speech_data.py		pannous_speech_data.py
speaker10_training.log		speaker10_training.log
speaker250_training.log		speaker250_training.log
training.log		training.log

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Identification

Getting Started

Prerequisites

Pip Installation:

SciPy Stack Installation:

TensorFlow Installation (CPU only):

Seaborn, Pandas Installation:

Jupyter Notebook Installation:

TFLearn Installation:

Librosa Installation:

Pydub Installation:

pyAudioAnalysis Installation:

Python Audio Tools Installation:

About

Releases

Packages

Languages

AKBoles/Voice-Identification

Folders and files

Latest commit

History

Repository files navigation

Voice Identification

Getting Started

Prerequisites

Pip Installation:

SciPy Stack Installation:

TensorFlow Installation (CPU only):

Seaborn, Pandas Installation:

Jupyter Notebook Installation:

TFLearn Installation:

Librosa Installation:

Pydub Installation:

pyAudioAnalysis Installation:

Python Audio Tools Installation:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages