This project attempts to classify the emotion in a given audio file as one of "Calm", "Happy", "Fearful", or "Disgust". It is documented in accordance with the Machine Learning Life Cycle, which consists of 7 major stages in a machine learning project. Currently, the model's accuracy is around 70%. It recognises "Calm" with the highest accuracy but has difficulty recognising "Fearful".
The RAVDESS dataset was used in this project. It contains a total of 8 emotions, from which I picked the 4 on which the current model has been trained. The model extracted speech features such as MFCC, Chroma, Mel spectrogram, Spectral Contrast, and Zero Crossing Rate. It was found that Spectral Contrast and Zero Crossing Rate made little difference to the model's accuracy. Since this is a classification problem, an MLPClassifier was used, and its hyperparameters were tuned with GridSearchCV. I plan to try other models on the dataset, such as Random Forest, to see whether the accuracy improves.
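The classification stage described above can be sketched as follows. This is a minimal, hedged example, not the project's actual training script: synthetic feature vectors stand in for the features extracted from RAVDESS clips (which in practice would come from an audio library such as librosa), and the hyperparameter grid is illustrative.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)
EMOTIONS = ["calm", "happy", "fearful", "disgust"]

# Synthetic stand-in for extracted features: in the real project each row
# would be a feature vector (e.g. MFCC, Chroma, and Mel values) computed
# from one RAVDESS clip.
n_samples, n_features = 200, 40
X = rng.normal(size=(n_samples, n_features))
y = rng.choice(EMOTIONS, size=n_samples)
# Give each class a slight mean shift so the synthetic data is learnable.
for i, emo in enumerate(EMOTIONS):
    X[y == emo] += i * 0.5

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42)

# Tune MLPClassifier hyperparameters with GridSearchCV, as in the project.
# The grid below is a placeholder, not the grid actually used.
param_grid = {
    "hidden_layer_sizes": [(100,), (200,)],
    "alpha": [1e-4, 1e-2],
}
search = GridSearchCV(
    MLPClassifier(max_iter=500, random_state=42),
    param_grid, cv=3)
search.fit(X_train, y_train)

accuracy = search.score(X_test, y_test)
print(f"best params: {search.best_params_}, test accuracy: {accuracy:.2f}")
```

Swapping in a different estimator (e.g. `RandomForestClassifier`) only requires changing the model and the parameter grid, which makes this a convenient harness for the planned model comparison.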