
Music Classification with a Convolutional Neural Network

This project explores the application of a CNN to audio using 2D Convolutions. This endeavor falls under the science of Music Information Retrieval (MIR), which has some well-known applications in Recommender Systems (Spotify) and Audio Identification (Shazam).
This video gives a brief overview, similar to the README.

Table of Contents

Data
Technology
Convolutional Neural Networks
Results
Conclusion
Reproducing the results



Data

The data comes from the Free Music Archive open-benchmark dataset.
I used the pre-defined "Small" subset, which offers 8000 30-second clips balanced over 8 root genres.

Technology

This project used TensorFlow 2.0/Keras running on GPUs hosted on Amazon Web Services, together with the standard Python data science stack and Librosa for the audio conversion.

Convolutional Neural Networks

CNNs are best known for their state-of-the-art performance on image classification. To achieve this, they use a series of filters that scan the image for features; at each successive layer of the network, more complex features are detected.

Typical CNN architecture (image): https://upload.wikimedia.org/wikipedia/commons/6/63/Typical_cnn.png

The network sees images as arrays of numbers, with each number representing a pixel value.

In order to use this network with audio, the audio must first be converted to an image-like format. The mel spectrogram offers such a format: the numbers in the array represent decibel levels at each time step and frequency.
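As an illustration of that conversion, here is a minimal sketch using Librosa; the file name, n_mels, and hop_length are assumptions for the example, not the settings the project's convert.py actually uses.

```python
# Sketch: load a 30-second clip and compute a decibel-scaled mel spectrogram.
import librosa
import numpy as np

y, sr = librosa.load("clip.mp3", duration=30.0)   # waveform as a 1D float array
mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=128, hop_length=512)
mel_db = librosa.power_to_db(mel, ref=np.max)     # convert power to decibels
print(mel_db.shape)                                # (n_mels, time_steps): an image-like 2D array
```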

Results

Rock vs. Hip-Hop

The first test was to see how the network distinguishes between Rock and Hip-Hop.

Before training the model, the arrays were reduced to 2 principal components and plotted, showing that the genres cluster reasonably well.
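For illustration, that projection could be produced roughly as follows with scikit-learn; the placeholder arrays and the flattening step are assumptions, not the repo's actual plotting code.

```python
# Sketch: flatten each mel spectrogram to a vector and project onto 2 principal components.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

# Placeholder data standing in for the real spectrograms and genre labels.
rng = np.random.default_rng(0)
spectrograms = rng.normal(size=(200, 128, 640))            # (clips, n_mels, time_steps)
labels = np.array(["Rock"] * 100 + ["Hip-Hop"] * 100)

X = spectrograms.reshape(len(spectrograms), -1)            # one flat feature vector per clip
coords = PCA(n_components=2).fit_transform(X)

for genre in np.unique(labels):
    mask = labels == genre
    plt.scatter(coords[mask, 0], coords[mask, 1], label=genre, alpha=0.5)
plt.legend()
plt.title("Clips projected onto 2 principal components")
plt.show()
```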

After training on 800 examples of each genre, the model achieved 94% accuracy on a balanced test set of 200.
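For context, a small 2D-convolution classifier in TensorFlow 2.0/Keras might look like the sketch below; the layer sizes and input shape are illustrative assumptions, and the actual architecture lives in model4.py.

```python
# Sketch of a binary (Rock vs. Hip-Hop) CNN over mel-spectrogram "images".
import tensorflow as tf
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(128, 640, 1)),                 # (n_mels, time_steps, channels)
    layers.Conv2D(16, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(32, (3, 3), activation="relu"),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(1, activation="sigmoid"),             # probability of one genre
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()
```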

Rock vs. Hip-Hop vs. Instrumental

When I introduced the more ambiguous genre of ‘Instrumental’ into the mix, there was more overlap in the PCA plot.

After adding the new tracks to the training set, accuracy dropped to 84%, and the model struggled most with the Instrumental genre.

I listened to the misclassified clips to see what they sounded like.
Since I can't embed the clips in the README, I'll just point out that the instrumental clips do resemble other genres. In one particular example where the model's prediction was 94% Hip-Hop, the "Instrumental" clip contained a sample of a human voice talking over a beat, which very much resembled Hip-Hop.

These broad, subjective labels seem to be hard for the network to learn.

Conclusion

  • High-level metadata can be extracted from an audio signal
  • The CNN filters are able to learn the distinguishing features of broad genre classifications
  • The network can only be as good as our subjective labeling system
  • Looking into the misclassified examples can be very informative about your model and your data

Next Steps

  • Make scripts configurable
  • Continue to add more genres, including lower-level sub-genres from the full dataset
  • Replicate the architecture of a state-of-the-art image classification model
  • Compare to other networks, such as a Conv1D-to-LSTM architecture


Reproducing the results

*Specifically the three-genre model (Rock, Hip-Hop, Instrumental)

Environment

  1. Create the conda environment from linux_environment.yml or mac_environment.yml

Download and convert audio

  1. cd into src/ and run the following from inside the directory
    1. download_small.sh
    2. convert.py [genres separated by space]
      1. $ python convert.py Rock Hip-Hop Instrumental

Run model

  1. From the root directory, run model4.py
    1. This will save the weights from the best epoch (monitoring val_loss), as well as plots of the training accuracy/loss and the confusion matrix, into models/ (see the sketch below)
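For reference, saving the best epoch's weights by monitoring val_loss can be done with a Keras ModelCheckpoint callback along these lines; the file path and the commented-out fit call are assumptions, not model4.py verbatim.

```python
# Sketch: keep only the weights from the epoch with the lowest validation loss.
import tensorflow as tf

checkpoint = tf.keras.callbacks.ModelCheckpoint(
    filepath="models/best_weights.h5",   # hypothetical output path
    monitor="val_loss",
    save_best_only=True,
    save_weights_only=True,
)
# model.fit(X_train, y_train, validation_data=(X_val, y_val),
#           epochs=30, callbacks=[checkpoint])
```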
