Skip to content

A compilation of available audio texture datasets with a standardised format for accessing

Notifications You must be signed in to change notification settings

aaron-dees/audioTextureDatasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

audioTextureDatasets

This repository has two functions;

  1. To act as a database for audio texture datasets with and without control parameters, for use in machine learning models.
  2. To provide useful functionalities for manipulating audio texture datasets for machine learning models.

A repository holding a collection of audio texture datasets for training machine learning model, with a stadardised approach.

Datasets

Audio Texture Datasets

Audio Texture Datasets with control parameters

Functionalities

Curerntly scripts exist for loading data and manipulating in the following ways;

File formatting

Scripts exist for manipulating audio files in a number of ways;

Audio length

Any length of audio can be split into any arbitrary length

Sample Rate

Audio can be re-sampled to;

  • 44kHz
  • 22kHz
  • 16kHz
  • 8kHz

Bit depth

Bit depth of Audio clips can be changed to;

  • 16 bit
  • 32 bit

Audio representations

Scripts exist for manipulating audio files in the time domain into a number of different representations for machine learning;

  • Granulating waveform
  • Spectrograms
  • Mel Frequency Spectrograms
  • Cepstral Coefficients
  • Mel Freq Cepstral Coefficients

About

A compilation of available audio texture datasets with a standardised format for accessing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages