- create dataset
reads manual annotations
segments audio files and
computes spectrograms per segment
saves dataset: numpy array: [[segment_id],[spectrogram (n_mels, n_timeframes)], [label_matrix (8*n_timeframes)]]
-
adapt dataset to task
-
split dataset and normalize
-
[OPT] Analysis dataset
- train.py
-
apply_model notebook
-
pos-processing_model_output notebook