ChirpNet - A chirp detection algorithm built on top of a convolutional neural network

The previous chirp detection algorithm was based entirely on manual extraction of multiple features across time and space, where anomalies on a all features at the same time were chirps. This approach uses the sum spectrogram across all electrodes on an electrode grid to detect chirps from the spectrogram images. The core of this algoritm is a simple convolutional neural network that is trained to discriminate chirps using a simulated dataset. The detected chirps are then sorted on both the time and frequency dimension according to the models chirp probability.

What are chirps?

Chirps are brief (20-200 ms) upward-excursions of the frequency of the electrid organ discharge (EOD) of many wave-type electric fish. The example below shows a simulation of the EODs of multiple fish that each chirp 50 times at random points in time. Every black line is a frequency band of a single fish. Each black tick is the time point a chirp is simulated. The additional frequency bands are harmonics.

How can we detect them?

The main problem of chirp detection is, that chirps are too fast to resolve the temporal evolution in frequency, while maintaining a frequency resolution to distinguish individual fish on a spectrogram. A spectrogram of a chirp with sufficient frequency resolution does not capture a chirp well. If there is just a single fish in the recording, we could just filter the recording and compute an instantaneous frequency, but once there are multiple fish, the only way to separate them is by spectral analyses.

So the kind of spectrogram we need is a trade-off between the temporal and frequency resolution. We already extracted the bands of the EOD baseline frequency for each fish using a spectrogam with a frequency resolution of 0.5 Hz in the wavetracker project. Most chirps are invisible on a spectrogram with this resolution. I currently use a frequency resolution of 6 Hz with a window overlap of .99.

On these spectrograms, we can still see the "ghost" of a chirp: The chirp might not be clearly visible in its temporal evolution, but there is a blurred region where the frequency briefly peaks. But these regions last up to magnitudes longer than a real chirp and come in many shaped and forms, depending on the spectrogram resolution and parameters such as chirp duration, contrast, frequency, etc. The following image contains just a few examples from the current dataset. Each window is fixed to a frequency range of 400 Hz and a time of 240 ms.

In this project, I will build a simulated dataset using many chirp parameters and will then try to train a simple convolutional neural network as a binary image classifier to detect these "ghosts" of chirps on spectrogram images.

With the current synthetic dataset (n=15000), I reach a discrimination performance of 98%. But as soon as the frequency traces of chirping fish get close, the current version of the detector falsely assings the same chirp to multiple fish. The plot below illustrated the current state on real data.

The black markers are the points were the detector found a chirp. So what the current implementation solves, is reliable detection (on simulated data) but assignment is still an issue. When frequency bands are close to each other, one chirp is often detected on two frequency bands. This is currently solved by only taking the chirp with the higthest probability in a given time window. The downside is, that this makes it impossible that the detector finds chirps that happen simultaneously in two fish.

Another major issue is noise. I train the detector on artificial data and there is a limit to the variability of the noise I can easily simulate. So the detector is not very robust to noise. I will try to solve this by adding real noise to the training data. The following shows the detectors performance when the amplitude of the fish EODs approaches the noise level. Additionally, just increasing the lower cutoff in the decibel transformation of the power spectrogram solved many false positive detections. Additionally detecting strong vertical noise bands by thresholding and skipping those in the detection loop works well. Ultimately however, this kind of noise should be included in the training dataset. The plot below shows one of these noise bands marked in red.

UPDATE: The chirps that are falsely detected twice for different fish can be sorted by the probability the network computes for each chirp. Simply only accepting the chirp with the highest probability in a given time window (currently 20 ms) completely resolves the issue of duplicates on the current test snippet.

Issues

A chirp only lasts for 20-200 ms but the anomaly it introduces on a spectrogram with sufficient frequency resolution lasts up to a second.
- Note: Chirps are often further apart than that and the current implementation detects them well even if they are close. This is only results in issues when the exact timing of a chirp is important and the chirp rate is high.
The classifier might be able to detect chirps well, but assigning them to the correct emitter is a seperate problem.
- Note: Here I could borrow methods from the previous chirp detector, that was good at assignment but not so good with detection.
- Current solution: If the a multiple chirps are detected simultaneously for multiple fish, discarding all chirps except for the one with the highest class probability is sufficient for now to correctly assing chirps. This of course biases the detector to not beeing able to detect simultaneous chirps. So this is not fully solved.
Understand why detection of real data is completely broken after switching to pytorch gpu accelerated spectrograms. Detection of fake data still works well. There is probably a processing step I either duplicated or left out somewhere. Need to find the time to dig in to this. Before switching, detection worked flawlessly. But had to switch to try out larger datasets.
- NOTE: Because the pytorch image interpolation function produces different results than opencv.

How it works

The chirp detector is a convolutional neural network that is trained to discriminate chirps from non-chirps. The training dataset is generated by simulating the EODs of multiple fish and then adding chirps to the EODs. The chirps are simulated by adding a frequency modulation to the EODs. The chirp parameters are varied to create a large dataset. The chirp detection algorithm is then trained on this dataset. The trained model is then used to detect chirps on spectrograms of real data. The detection loop on a real dataset is as follows:

Extract a 10s window from the raw recording
Bandpass filter the signal on all electrodes
Compute the spectrogram of the filtered signal on all electrodes
Sum the spectrograms for all electrodes and transform to decibel scale
Iterate over each frequency track of the spectrogram and extract a 200ms window
Feed the window to the CNN which determines if it is a chirp or not and saves its chirp probability
If the loop is finished, group all chirps that are detected within in same 20ms window for each fish seperately. This is usually a single chirp.
Group chirps across the fish that appear within a 20ms window. This is usually one chirp beein detected twice. We choose the one with the higher chirp probability computed by the CNN.
Append the chirps for the 10s spectrogram window to the list of all chirps for the whole recording.
Repeat steps 1-9 for the next 10s window. Save the chirps the dataset when finished.

How to install

This project is currently in early development but you can participate! I purposely build this in a way that should make setup easy on any machine.

Clone the repository

git clone https://github.com/weygoldt/chirpdetector-cnn.git && cd chirpdetector-cnn

Make a virtual environment by your preferred method and activate, e.g.

pyenv virtualenv 3.11.2 chirpcnn 
pyenv local chirpcnn
# or with the built in venv
python -m venv chirpcnn
source .chirpcnn/bin/activate

Install dependencies Two things need to be installed from git to run the simulations. The rest can be installed from the requirements.txt.

pip install git+https://github.com/janscience/audioio.git
pip install git+https://github.com/janscience/thunderfish.git
pip install -r requirements.txt

How to use

The first thing to do is to open the config file config.yaml and check all paths. Both th training and testing datasets will be generated by scripts, just make sure that the paths are where you want them to be and that the directories exist.

This project is currently beeing developed and is not packaged yet. So you will have to run the scripts from the root directory of the project. The following scripts should get you started:

make_training_data.py generates a training dataset based on the parameters in the config file. The config file is well commented and should be self-explanatory.
train_model.py trains a model based on the training dataset and saves the model to the path specified in the config file.
detect_chirps.py detects chirps on a given dataset and saves the results to the path given by a flag. The dataset must be a directory containing the raw recording and the .npy files generated by the wavetracker. To run the detection on a specific dataset, you can supply a path like this: ./detect_chirps.py --path /path/to/dataset. The results will be saved to the path specified in the config file.

To see what is going on there are two plotting snippets that are commented out in the detection script. You can uncomment them to see the spectrograms and the detected chirps. The first one shows the snippets right before they enter the classifier and the second one shows a summary for each window.

To do

Project log

2023/05/09: Added performance metrics to the detector and created a benchmark dataset. Reach 90% precision and 85% recall. Depending on the complexity of the simulated benchmark dataset, the F1 score reaches up to 91 percent. But we can still improve upon that by fine tuning the chirp simulations and creating a more diverse training dataset for the no-chirp class.
2023/05/08: Succesfull tests with training on large hybrid dataset. New dataset combined with better data normalization alleviated issues with detections during amplitude drops. Vertical white noise bands with the same duration as chirps are still a problem. Explicitly added them to the training dataset now. Awaiting how this changes performance.
2023/05/06: Succesfull tests with new architecture on real data. Added k-fold crossvalidation to the training loop. Working on generating hybrid training datasets by combining simulations with real data.
2023/05/05: Succesfully switched to a deeper architecture specifically designed for audio deep learning.
2023/04/28: With some denoising and thresholding the minimum power of the frequency band I got the detector performance quite hight on real data. I cannot quantify it yet but the false positives are decreased to approximately 10%.
2023/04/21: On-the-fly spectrogram computation and subsequent chirp detection works. No need to compute extremely large spectrograms before hand anymore. Still some work to do with noise being classified as chirps. But works well in clean windows!
2023/04/14: Probably solved the issue that the same chirp is detected twice for two fish. I just take group chirps that are less than 20 ms apart and use only the one with the highest probability reported by the model and discard the rest. Even fancier implementations could use things like the dip in the baseline envelope during a chirp to determine to which fish the chirp truly belongs to.
2023/04/13: First time all chirps are correctly assigned on the real data snippet. Decraesed frequency resolution of the training dataset and made windows narrower.
2023/04/12: First semi-successfull run on a snippet of real data.
2023/04/09: First successfull run of the detector on synthetic data.

Name		Name	Last commit message	Last commit date
Latest commit History 299 Commits
assets		assets
chirpdetector-cnn		chirpdetector-cnn
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ChirpNet - A chirp detection algorithm built on top of a convolutional neural network

What are chirps?

How can we detect them?

Issues

How it works

How to install

How to use

To do

Project log

About

Releases

Packages

Contributors 2

Languages

weygoldt/cnn-chirpdetector

Folders and files

Latest commit

History

Repository files navigation

ChirpNet - A chirp detection algorithm built on top of a convolutional neural network

What are chirps?

How can we detect them?

Issues

How it works

How to install

How to use

To do

Project log

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages