- This is the official repository for the ICML 2024 paper "AND: Audio Network Dissection for Interpreting Deep Acoustic Models".
- AND is the first framework to describe the roles of hidden neurons in audio networks.
- AND provides both open-vocabulary concepts and generative natural-language explanations of acoustic neurons based on LLMs, and can seamlessly adopt newer LLMs in the future.
- AND showcases a potential use case in audio machine unlearning by conducting concept-specific pruning.
Below is an overview of AND, which consists of three major modules (A-C) that identify neuron concepts in audio networks. For more information about AND, please check out our project website.
pip install -r requirements.txt
huggingface-cli login
python -m spacy download en_core_web_lg
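To quickly confirm the environment is set up correctly, a minimal sanity check such as the following can help (this is only an illustrative check, not part of the repository's scripts):

```bash
# Verify the spaCy model installed above loads, and that the Hugging Face login succeeded.
python3 -c "import spacy; spacy.load('en_core_web_lg'); print('spaCy model OK')"
huggingface-cli whoami
```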
The naming conventions for layers in AST and BEATs differ; please refer to args.py.
python3 save_activations.py -tn <target_model_name> -tl <target_layers>
python3 save_discriminative_sample.py -tn <target_model_name> -tl <target_layers>
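For example, a typical invocation might look like the following (the model identifier and layer name are illustrative placeholders; the values actually accepted by -tn and -tl are defined in args.py):

```bash
# Illustrative example: save activations and discriminative samples for one model/layer.
python3 save_activations.py -tn beats -tl fc
python3 save_discriminative_sample.py -tn beats -tl fc
```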
- Download the BEATs checkpoints from https://drive.google.com/drive/folders/1FBIOj0ZMyPMbVFeQkInX-DYi0XBozOYS?usp=drive_link and change the path in data_utils.py.
- Download the processed files and place them into the corresponding directories. As preprocessing takes time, we provide the processed files in the Google Drive links below, so you can skip the Summary Calibration and Open-concept Identification parts and run the subsequent experiments directly:
summaries: https://drive.google.com/drive/folders/1eTF-X1nxhuhsWeys4kxbvzgLpavVmvrj?usp=drive_link
prediction: https://drive.google.com/drive/folders/11kfcToeiNbltPESzVvU5IVQpizdkxbwK?usp=drive_link
cd closed_concept_identification
python3 DB.py -tn <target_model_name> -tl fc
python3 TAB.py -tn <target_model_name> -tl fc
python3 ICL.py -tn <target_model_name> -tl fc
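As a concrete example (the model name below is an illustrative placeholder; see args.py for the accepted values):

```bash
# Run the three closed-concept identification methods on the fc layer of one model.
python3 DB.py -tn beats -tl fc
python3 TAB.py -tn beats -tl fc
python3 ICL.py -tn beats -tl fc
```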
cd summaries
python3 generate_summary.py -tl <target_layers> -dt highly
python3 generate_summary.py -tl <target_layers> -dt lowly
python3 summary_calibration.py -tn <target_model_name>
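A concrete run of this stage might look like the following (again, the model and layer identifiers are placeholders; -dt toggles between the highly and lowly settings shown above):

```bash
# Generate summaries for both -dt settings, then calibrate them for the target model.
python3 generate_summary.py -tl fc -dt highly
python3 generate_summary.py -tl fc -dt lowly
python3 summary_calibration.py -tn beats
```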
cd pruning
python3 close_neuron_class.py -tn <target_model_name> -tl <target_layers> -ps <pruning_strategy>
python3 evaluation_confidence.py -tn <target_model_name> -tl <target_layers> -ps <pruning_strategy>
python3 close_neuron_class.py -tn <target_model_name> -tl <target_layers> -ps <pruning_strategy> -pc <list_of_pruned_concepts>
python3 evaluation_confidence.py -tn <target_model_name> -tl <target_layers> -ps <pruning_strategy> -pc <list_of_pruned_concepts>
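For example, to prune neurons tied to specific concepts and then measure the confidence drop, an invocation might look like the following (the model name and concept names are illustrative placeholders, and the exact formats expected by -ps and -pc are defined in the scripts' argument parsers):

```bash
# Prune neurons associated with the listed concepts, then evaluate confidence changes.
python3 close_neuron_class.py -tn beats -tl fc -ps <pruning_strategy> -pc "dog barking" "siren"
python3 evaluation_confidence.py -tn beats -tl fc -ps <pruning_strategy> -pc "dog barking" "siren"
```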
cd interpretability
python3 clustering.py
python3 plot-uninterpretable-neuron.py -tn <target_model_name>
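For instance (the model name is again an illustrative placeholder):

```bash
# Cluster neuron summaries, then plot the uninterpretable neurons for one model.
python3 clustering.py
python3 plot-uninterpretable-neuron.py -tn beats
```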
For the setting details and implications of each table/figure, please refer to our paper. Due to the randomness of large language models, the figures may differ slightly from those presented in our paper; this does not affect our findings.
- Experiments in experiments/fig3_9_adjective_distribution.ipynb, corresponding to Fig. 3 in Sec. 3.5
- Experiments in experiments/table4_confidence_drop_after_pruning.ipynb, corresponding to Table 2 in Sec. 4.1
- Experiments in experiments/table4_confidence_drop_after_pruning.ipynb, corresponding to Table 4 in Sec. 4.3
- Experiments in experiments/fig10_confidence_drop_after_pruning.ipynb, corresponding to Fig. 10 in Appendix F
- Experiments in experiments/fig4_milan_experiments.ipynb, corresponding to Fig. 4 in Sec. 4.4
- Experiments in experiments/fig5_adj_per_layer.ipynb, corresponding to Fig. 5 in Sec. 4.4
- Experiments in experiments/fig6_11_12_13_14_uninterpretable_neurons.ipynb, corresponding to Fig. 6 in Sec. 4.5
Note that the numbers might differ slightly due to the randomness of the LLM's output or code refactoring, but the changes should be small.
- CLIP-Dissect: https://github.com/Trustworthy-ML-Lab/CLIP-dissect
- SALMONN: https://github.com/bytedance/SALMONN
- Llama-2: https://huggingface.co/meta-llama/Llama-2-13b-chat-hf
- CLAP: https://github.com/LAION-AI/CLAP
- BEATs: https://github.com/microsoft/unilm/tree/master/beats
T.-Y. Wu, Y.-X. Lin, and T.-W. Weng, "AND: Audio Network Dissection for Interpreting Deep Acoustic Models", ICML 2024.
@inproceedings{AND,
title={AND: Audio Network Dissection for Interpreting Deep Acoustic Models},
author={Tung-Yu Wu and Yu-Xiang Lin and Tsui-Wei Weng},
booktitle={Proceedings of International Conference on Machine Learning (ICML)},
year={2024}
}