Skip to content

ICCV 2023 AV4D paper - Audio-visual Sound Separation

Notifications You must be signed in to change notification settings

ali-vosoughi/avsa-sep

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 

Repository files navigation

AVSA-SEP: Audiovisual Scene-Aware Sound Separation

Overview

AVSA-SEP introduces a groundbreaking approach to sound separation in audiovisual contexts, focusing on the challenge of separating sounds that are not directly visible within video frames. This project is based on the research presented at the ICCV 2023 Workshop on AV4D: Visual Learning of Sounds in Spaces, showcasing a method to enhance the understanding of complex audiovisual scenes for improved sound separation.

Paper

For a detailed exploration of our approach and findings, consult our paper:

Authors: Yiyang Su, Ali Vosoughi, Shijian Deng, Yapeng Tian, Chenliang Xu.

Contributing

We welcome contributions to improve AVSA-SEP. Please submit an issue or pull request with your proposed changes or enhancements.

License

This project is released under the MIT License. See the LICENSE file for more details.

Citation

Please cite our work if you use AVSA-SEP in your research:

@article{su2023separating,
  title={Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation},
  author={Su, Yiyang and Vosoughi, Ali and Deng, Shijian and Tian, Yapeng and Xu, Chenliang},
  journal={Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshop on AV4D: Visual Learning of Sounds in Spaces},
  year={2023}
}

Contact

For further information and support, please contact us.

About

ICCV 2023 AV4D paper - Audio-visual Sound Separation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published