Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement

This is the implementation of Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement.

Environment Requirements

# create virtual environment
conda create --name COSE python=3.9.0

# activate environment
conda activate COSE

# install required packages
pip install -r requirements.txt
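
After installation, it can help to confirm that everything listed in requirements.txt actually landed in the active environment. The following is a minimal sketch, not part of this repository: the helper name `missing_requirements` is hypothetical, and it assumes the file uses plain `name==version` style lines (URL or VCS requirements are not handled).

```python
import re
from importlib import metadata


def missing_requirements(requirement_lines):
    """Return the names of listed packages that are not installed.

    Hypothetical helper: only simple "name" or "name==version" lines are
    understood; pip options such as "-r other.txt" are skipped.
    """
    missing = []
    for line in requirement_lines:
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or line.startswith("-"):  # skip blanks and pip options
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match is None:
            continue
        name = match.group(0)
        try:
            metadata.version(name)  # raises if the distribution is absent
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing
```

To use it, open requirements.txt and pass the file object (or a list of its lines) to `missing_requirements`; an empty list means every listed package was found.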

How to Train

python train.py --log_dir <path_to_model> --base_dir <path_to_dataset>
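
This README does not describe the layout expected under `--base_dir`. Since the code builds on SGMSE, the sketch below assumes the SGMSE convention of `train/`, `valid/`, and `test/` folders, each holding `clean/` and `noisy/` subfolders with matching `.wav` file names. That layout is an assumption, and `check_dataset_layout` is a hypothetical helper rather than something this repository provides.

```python
from pathlib import Path

# Assumed layout (SGMSE convention, not confirmed by this README):
#   <base_dir>/{train,valid,test}/{clean,noisy}/*.wav
SPLITS = ("train", "valid", "test")
KINDS = ("clean", "noisy")


def check_dataset_layout(base_dir):
    """Return a list of problems found; an empty list means the layout looks fine."""
    problems = []
    base = Path(base_dir)
    for split in SPLITS:
        names = {}
        for kind in KINDS:
            folder = base / split / kind
            if not folder.is_dir():
                problems.append(f"missing directory: {folder}")
                continue
            names[kind] = {p.name for p in folder.glob("*.wav")}
        # Only compare file names when both folders of the split exist.
        if len(names) == len(KINDS) and names["clean"] != names["noisy"]:
            unpaired = sorted(names["clean"] ^ names["noisy"])
            problems.append(
                f"{split}: {len(unpaired)} unpaired file(s), e.g. {unpaired[0]}"
            )
    return problems
```

Running this check before `train.py` can catch a misplaced folder or an unpaired file early, before a long training job starts.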

How to Run Inference

To enhance noisy speech using a trained checkpoint, run:

python enhancement.py \
    --test_dir <path_to_noisy> \
    --enhanced_dir <path_to_enhanced> \
    --ckpt <path_to_model_checkpoint>
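
For batch jobs, the same call can be driven from Python. This is a rough sketch that uses only the three flags shown above; the function names `build_enhance_command` and `enhance` are hypothetical, and the script is assumed to be run from the repository root so that `enhancement.py` resolves.

```python
import subprocess
import sys
from pathlib import Path


def build_enhance_command(test_dir, enhanced_dir, ckpt):
    """Assemble the enhancement.py call using only the flags documented above."""
    return [
        sys.executable, "enhancement.py",
        "--test_dir", str(test_dir),
        "--enhanced_dir", str(enhanced_dir),
        "--ckpt", str(ckpt),
    ]


def enhance(test_dir, enhanced_dir, ckpt):
    """Validate the inputs, then run enhancement.py (assumes cwd is the repo root)."""
    if not Path(test_dir).is_dir():
        raise FileNotFoundError(f"noisy directory not found: {test_dir}")
    if not Path(ckpt).is_file():
        raise FileNotFoundError(f"checkpoint not found: {ckpt}")
    # check=True raises CalledProcessError if the script exits with an error.
    subprocess.run(build_enhance_command(test_dir, enhanced_dir, ckpt), check=True)
```

Checking the paths up front gives a clearer error than a failure deep inside model loading.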

How to Compute Evaluation Metrics

To calculate metrics, run:

python calc_metrics.py \
    --clean_dir <path_to_clean> \
    --noisy_dir <path_to_noisy> \
    --enhanced_dir <path_to_enhanced>
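
This README does not list which metrics `calc_metrics.py` reports. As an illustration of the kind of measure commonly used in speech enhancement, here is a dependency-free sketch of scale-invariant SDR (SI-SDR). Whether and how the script computes it is an assumption; the function below is not taken from this repository, and it skips the mean removal that some definitions apply first.

```python
import math


def si_sdr(reference, estimate):
    """Scale-invariant signal-to-distortion ratio in dB for two equal-length sequences."""
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    dot = sum(r * e for r, e in zip(reference, estimate))
    ref_energy = sum(r * r for r in reference)
    if ref_energy == 0.0:
        raise ValueError("reference signal is silent")
    scale = dot / ref_energy  # gain that projects the estimate onto the reference
    target = [scale * r for r in reference]
    noise = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    noise_energy = sum(n * n for n in noise)
    if noise_energy == 0.0:
        return math.inf  # estimate is a scaled copy of the reference
    if target_energy == 0.0:
        return -math.inf  # estimate is orthogonal to the reference
    return 10.0 * math.log10(target_energy / noise_energy)
```

Because the estimate is projected onto the reference before the ratio is taken, rescaling the enhanced signal does not change the score, which is why the metric is called scale-invariant.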

Pretrained Checkpoints (Coming Soon)

We will provide pretrained checkpoints trained on the VoiceBank-DEMAND dataset in the near future.

Stay tuned! We will update this section as soon as the checkpoints are ready for public release.

Built Upon & Related Work

This repository builds on the following prior work:

🔗 SGMSE: https://github.com/sp-uhh/sgmse

🔗 StoRM: https://github.com/sp-uhh/storm

🔗 FlowMSE: https://github.com/seongq/flowmse

Note: This work extends the above methods with a one-step generation framework while retaining their complex STFT-based front-end data processing design.
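
To give some intuition for why an average velocity allows one-step generation, here is a toy sketch on a scalar ODE. It is not the repository's model or sampler: the ODE, the constant `K`, the time convention (integrating from t=1 back to t=0), and all function names are assumptions chosen for illustration. In a trained system a network would approximate the average velocity; here it is available in closed form.

```python
import math

K = 1.5  # rate of the toy ODE dz/dt = K * z (hypothetical, for illustration only)


def instantaneous_velocity(z, t):
    """Velocity field of the toy ODE; a multi-step sampler integrates this."""
    return K * z


def average_velocity(z_t, r, t):
    """Closed-form average velocity over [r, t]: (z_t - z_r) / (t - r)."""
    z_r = z_t * math.exp(K * (r - t))  # exact solution of the toy ODE at time r
    return (z_t - z_r) / (t - r)


def euler_sample(z_1, steps):
    """Integrate from t=1 back to t=0 with `steps` Euler steps."""
    z, t, dt = z_1, 1.0, 1.0 / steps
    for _ in range(steps):
        z -= dt * instantaneous_velocity(z, t)
        t -= dt
    return z


def one_step_sample(z_1):
    """Jump from t=1 to t=0 with a single evaluation of the average velocity."""
    return z_1 - (1.0 - 0.0) * average_velocity(z_1, 0.0, 1.0)
```

Starting from `z_1 = 2.0 * math.exp(K)`, the true value at t=0 is 2.0. `one_step_sample` recovers it up to floating-point error, while `euler_sample` needs many steps on the instantaneous velocity to get close. The average velocity already accounts for how the trajectory curves over the whole interval, so a single step suffices.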

Citations / References

If you use this repository or reference our work in your research, please cite us using the following BibTeX entry:

@misc{yang2025composeyourselfaveragevelocityflow,
      title={Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement}, 
      author={Gang Yang and Yue Lei and Wenxin Tai and Jin Wu and Jia Chen and Ting Zhong and Fan Zhou},
      year={2025},
      eprint={2509.15952},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2509.15952}, 
}

