This is the source code for the D3RM paper, accepted at ICASSP 2025.
If you have any concerns about reproducing the paper's results, feel free to raise them in the Issues section.
git clone https://github.com/hanshounsu/d3rm.git
pip install -r requirements.txt
The project is currently based on pytorch-lightning 2.5.0.
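To confirm that the environment has the stated version, a minimal sketch like the one below may help. It is not part of the repository; the helper names `installed_version` and `matches_expected` are hypothetical, and it assumes the package was installed under the distribution name `pytorch-lightning`.

```python
from importlib.metadata import PackageNotFoundError, version

# Version stated in this README; adjust if requirements.txt pins another one.
EXPECTED = "2.5.0"


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def matches_expected(found, expected=EXPECTED):
    """Return True only when a version was found and it equals `expected` exactly."""
    return found is not None and found == expected


if __name__ == "__main__":
    # Assumption: the distribution name is "pytorch-lightning".
    found = installed_version("pytorch-lightning")
    if matches_expected(found):
        print("pytorch-lightning", found, "matches the README")
    else:
        print("expected", EXPECTED, "but found", found)
```

An exact comparison is used here for simplicity; nearby patch releases may work as well, but that is not confirmed by this README.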
Place the pretrained D3RM model in ./checkpoints/pretrained/
Download here [link]
Place the dataset folder inside ./data
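Before training or testing, it can be useful to verify that both locations above are populated. The following is a hedged sketch, not a script shipped with the project; the function name `missing_or_empty` is hypothetical, and it only checks that each directory exists and contains at least one entry, not that the contents are valid.

```python
from pathlib import Path

# Directories named in this README, relative to the repository root.
REQUIRED_DIRS = ("checkpoints/pretrained", "data")


def missing_or_empty(root="."):
    """Return the required directories that are absent or contain no entries."""
    problems = []
    for rel in REQUIRED_DIRS:
        path = Path(root) / rel
        # A directory counts as a problem if it does not exist or is empty.
        if not path.is_dir() or not any(path.iterdir()):
            problems.append(rel)
    return problems


if __name__ == "__main__":
    problems = missing_or_empty(".")
    if problems:
        print("Missing or empty:", ", ".join(problems))
    else:
        print("Checkpoint and dataset directories are in place.")
```

Run it from the repository root so that the relative paths resolve as described above.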
python3 main_cli.py fit -c ./configs/D3RM_cli.yaml
python3 main_cli.py test -c ./logs/{TARGET_EXPERIMENT_CONFIG}
This project is licensed under the MIT License.
@misc{hskim2023d3rm,
title={D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription},
author={Hounsu Kim and Taegyun Kwon and Juhan Nam},
year={2024},
eprint={},
archivePrefix={arXiv},
primaryClass={cs.SD}
}