D3RM : A Discrete Denoising Diffusion Refinement Model for Piano Transcription

This is the source code for the D3RM paper, accepted at ICASSP 2025. If you have any concerns about reproducing the paper's results, please raise them in the Issues section.

Installation

git clone https://github.com/hanshounsu/d3rm.git
pip install -r requirements.txt

The project is currently based on pytorch-lightning 2.5.0.
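Since the code targets a specific pytorch-lightning release, it may help to confirm what is installed before training. The snippet below is a minimal sketch, not part of the repository: the helper name `lightning_version_status` is hypothetical, and it assumes the package was installed under the distribution name `pytorch-lightning` (it could also be present as `lightning`, which this check would not see).

```python
from importlib import metadata

EXPECTED = "2.5.0"  # version stated in this README


def lightning_version_status(expected=EXPECTED, package="pytorch-lightning"):
    """Return a short status string comparing the installed version to `expected`.

    Hypothetical helper for illustration; `package` is the assumed
    distribution name as it would appear in `pip list`.
    """
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        return f"{package} is not installed (expected {expected})"
    if installed == expected:
        return f"{package} {installed} matches the expected version"
    return f"{package} {installed} differs from expected {expected}"


print(lightning_version_status())
```

A mismatch does not necessarily mean the code will fail, only that the environment differs from the one named above.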

Model Download

Pretrained NAR-HC baseline model [link]
Pretrained D3RM model (To appear) [link]

Place the pretrained D3RM model in ./checkpoints/pretrained/

Download MAESTRO

Download the MAESTRO dataset here [link]

Place the dataset folder inside ./data
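Before training or inference, a quick check that the two locations named above (./checkpoints/pretrained/ and ./data) exist and are non-empty can save a failed run. This is a minimal sketch using only the paths stated in this README; the helper name `missing_paths` is hypothetical, and it does not verify the contents of the dataset or checkpoint.

```python
from pathlib import Path


def missing_paths(root="."):
    """Return expected directories under `root` that are absent or empty.

    Hypothetical helper for illustration; only the two directories
    mentioned in this README are checked.
    """
    root = Path(root)
    expected = [root / "checkpoints" / "pretrained", root / "data"]
    # A directory counts as missing if it does not exist or has no entries.
    return [str(p) for p in expected if not p.is_dir() or not any(p.iterdir())]


problems = missing_paths()
print(problems if problems else "checkpoint and data directories are in place")
```

Run it from the repository root so that the relative paths resolve as intended.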

Training the model

python3 main_cli.py fit -c ./configs/D3RM_cli.yaml

Inference

python3 main_cli.py test -c ./logs/{TARGET_EXPERIMENT_CONFIG}
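The {TARGET_EXPERIMENT_CONFIG} placeholder has to be replaced with the config of the run you want to evaluate. The sketch below lists candidate files and prints the matching command for each. It assumes that a training run leaves a YAML config somewhere under ./logs; the exact layout is not documented here, and the helper name `find_experiment_configs` is hypothetical.

```python
from pathlib import Path


def find_experiment_configs(log_dir="./logs"):
    """Return YAML files found under `log_dir`, most recently modified first.

    Hypothetical helper for illustration; assumes experiment configs are
    stored as .yaml/.yml files somewhere below `log_dir`.
    """
    log_dir = Path(log_dir)
    if not log_dir.is_dir():
        return []
    configs = [
        p for p in log_dir.rglob("*")
        if p.is_file() and p.suffix in {".yaml", ".yml"}
    ]
    return sorted(configs, key=lambda p: p.stat().st_mtime, reverse=True)


# Print a ready-to-run test command for every candidate config.
for cfg in find_experiment_configs():
    print(f"python3 main_cli.py test -c {cfg}")
```

If nothing is printed, either no run has been logged yet or the configs are stored in a different location or format.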

License

This project is licensed under The MIT License.

Citations

@misc{hskim2023d3rm,
      title={D3RM : A Discrete Denoising Diffusion Refinement Model for Piano Transcription},
      author={Hounsu Kim and Taegyun Kwon and Juhan Nam},
      year={2024},
      eprint={},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}
