Skip to content

hanshounsu/d3rm

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

D3RM : A Discrete Denoising Diffusion Refinement Model for Piano Transcription

This is the source code of D3RM paper accepted in ICASSP 2025. Regarding the reproducement of the paper, please let me know your concerns and feel free to comment them in the Issues part.

Installation

git clone https://github.com/hanshounsu/d3rm.git
pip -r install requirements.txt

Current project is based on pytorch-lightning 2.5.0.

Model Download

  • Pretrained NAR-HC baseline model [link]
  • Pretrained D3RM model (To appear) [link]

Place the pretrained D3RM model in ./checkpoints/pretrained/

Download MAESTRO

Download here [link]

Place the dataset folder inside ./data

Training the model

python3 main_cli.py fit -c ./configs/D3RM_cli.yaml

Inference

python3 main_cli.py test -c ./logs/{TARGET_EXPERIMENT_CONFIG}

License

This project is licensed under The MIT License.

Citations

@misc{hskim2023d3rm,
      title={D3RM : A Discrete Denoising Diffusion Refinement Model for Piano Transcription},
      author={Hounsu Kim, Taegyun Kwon, Juhan Nam},
      year={2024},
      eprint={},
      archivePrefix={arXiv},
      primaryClass={cs.Sound}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published