-
Notifications
You must be signed in to change notification settings - Fork 672
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Constant Q-Transform #588
Comments
Hi, I'm interested in implementing CQT and I have questions about it.
|
I have implemented the librosa CQT in my project, I hope it would be useful for you guys. Several improvements can be made:
Regarding pseduo_cqt, I know that they did not use any downsampling, but I am not sure how is it different from the CQT algorithm proposed in 1992. If they are the same, then I also have this version of CQT named as CQT1992 in my project. I hope it would be useful for you guys, and I am currently also curious about how to implement inverse CQT. |
Hi all, |
There is no plan currently, but I'd welcome a pull request from the community that implements CQT and its inverse. If you are interested in working on such a pull request, please feel free to do so :)
|
Hey everyone, I am currently wrapping up torchaudio implementations of the VQT, CQT, and iCQT, that test against librosa (torchaudio resampling changes the signal too much compared to librosa after a few iterations, but the first few octaves have the same or similar values; proposed version is also much much quicker than librosa; all details in a PR to come). Do I have the green light to PR? Just wrapping up the last batch of tests 🧪 Let's get these wonderful transforms to torchaudio! Edit: link to the forked repo with changes is here |
Hey everyone, A quick follow up from the above. The librosa_vqt = cqt(
y=y,
sr=<SAMPLE_RATE>,
hop_length=<HOP_LENGTH>,
fmin=<F_MIN>,
n_bins=<N_BINS>,
bins_per_octave=<BINS_PER_OCTAVE>,
sparsity=0.,
res_type="sinc_best",
scale=False,
) Here's a sample figure comparing the proposed and librosa versions using the audio snippet from here, with:
The results are pretty much identical 😃 Opening a draft PR for now. |
Hi I no longer maintain this library, so I'm in a bit awkward position, but with the unit testing and such, this looks low risk/low maintenance cost addition. @nateanl thoughts? |
I'm down to adding this feature to TorchAudio. Although librosa already has implementation of it, enabling the feature with GPU computation can boost the training speed. |
Cool, thanks for the quick answers! I'll finish up the last few details and request your review in the coming days. |
We would like to have in torchaudio
The text was updated successfully, but these errors were encountered: