Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ONNX standalone #151

Open
tripathiarpan20 opened this issue Dec 26, 2023 · 1 comment
Open

ONNX standalone #151

tripathiarpan20 opened this issue Dec 26, 2023 · 1 comment

Comments

@tripathiarpan20
Copy link

Hi! Thanks for the amazing open source work!

I was looking through onnx_export.py and onnx_bench.py and I was wondering how to run it end to end in a standalone Colab notebook.

Specifically, how do we replace dummy_specs = torch.rand(1, 257, 60) with a mp3/wav audio (of variable time length) converted to a torch Tensor (by rmvpe model? I'm really new to speech model architectures so not sure) with the ONNX converted checkpoint.

Thanks

@stealthinu
Copy link
Contributor

I believe you are referring to the pitch estimation (rmvpe) and might be looking at the MMVC1.5 branch (v1.5.0.0_SiFiGAN).
In this context, the size of the input tensor is intentionally fixed, rather than dynamic. This is because the input size in the time dimension varies between specs and sin, d0, d1, d2, d3. ONNX cannot dynamically handle such inputs, which is why the size is fixed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants