VITS HiFiTTS doc #6288

Merged 19 commits on Apr 6, 2023
treacker (Contributor) commented Mar 24, 2023

What does this PR do?

Updates the VITS docs and model for the HiFiTTS version.

Collection: [TTS]

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

Signed-off-by: Evgeny Shabalin <[email protected]>
XuesongYang (Collaborator) left a comment

Thanks for your updates! Besides the comments above, could you please also add sections showing how to evaluate your VITS model, just like our FastPitch tutorial does?

Signed-off-by: Evgeny Shabalin <[email protected]>
XuesongYang (Collaborator) left a comment

Thanks for the updates. One question: are you using ARPABET or IPA units for VITS? If you use ARPABET, the phoneme dict should be cmudict-0.7b_nv22.10; otherwise, it should be ipa_cmudict-0.7b_nv23.01.txt.

treacker (Contributor, Author) commented Apr 5, 2023

I am using IPA

XuesongYang (Collaborator) commented

> I am using IPA

Gotcha! Then please update the IPA dict version to ipa_cmudict-0.7b_nv23.01.txt. I still see ipa_cmudict-0.7b_nv22.10.txt in use.
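
The dictionary rule from this review exchange can be sketched as a small lookup. This is only an illustration: `PHONEME_DICTS` and `phoneme_dict_for` are hypothetical names, not part of NeMo, and the file names are copied as written in the comments above.

```python
# Hypothetical helper (not a NeMo API): encodes the reviewer's rule for
# choosing a phoneme dictionary based on the phoneme units in use.
PHONEME_DICTS = {
    "arpabet": "cmudict-0.7b_nv22.10",      # name as written in the review
    "ipa": "ipa_cmudict-0.7b_nv23.01.txt",  # version requested for IPA
}


def phoneme_dict_for(units: str) -> str:
    """Return the dictionary name the review recommends for the given units."""
    key = units.strip().lower()
    if key not in PHONEME_DICTS:
        raise ValueError(f"unknown phoneme units: {units!r}")
    return PHONEME_DICTS[key]


if __name__ == "__main__":
    # The PR author uses IPA, so the nv23.01 dictionary applies.
    print(phoneme_dict_for("IPA"))
```

Keeping the mapping in one place makes a stale version string, such as the nv22.10 IPA dict flagged here, easier to spot.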

Evgeny Shabalin and others added 2 commits April 5, 2023 12:44
XuesongYang merged commit 515d36b into main Apr 6, 2023
XuesongYang deleted the vits_doc branch April 6, 2023 00:08
hsiehjackson pushed a commit to hsiehjackson/NeMo that referenced this pull request Jun 2, 2023
* Added VITS documentation
* Typos
* Added experimental note
* Updated tutorial
* Added spectrogram visualization
* Updated ipa_cmudict version

---------

Signed-off-by: Evgeny Shabalin <[email protected]>
Co-authored-by: Xuesong Yang <[email protected]>
Signed-off-by: hsiehjackson <[email protected]>