Zero Shot TTS

Survey

Zero Shot TTS

Projects

  • csm-voice-cloning - isaiahbjork

  • Spark-TTS - SparkAudio

    An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens · (sparkaudio.github)

  • F5-TTS - lpscr

    A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Products

Datasets

  • Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation, arXiv, 2501.15907, arxiv, pdf, citation: -1

    Haorui He, Zengqiang Shang, Chaoren Wang, ..., Pengyuan Zhang, Zhizheng Wu · (huggingface)

Toolkits

Misc