Skip to content

Releases: PaddlePaddle/PaddleSpeech

PaddleSpeech r0.1.1

14 Jan 03:27
3d5aac6
Compare
Choose a tag to compare

New Features

CLI :

  • Add cli stats. #1274
  • Add unit test. #1321
  • ASR: Support English: Add transformer_libirspeech model. #1297
  • ASR: Support 4 decoding methods: ctc_greedy_search, ctc_beam_search, attention, attention_rescoring. #1297
  • ASR & ST: Use the unified config. #1305 / #1312
  • ASR: Refactor the code. #1260 by @AdamBear
  • TTS: Support long input text by default. #1241
  • TTS: Add Style MelGAN and HiFiGAN. #1241

ASR

  • Refactor configs in examples. #1225

TTS

ST

  • Refactor configs in examples. #1225

Text

  • Refactor Punctuation Restoration example. #1215

Docs

  • Add topic note for releasing python packages
  • Add TTS papers. #1330
  • Add Frontend G2P topic. #1254

Others

  • Update released models and results. #1306

Acknowledgements

@zh794390558 @yt605155624 @Jackwaterveg @KPatr1ck @Mingxue-Xu @JiehangXie @grasswolfs @jerryuhoo @AdamBear @LittleChenCc @JamesLim-sy

PaddleSpeech r0.1.0

23 Dec 07:58
1656fde
Compare
Choose a tag to compare

Features

CLI : New Feature

  • Easy install by pip pip install paddlespeech
  • CLI to quick explore ASR, TTS, audio classification, speech translation and punctuation restoration.

ASR

  • Join CTC LM decoder
  • Transformer LM model
  • Improve DeepSpeech2 online model
  • Refactor some configs

TTS

CLS

  • Add audio classification example on ESC-50 and custom dataset.
  • Add audio tagging demo based on PANNs and Audioset labels.

ST

  • ST-MTL
  • FAT-ST-MTL

Docs

  • Add quick start
  • Add read the doc
  • Improve installation documentation
  • Add README for each example

Demos

  • Audio_tagging
  • Automatic_video_subtitiles
  • Metaverse
  • Punctuation_restoration
  • Speech_recognition
  • Speech_translation
  • Story_talker
  • Style_fs2
  • Text_to_speech

Others

  • Update released models and results

Acknowledgements

@zh794390558 @KPatr1ck @Jackwaterveg @yt605155624 @Mingxue-Xu @grasswolfs @jerryuhoo

DeepSpeech v2.1.1

16 Aug 03:20
0309c36
Compare
Choose a tag to compare
  1. ctc alignment
  2. refactor data pipeline
  3. autolog for deepspeech test
  4. refactor checkpoint save/load
  5. deepspeech online model
  6. mfa alignment example
  7. add text normaliztion example
  8. TLG for aishell
  9. more dataest: thchs30, aidatatang, timit etc.
  10. 8k speech example
  11. ted en-zh st example
  12. more utils

DeepSpeech v2.1.0

29 Jun 12:24
5ef4a34
Compare
Choose a tag to compare
  1. Transformer/Conformer Offline/Online ASR
  2. Unified CTC Loss for DS2 model and Transformer Model

DeepSpeech v1.1.0

25 Feb 03:42
b3c728d
Compare
Choose a tag to compare

paddle 1.8.x with python2

DeepSpeech v1.0.0

25 Feb 03:41
9f7fa29
Compare
Choose a tag to compare

master latest code