Emotion-Aware Speech Generation with Integrated Text Analysis

This repository contains the code and experiments for the final year project in Computer Science, focusing on Emotion-Aware Speech Generation with Integrated Text Analysis using emotion embeddings from a RoBERTa model. It includes various Natural Language Processing (NLP) experiments performed during an NLP course, as well as a modified version of an existing text-to-speech synthesis codebase.

Samples

Please visit the GitHub page to view comparative samples.

Or generate new ones on HuggingFace

Project Overview

The project aims to generate emotion-aware speech using a modified text-to-speech synthesis system. By integrating emotion embeddings from a RoBERTa model, the generated speech output exhibits the desired emotions as specified by the input text.

Repository Structure

FYP_Notebooks/: Contains various notebooks for different experiments and data processing methods
FastSpeech2_Text_Aware_Emotion_TTS/: Contains the modified text-to-speech synthesis codebase for emotion-aware speech generation.
Transformers_for_NLP/: Contains various NLP experiments conducted during the Data Science: Transformers for Natural Language Processing course.
Utils/: Contains the code for processing and preparing the data for training and evaluation.

Getting Started

To run the experiments and use the Emotion-Aware Speech Generation system, follow these steps:

Clone this repository: git clone https://github.com/ionut-cmd/FYP.git
Navigate to the FastSpeech2_Text_Aware_Emotion_TTS/ directory.
Follow the installation and usage instructions provided in the FastSpeech2_Text_Aware_Emotion_TTS/README.md file.

Acknowledgements

This project is based on the ming024/FastSpeech2 for text-to-speech synthesis. I would like to thank the original author for their work, which served as a starting point for this project.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.github/workflows		.github/workflows
FYP_Notebooks		FYP_Notebooks
FastSpeech2_Text_Aware_Emotion_TTS		FastSpeech2_Text_Aware_Emotion_TTS
Transformers_for_NLP		Transformers_for_NLP
Utils		Utils
.DS_Store		.DS_Store
.gitignore		.gitignore
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
LICENSE		LICENSE
README.md		README.md
_config.yaml		_config.yaml
index.md		index.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emotion-Aware Speech Generation with Integrated Text Analysis

Samples

Project Overview

Repository Structure

Getting Started

Acknowledgements

License

About

Releases

Packages

Languages

License

ionut-cmd/Emotion-Aware-TTS

Folders and files

Latest commit

History

Repository files navigation

Emotion-Aware Speech Generation with Integrated Text Analysis

Samples

Project Overview

Repository Structure

Getting Started

Acknowledgements

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages