MoeGoe Server

CPU 推理。没有用到 GPU。

Based on Fastapi. Only tts and only support some type model.

Server

Server Main Body Include:

server.py  # main
event.py # lib
config.toml # config

🐾 Tip

The dependencies for this project are based on numpy==1.22.0 , which may break system dependencies!

🪐 Install

pip install -r requirements.txt

apt install libsndfile1

Mkdir model and run server.py to start this server.

After that,Fastapi Docs -> url/docs

🪵 Set Model

Server requirements for model placement

model
|---- somemodel.pth
|---- somemodel.pth.json (== config.json)
|---- info.json

info.json

Model used for init....

{
  "model": [
    "somemodel.pth"
  ]
}

Param

POST

from pydantic import BaseModel


class TTS_REQ(BaseModel):
    model_name: str = ""
    task_id: int = 1
    text: str = "[ZH]你好[ZH]"
    speaker_id: int = 0
    audio_type: str = "ogg"  # flac wav ogg

RETURN

from pydantic import BaseModel


class TTS_REQ_DATA(BaseModel):
    code: int = 404
    msg: str = "unknown error"
    audio: str = ""
    speaker: str = ""
    model_type: str = ""

OGG

make sure the ogg is encoded with opus codec

Other

Other Api implementations https://github.com/fumiama/MoeGoe Just found out after writing, SAD

Links_

How to use

Run MoeGoe.exe

Path of a VITS model: path\to\model.pth
Path of a config file: path\to\config.json
INFO:root:Loaded checkpoint 'path\to\model.pth' (iteration XXX)

Text to speech

TTS or VC? (t/v):t
Text to read: こんにちは。
ID      Speaker
0       XXXX
1       XXXX
2       XXXX
Speaker ID: 0
Path to save: path\to\demo.wav
Successfully saved!

Voice conversion

TTS or VC? (t/v):v
Path of an audio file to convert:
path\to\origin.wav
ID      Speaker
0       XXXX
1       XXXX
2       XXXX
Original speaker ID: 0
Target speaker ID: 6
Path to save: path\to\demo.wav
Successfully saved!

HuBERT-VITS

Path of a hubert-soft model: path\to\hubert-soft.pt
Path of an audio file to convert:
path\to\origin.wav
ID      Speaker
0       XXXX
1       XXXX
2       XXXX
Target speaker ID: 6
Path to save: path\to\demo.wav
Successfully saved!

W2V2-VITS

Path of a w2v2 dimensional emotion model: path\to\model.onnx
TTS or VC? (t/v):t
Text to read: こんにちは。
ID      Speaker
0       XXXX
1       XXXX
2       XXXX
Speaker ID: 0
Path of an emotion reference: path\to\reference.wav
Path to save: path\to\demo.wav
Successfully saved!

Name		Name	Last commit message	Last commit date
Latest commit History 152 Commits
.github/workflows		.github/workflows
jieba		jieba
model		model
test		test
text		text
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
MoeGoe.py		MoeGoe.py
README.md		README.md
attentions.py		attentions.py
commons.py		commons.py
config.toml		config.toml
docker-compose.yml		docker-compose.yml
event.py		event.py
hubert_model.py		hubert_model.py
mel_processing.py		mel_processing.py
models.py		models.py
modules.py		modules.py
pm2.json		pm2.json
requirements.txt		requirements.txt
server.py		server.py
transforms.py		transforms.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MoeGoe Server

Server

🐾 Tip

🪐 Install

🪵 Set Model

Param

Other

Links_

How to use

Text to speech

Voice conversion

HuBERT-VITS

W2V2-VITS

About

Releases

Packages

Languages

License

aiastia-dockerhub/MoeGoe

Folders and files

Latest commit

History

Repository files navigation

MoeGoe Server

Server

🐾 Tip

🪐 Install

🪵 Set Model

Param

Other

Links_

How to use

Text to speech

Voice conversion

HuBERT-VITS

W2V2-VITS

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages