pip install git+https://github.com/redapesolutions/suara-kami-community
or
git clone https://github.com/redapesolutions/suara-kami-community
cd suara-kami-community
pip install . --upgrade

fixing error(optional)
1. error: command 'gcc' failed: No such file or directory
-> sudo apt install build-essential gcc
2. OSError: sndfile library not found
-> sudo apt-get install libsndfile1

Models

Speech models(ONNX)

Malay
1. "conformer_tiny"
2. "conformer_small"
English
1. "silero_en"
2. "nemo_en"
Manglish
1. "conformer_medium"
Vad
1. "silero_vad"

Language models

Malay
1. "v1"
English
1. "en

Share data

Usage: feedback PATH

For detailed information on this command, run:
  feedback --help

feedback data_to_share # folder structure should be audio and txt file with same name but different ext for example audio.wav and audio.txt in same folder
feedback data_to_share.zip # same as above
feedback audio.wav

GPU Usage

pip uninstall onnxruntime onnxruntime-gpu -y
pip install onnxruntime-gpu --upgrade

GRPC Server/Client

check server/grpc folder

Web Example

check server/web folder

Websocket/Streaming Example

check server/websocket folder

Tutorials

Issue

4.1. The model not able to recognize my name/company/brand
- The reason why the model not able to recognize because it is not in the training dataset, you can create kenlm language model to make the model recognize it correctly or use Hotword with custom weight to correctly recognize it. See tutorials/2. speech to text with language model.ipynb

4.2. The model not able to recognize common word.
- The reason might be the word not in the training set, you can make the model predict correctly by following above suggestion or create an issue with the audio and text(or text only) so that we can make it work and add as our evaluation dataset.

4.3. Need feature X
- Can create issue with example application and we will consider to add it in the next version.

4.4. How to improve the model prediction?
- You can create an issue and share with us reproducible step to that lead to wrong prediction so that we can debug the issue or you can create your own language model to improve the model prediction. Currently we provide common word language model if you use "sk filepath --decoder v1" in cli or "predict(filepath,decoder='v1')" in python code

4.5. Want to contribute (Data,Compute power,Annotation,Features)
- Can contact us at [email protected]

References:

ONNX optimization based on https://mp.weixin.qq.com/s/ZLZ4F2E_wYEMODGWzdhDRg
https://github.com/NVIDIA/NeMo
https://github.com/alphacep/vosk-server/

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
mvp		mvp
scripts		scripts
server		server
sk		sk
tutorials		tutorials
.gitignore		.gitignore
BENCHMARKS.md		BENCHMARKS.md
LICENSE		LICENSE
README.md		README.md
TODO		TODO
berani.wav		berani.wav
loadtest.html		loadtest.html
old_README.md		old_README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

suara-kami-community

Table of Contents

Setup

Models

Share data

Tutorials

Issue

About

Releases

Packages

Contributors 2

Languages

License

redapesolutions/suara-kami-community

Folders and files

Latest commit

History

Repository files navigation

suara-kami-community

Table of Contents

Setup

Models

Share data

Tutorials

Issue

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages