The heart of Tasya, my (voice) assistant - or at least, a wrapper for it.
- LLM RAG-backed responses
- Text API endpoint
- Voice API endpoint
- Random chatting
- Weather report
- Internet search
- A recent video card with around 16 GB of VRAM for all features. This can be trimmed down to ~6 GB of VRAM for the text-only interface.
- A LLaMA-like model run with Ollama; LLaMA 3 is the preferred choice
- A whisper.cpp instance
- An xtts-api-server instance (subject to change)
- Whispering voice generation: Yandex.Speechkit API key
- Internet search: Tavily API key
- Weather information: OpenWeatherMap API key
- Best-in-class translation: DeepL API key
And last but not least: `pip install -r requirements.txt`
POST /text_input
Generates a text response based on text input. The request body should be JSON; see the example below.
At least one of query or history is required. If both are specified, query is appended to the history.
query: str
- user question for the AI
history: str
- chat history for generation, provided as a text block in which speaker turns are separated by newlines.
History is prepended to the prompt as-is, so it should follow the LLaMA 3 chat format, e.g.:
<|start_header_id|>assistant<|end_header_id|>AI message...<|eot_id|>
session_id: str
- persistent key to save chat history on the server
translate: str
- two-letter language code; the query is translated from this language, and responses are translated back into it.
Internally, the model and history operate in English. This allows interacting with the AI in other languages, with a small loss in quality.
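
For example, a minimal sketch of calling this endpoint with Python's requests library (the host, port, and exact response shape are assumptions, not part of this spec):

```python
import requests

BASE_URL = "http://localhost:8000"  # hypothetical; use your server's address

# History in the LLaMA 3 chat format described above, turns separated by newlines.
history = (
    "<|start_header_id|>user<|end_header_id|>Hi, Tasya!<|eot_id|>\n"
    "<|start_header_id|>assistant<|end_header_id|>Hello! How can I help?<|eot_id|>"
)

resp = requests.post(
    f"{BASE_URL}/text_input",
    json={
        "query": "What's the weather like today?",
        "history": history,
        "session_id": "demo-session",  # keeps the chat history on the server
        "translate": "de",             # converse in German
    },
)
resp.raise_for_status()
print(resp.text)  # exact response format depends on the server
```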
POST /voice_input
Generates a voice response based on voice input. The request body should be multipart/form-data; see the example below.
file: application/octet-stream
- WAV-encoded voice input
history: str
- chat history for generation, provided as a text block in which speaker turns are separated by newlines.
History is prepended to the prompt as-is, so it should follow the LLaMA 3 chat format, e.g.:
<|start_header_id|>assistant<|end_header_id|>AI message...<|eot_id|>
session_id: str
- persistent key to save chat history on the server
translate: str
- two-letter language code; the query is translated from this language, and responses are translated back into it.
Internally, the model and history operate in English. This allows interacting with the AI in other languages, with a small loss in quality.
return_file: bool
- whether to return the resulting audio file in the response or play it directly on the voice_player instance
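
A minimal sketch of calling this endpoint with Python's requests (the host, port, and form-field encoding are assumptions, not part of this spec):

```python
import requests

BASE_URL = "http://localhost:8000"  # hypothetical; use your server's address

with open("question.wav", "rb") as f:
    resp = requests.post(
        f"{BASE_URL}/voice_input",
        files={"file": ("question.wav", f, "application/octet-stream")},
        data={
            "session_id": "demo-session",
            "translate": "de",
            "return_file": "true",  # return the WAV instead of remote playback
        },
    )
resp.raise_for_status()

# Assumes the response body is the synthesized WAV when return_file is true.
with open("answer.wav", "wb") as out:
    out.write(resp.content)
```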
A trimmed-down whisper.cpp client for the voice_input endpoint. It should be compiled against the whisper.cpp headers.
A simple WAV audio player operating over the network. It should be used together with the VOICE_PLAYER_HOST
config variable.
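
The actual wire protocol isn't documented here; purely as an illustration, a hypothetical sketch of such a player that accepts raw WAV bytes over a TCP socket and plays them with the third-party simpleaudio package:

```python
import socket
import tempfile

import simpleaudio  # pip install simpleaudio

HOST, PORT = "0.0.0.0", 5050  # hypothetical; match whatever VOICE_PLAYER_HOST points at

with socket.create_server((HOST, PORT)) as server:
    while True:
        conn, _ = server.accept()
        # Buffer the whole WAV payload to a temp file, then play it locally.
        # (NamedTemporaryFile reopening works on Linux; adjust for Windows.)
        with conn, tempfile.NamedTemporaryFile(suffix=".wav") as tmp:
            while chunk := conn.recv(65536):
                tmp.write(chunk)
            tmp.flush()
            simpleaudio.WaveObject.from_wave_file(tmp.name).play().wait_done()
```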
- Try LLaMA 70B. This should make it possible to use the OllamaFunctions LangChain wrapper, and should also resolve the "output only ..." problems.
- Try LangGraph. This currently isn't possible because no functions wrapper exists.
- Add more agents for different tasks.
- Tune models for real-time voice conversation.
- Tune prompts (might be unnecessary with the 70B model).
- Find a way to run xtts-api-server with DeepSpeed on ROCm (or a capable Nvidia card) to enable streaming voice generation.