Run your own AI chatbot locally with Ollama, ensuring data privacy and avoiding cloud breaches. This lightweight setup requires no frameworks like React - just Ollama, a simple HTTP server, and a browser.
Evaluate and experiment with LLMs freely; own your AI.
- Local AI Control: Select from multiple configured models
- Privacy First: All processing happens on your machine
- No Internet Dependencies: Works completely offline
- Custom Personalities: Create specialized assistants with Modelfiles
- Explore Different Parameters: Resend a message with different temperature, top_p, and top_k values
- Expand with New Models on Ollama: When a new open-source model is published in the Ollama library, you can download it and use it right away.
- Visualize Reasoning: The reasoning portion of a response is currently shown when it is wrapped in the `<think>` tag (DeepSeek's reasoning token); this behavior can be extended or modified (see the sketch after this list).
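For example, the reasoning block can be separated from the answer with a small regex over the response text. A minimal sketch (`splitReasoning` is an illustrative name, not the app's actual function):

```js
// Sketch: separate DeepSeek-style <think> reasoning from the final answer.
// Adapt the tag in the regex if another model uses a different reasoning token.
function splitReasoning(text) {
  const match = text.match(/<think>([\s\S]*?)<\/think>/);
  return {
    reasoning: match ? match[1].trim() : null, // null when no reasoning block is present
    answer: match ? text.replace(match[0], "").trim() : text.trim(),
  };
}
```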
```bash
# Install Ollama (Linux)
curl -fsSL https://ollama.com/install.sh | sh
```

On Windows, download the installer from the Ollama Windows Preview.
```bash
# Start Ollama server locally
ollama serve
```
Ollama may already be running in the background, in which case the previous command will fail. Check whether it is running at the default URL, localhost:11434.
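A quick way to verify is to query the models endpoint, for example from the browser console (assumes the default port):

```js
// Sketch: check that the Ollama server answers on its default port.
fetch("http://localhost:11434/v1/models")
  .then(res => res.json())
  .then(list => console.log("Ollama is up:", list))
  .catch(() => console.error("Ollama is not reachable at localhost:11434"));
```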
```bash
# Choose based on your hardware
ollama pull deepseek-r1:7b    # 7B parameter version
ollama pull deepseek-r1:32b   # Medium-sized model
```
Models can also be downloaded from the Yla interface, but you have to specify the model name in the config file.
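A pull can also be triggered programmatically through Ollama's native `/api/pull` endpoint, which streams newline-delimited JSON progress objects. A minimal sketch (recent Ollama versions accept `model` as the body key; older ones used `name`):

```js
// Sketch: pull a model via Ollama's native API and log streamed progress.
async function pullModel(name) {
  const res = await fetch("http://localhost:11434/api/pull", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model: name }),
  });
  const reader = res.body.getReader();
  const decoder = new TextDecoder();
  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    console.log(decoder.decode(value)); // e.g. {"status":"pulling manifest"} ...
  }
}
```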
```bash
# Make the launch script executable and run it
chmod +x scripts/run_yla.sh
./scripts/run_yla.sh
```

Alternatively, start Ollama and a static file server manually:

```bash
ollama serve
python -m http.server 8000
# Open http://localhost:8000/src/yla.html in your browser
```
Create and use specialized models using Ollama's Modelfiles:
- Create a Modelfile in the `custom_models` directory:
```
# Example technical-assistant.txt
FROM deepseek-r1:7b
SYSTEM """
You are an expert technical assistant.
Respond in markdown format with detailed explanations.
"""
PARAMETER num_ctx 32768
```
- Build your custom model:
```bash
ollama create my-expert -f ./custom_models/technical-assistant.txt
```
- Add it to `config.js`:

```js
models: [
  {
    name: "my-expert:latest",
    description: "smart coding assistant",
    num_ctx: 32768,
    temperature: 0.7,
    top_k: 40,
    top_p: 0.9,
    systemMessage: "You are an expert technical assistant. Respond in markdown format with detailed explanations.",
    size: "4.1GB"
  }
]
```
| Field | Description | Example |
|---|---|---|
| `name` | Ollama model name (exact match required); include the version tag for custom models | `"my-expert:latest"` |
| `description` | Brief description (optional) | `"smart coding assistant"` |
| `size` | Model size; informational only, shown on the model panel | `"4.1GB"` |
| `num_ctx` | Context window size (tokens) | `32768` |
| `systemMessage` | Hidden behavior instructions; it does not change the model itself, but for custom models it displays the SYSTEM message defined at creation time | `"You are an expert technical assistant. Respond in markdown format with detailed explanations."` |
| `temperature` | Response creativity | `0.3-1.8` |

For custom models, remember to include the version tag (`:latest` by default) in the config.
```js
// config.js
const config = {
  models: [
    {
      name: "Yla:latest",   // Must match Ollama model name
      num_ctx: 65536,       // Context window size
      temperature: 0.7,     // 0-2 (0=precise, 2=creative)
      top_k: 40,            // 1-100
      top_p: 0.9,           // 0-1
      systemMessage: "Friendly general assistant",
      size: "4.1GB"
    }
  ],
  api: {
    endpoint: "http://localhost:11434/v1/chat/completions",
    available_models: "http://localhost:11434/v1/models"
  }
};
```
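For reference, a single chat turn against the endpoint configured above can look like the following minimal sketch (`sendMessage` is an illustrative name; error handling is omitted, and whether `top_k` is honored by the OpenAI-compatible endpoint depends on the Ollama version):

```js
// Sketch: send one chat turn to Ollama's OpenAI-compatible endpoint.
async function sendMessage(model, userText) {
  const res = await fetch(config.api.endpoint, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: model.name,
      messages: [
        // For custom models the SYSTEM prompt is baked in at creation time;
        // sending systemMessage again here is purely illustrative.
        { role: "system", content: model.systemMessage },
        { role: "user", content: userText },
      ],
      temperature: model.temperature,
      top_p: model.top_p,
      top_k: model.top_k,
    }),
  });
  const data = await res.json();
  return data.choices[0].message.content; // OpenAI-style response shape
}
```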
- Validation: App checks installed models on launch
- Selection: Choose model from initial carousel
- Download: If a configured model is not present on the local Ollama server, the app will try to pull it from Ollama; the download is attempted only for valid model names
- Persistence: Selected model remembered until page refresh
- Visual Feedback:
  - ✅ Available models - full color, clickable
  - ⚠️ Missing models - grayed out with a warning; download offered
Note: Model names in `config.js` must exactly match Ollama model names. Use `ollama list` to verify installed models.
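The launch-time validation can be as simple as comparing configured names against the IDs reported by the `available_models` endpoint. A sketch under that assumption (`findMissingModels` is an illustrative name):

```js
// Sketch: list configured models that Ollama does not report as installed.
async function findMissingModels() {
  const res = await fetch(config.api.available_models);
  const { data } = await res.json(); // OpenAI-style list: { data: [{ id: "model:tag", ... }] }
  const installed = new Set(data.map(m => m.id));
  return config.models.filter(m => !installed.has(m.name));
}
```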
Model Not Found:
- Verify the model name matches Ollama's list
- Check Ollama is running: `ollama serve`
- Ensure model files are in the `custom_models` directory
Performance Issues:
- Reduce `num_ctx` for smaller context windows
- Use smaller model variants (e.g., 7B instead of 32B)
- Close other memory-intensive applications