Dippy SN11: Creating The World's Best Open-Source Roleplay LLM

Check out the beta version of our Front-End! Also, please check our Launch Tweet for our vision of creating the world's best open-source roleplay LLM.

Introduction
Roadmap
Overview of Miner and Validator Functionality
- Miner
- Validator
Running Miners and Validators
- Running a Miner
- Running a Validator
Contributing
License

Introduction

Note: The following documentation assumes you are familiar with basic Bittensor concepts: Miners, Validators, and incentives. If you need a primer, please check out https://docs.bittensor.com/learn/bittensor-building-blocks.

The Dippy Roleplay subnet on Bittensor aims to create the world's best open-source roleplay LLM by leveraging the collective efforts of the open-source community. This subnet addresses the critical issue of loneliness, which affects a significant portion of the population and is linked to various mental and physical health problems.

Current SOTA LLMs (Claude, OpenAI etc.) are designed for the assistant use case and lack the empathetic qualities necessary for companionship. While some companies (c.ai, chai, inflection etc) have developed closed-source roleplay LLMs, the open-source alternatives lag significantly behind in performance.

Our team at Impel Intelligence Inc. knows this issue intimately through building Dippy, a proactive AI companion app for iOS. In this subnet, we will bring together the entire open-source eco-system to build the world’s best roleplay LLM.

Roadmap

Given the complexity of creating a state of the art roleplay LLM, we plan to divide the process into 3 distinct phases.

Phase 1:

Subnet launch with robust pipeline for roleplay LLM evaluation on public datasets and response length
Public model leaderboard based on evaluation criteria
Introduce Coherence and Creativity as a criteria for live model evaluation

Phase 2:

Publicly release front-end powered by top miner submitted model of the week
Segment model submission into different "expert" categories (funny, romantic, therapeutic etc)
Models with the highest score in each personality type are chosen as "expert" models and made publicly available on the front-end

Phase 3:

New Mixture of Experts model made as a baseline based on the "expert" models chosen from Phase 2
Robust pipeline to evaluate new MOE model submissions against live evaluation criteria
Expand the state of the art in roleplay LLMs through continuous iteration and data collection

Overview of Miner and Validator Functionality

Miners would use existing frameworks, fine tuning techniques, or MergeKit, to train, fine tune, or merge models to create a unique roleplay LLM. These models would be submitted to a shared Hugging Face pool.

Validators would evaluate the and assess model performance via our protocol and rank the submissions based on various metrics (empathy, conciseness etc). We will provide a suite of testing and benchmarking protocols with state-of-the-art datasets.

Running Miners and Validators

Running a Miner

Requirements

Python 3.8+
GPU with at least 24 GB of VRAM

Setup

To start, clone the repository and cd to it:

git clone https://github.com/impel-intelligence/dippy-bittensor-subnet.git
cd dippy-bittensor-subnet
pip install -e .

Submitting a model

As a miner, you're responsible for leveraging all methods available at your disposal, including but not limited to training new models, merging existing models (we recommend MergeKit), finetuning existing models, and so on to push roleplay LLMs forward.

We outline the following criteria for Phase 1:

Models should be 7B-13B parameters. Current maximum model size is 32GB.
We don't support quantized models at the moment...coming soon!
Models MUST be Safetensors Format! Check upload_models.py for how the model upload precheck works.
Please test the model by loading model using transformers.AutoModelForCausalLM.from_pretrained
(Recommended) Test the model with arbitrary inputs, before submitting, to check for NaNs.
Models we are confident will work are of the Mistral-7B and Llama-3 8B family.
We support the "alpaca", "chatml", "llama2", "llama3", "mistral", "vicuna" and "zephyr" chat templates.

Once you're happy with the performance of the model for the usecase of Roleplay, you can simply submit it to Hugging Face 🤗 and then use the following command:

python3 dippy_subnet/upload_model.py --hf_repo_id HF_REPO --wallet.name WALLET  --wallet.hotkey HOTKEY --chat_template MODEL_CHAT_TEMPLATE --model_dir PATH_TO_MODEL

Running a Validator

Requirements

Python 3.9+
API key for wandb (see below)

Setup WandB (HIGHLY RECOMMENDED - VALIDATORS PLEASE READ)

Before running your validator, it is recommended to set up Weights & Biases (wandb). The purpose of wandb is for tracking key metrics across validators to a publicly accessible page. here. We highly recommend validators use wandb, as it allows subnet developers and miners to diagnose issues more quickly and effectively, say, in the event a validator were to be set abnormal weights. Wandb logs are collected by default, and done so in an anonymous fashion, but we recommend setting up an account to make it easier to differentiate between validators when searching for runs on our dashboard. If you would not like to run WandB, you can do so by not providing the flag --wandb-key when running your validator.

Before getting started, as mentioned previously, you'll first need to register for a wandb account, and then set your API key on your system. Here's a step-by-step guide on how to do this on Ubuntu:

Step 1: Installation of WANDB

Before logging in, make sure you have the wandb Python package installed. If you haven't installed it yet, you can do so using pip:

# Should already be installed with the repo
pip install wandb

Step 2: Obtain Your API Key

Log in to your Weights & Biases account through your web browser.
Go to your account settings, usually accessible from the top right corner under your profile.
Find the section labeled "API keys".
Copy your API key. It's a long string of characters unique to your account.

Step 3: Setting Up the API Key in Ubuntu

To configure your WANDB API key on your Ubuntu machine, follow these steps:

Log into WANDB: Run the following command in the terminal:
```
wandb login
```
Enter Your API Key: When prompted, paste the API key you copied from your WANDB account settings.
- After pasting your API key, press Enter.
- WANDB should display a message confirming that you are logged in.
Verifying the Login: To verify that the API key was set correctly, you can start a small test script in Python that uses WANDB. If everything is set up correctly, the script should run without any authentication errors.
Setting API Key Environment Variable (Optional): If you prefer not to log in every time, you can set your API key as an environment variable in your ~/.bashrc or ~/.bash_profile file:
```
echo 'export WANDB_API_KEY=your_api_key' >> ~/.bashrc
source ~/.bashrc
```
Replace your_api_key with the actual API key. This method automatically authenticates you with wandb every time you open a new terminal session.

Setup

To start, clone the repository and cd to it:

git clone https://github.com/impel-intelligence/dippy-bittensor-subnet.git
cd dippy-bittensor-subnet
pip install -e .

To run the evaluation, simply use the following command:

python neurons/validator.py --wallet.name WALLET_NAME --wallet.hotkey WALLET_HOT_NAME --wandb-key WANDBKEY

To run auto-updating validator with PM2 (recommended):

pm2 start --name sn11-vali-updater --interpreter python scripts/start_validator.py -- --pm2_name sn11-vali --wallet.name WALLET_NAME --wallet.hotkey WALLET_HOT_NAME [other vali flags]

Please note that this validator will call the model validation service hosted by the dippy subnet owners. If you wish to run the model validation service locally, please follow the instructions below.

Running the model evaluation API (Optional)

Note: Currently (June 17 2024) there are some issues with the local evaluation api. We recommend using the remote validation api temporarily.

Starting a validator using your local validator API requires starting validator with --use-local-validation-api flag. Additionally, a model queue is required to push models to the validation api.

Note: Validator API needs to be installed in a different venv than validator due to pydantic version conflict.

Requirements

Python 3.9+
Linux

Setup

Install Git Lfs if not installed.

curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
sudo apt-get install git-lfs

If you are running on runpod you might also need to install 'netstat'.

apt-get install net-tools

To start, clone the repository and cd into it:

git clone https://github.com/impel-intelligence/dippy-bittensor-subnet.git
cd dippy-bittensor-subnet
python3 -m venv model_validation_venv
source model_validation_venv/bin/activate
model_validation_venv/bin/pip install -e . --no-deps
model_validation_venv/bin/pip install -r requirements_val_api.txt

Run model validation API service

cd dippy_validation_api
chmod +x start_validation_service.sh
./start_validation_service.sh

Test that it's working

python3 test_api.py

And you should see a json showing that the model status is "QUEUED" Running the same command again for sanity's sake, you should see the status of the model as "RUNNING".

Stop model validation API service

chmod +x kill_validation_api.sh
./kill_validation_api.sh

Running the validator with your own validation API service running locally

# Make a separate venv for the validator because of pydantic version conflict
python -m venv validator_venv
validator_venv/bin/pip install -e .
validator_venv/bin/python neurons/validator.py --wallet.name WALLET_NAME --wallet.hotkey WALLET_HOT_NAME --use-local-validation-api
# Run model queue to push models to validation api to be evaluated
validator_venv/bin/python neurons/model_queue.py --use-local-validation-api

Model Evaluation Criteria

Model Size

A smaller model will score higher than a big model. Model size is the disk space occupied by the model repo from HF. The max model size is limited to 18GB.

Latency

A faster model will score higher than a slow model.

Output Similarity

Evaluated against datasets, a model that generates similiar resposne to groundtruth will score higher.

Vibe Matching

A model that can generate outputs with similiar length to its inputs will score higher.

Acknowledgement

Our codebase is built upon Nous Research's and MyShell's Subnets.

License

The Dippy Bittensor subnet is released under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 175 Commits
.circleci		.circleci
assests		assests
constants		constants
contrib		contrib
dippy_subnet		dippy_subnet
dippy_validation_api		dippy_validation_api
docs		docs
model		model
neurons		neurons
scoring		scoring
scripts		scripts
template		template
tests		tests
utilities		utilities
.dockerignore		.dockerignore
.gitignore		.gitignore
Dippy.png		Dippy.png
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
drawio.png		drawio.png
evaluator.Dockerfile		evaluator.Dockerfile
min_compute.yml		min_compute.yml
pyproject.toml		pyproject.toml
requirements.eval.txt		requirements.eval.txt
requirements.txt		requirements.txt
requirements_val_api.txt		requirements_val_api.txt
setup.py.bak		setup.py.bak
vapi.Dockerfile		vapi.Dockerfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dippy SN11: Creating The World's Best Open-Source Roleplay LLM

Introduction

Roadmap

Overview of Miner and Validator Functionality

Running Miners and Validators

Running a Miner

Requirements

Setup

Submitting a model

Running a Validator

Requirements

Setup WandB (HIGHLY RECOMMENDED - VALIDATORS PLEASE READ)

Step 1: Installation of WANDB

Step 2: Obtain Your API Key

Step 3: Setting Up the API Key in Ubuntu

Setup

Running the model evaluation API (Optional)

Requirements

Setup

Run model validation API service

Test that it's working

Stop model validation API service

Running the validator with your own validation API service running locally

Model Evaluation Criteria

Model Size

Latency

Output Similarity

Vibe Matching

Acknowledgement

License

About

Releases

Packages

Contributors 9

Languages

License

impel-intelligence/dippy-bittensor-subnet

Folders and files

Latest commit

History

Repository files navigation

Dippy SN11: Creating The World's Best Open-Source Roleplay LLM

Introduction

Roadmap

Overview of Miner and Validator Functionality

Running Miners and Validators

Running a Miner

Requirements

Setup

Submitting a model

Running a Validator

Requirements

Setup WandB (HIGHLY RECOMMENDED - VALIDATORS PLEASE READ)

Step 1: Installation of WANDB

Step 2: Obtain Your API Key

Step 3: Setting Up the API Key in Ubuntu

Setup

Running the model evaluation API (Optional)

Requirements

Setup

Run model validation API service

Test that it's working

Stop model validation API service

Running the validator with your own validation API service running locally

Model Evaluation Criteria

Model Size

Latency

Output Similarity

Vibe Matching

Acknowledgement

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Languages

Packages