Welcome to the Detox CoT repository! Our mission is to replicate state-of-the-art detoxification research in Large Language Models (LLMs), with a specific focus on Llama. We employ chain-of-thought prompt engineering to enhance performance and surpass benchmarks set by academia.
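To make the approach concrete, here is a minimal sketch of what a chain-of-thought detoxification prompt can look like. The wording and the `build_cot_detox_prompt` helper are illustrative assumptions for this README, not the exact template used in this repository.

```python
# Illustrative chain-of-thought detoxification prompt (NOT the repo's exact
# template): the model is asked to reason about how a continuation could
# become toxic before writing a safer one.
def build_cot_detox_prompt(prompt_text: str) -> str:
    return (
        "You are continuing the following text:\n"
        f"{prompt_text}\n\n"
        "First, think step by step about how a continuation could become toxic "
        "(insults, threats, profanity, identity attacks).\n"
        "Then write a fluent, non-toxic continuation that avoids those issues.\n"
        "Reasoning:"
    )

print(build_cot_detox_prompt("The new neighbors are"))
```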
Before running the script, ensure you have the following:
- Download and set up the Llama 2 models from the Llama 2 recipes repository.
- Clone this repository in a conda environment with PyTorch and CUDA available.
- Register for Google's Perspective API (see the scoring sketch after this list).
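Once you have a Perspective API key, model outputs can be scored for toxicity roughly as follows. This is a minimal sketch, assuming the `google-api-python-client` package and a key exported as `PERSPECTIVE_API_KEY`; it is not the repository's evaluation script.

```python
import os
from googleapiclient import discovery

# Build a Perspective API client (assumes the key is exported as
# PERSPECTIVE_API_KEY; adjust to however you store credentials).
client = discovery.build(
    "commentanalyzer",
    "v1alpha1",
    developerKey=os.environ["PERSPECTIVE_API_KEY"],
    discoveryServiceUrl="https://commentanalyzer.googleapis.com/$discovery/rest?version=v1alpha1",
    static_discovery=False,
)

def toxicity_score(text: str) -> float:
    """Return Perspective's TOXICITY summary score (0 = benign, 1 = toxic)."""
    request = {
        "comment": {"text": text},
        "requestedAttributes": {"TOXICITY": {}},
    }
    response = client.comments().analyze(body=request).execute()
    return response["attributeScores"]["TOXICITY"]["summaryScore"]["value"]

print(toxicity_score("Have a wonderful day!"))
```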
Note
- Our experiments were conducted on 8 A100 GPUs with 40GB of memory each for fine-tuning and data generation.
- Access the complete dataset, including the fine-tuning and evaluation data, from AI2's RealToxicityPrompts (a loading sketch follows these notes).
- Check out our presentation slide deck summarising our research experience: https://docs.google.com/presentation/d/1AUWSJLxICEr5BcfJ1b5wLceU3S-hWQKmUHckSc-Vp-k/edit?usp=sharing
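For reference, the prompts can also be pulled from the Hugging Face mirror of RealToxicityPrompts. This is a sketch assuming the `datasets` package and the `allenai/real-toxicity-prompts` dataset id; the fields and filtering may differ from the fine-tuning and evaluation splits used in this repo.

```python
# Load the Hugging Face mirror of AI2's RealToxicityPrompts.
from datasets import load_dataset

ds = load_dataset("allenai/real-toxicity-prompts", split="train")

example = ds[0]
print(example["prompt"]["text"])      # prompt text to be continued by the model
print(example["prompt"]["toxicity"])  # Perspective toxicity score of the prompt
```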