LLM-Safeguard_data

Data repository for our ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"

Official repository (for introduction and code): https://github.com/chujiezheng/LLM-Safeguard

If you find this repository useful or our work is related to your research, please cite our paper:

@inproceedings{llm-safeguard,
  title={On Prompt-Driven Safeguarding for Large Language Models},
  author={Chujie Zheng and Fan Yin and Hao Zhou and Fandong Meng and Jie Zhou and Kai-Wei Chang and Minlie Huang and Nanyun Peng},
  booktitle={International Conference on Machine Learning},
  year={2024}
}

If you find the chat templates used in this project useful, please also cite:

@misc{zheng-2023-chat-templates,
  author = {Zheng, Chujie},
  title = {Chat Templates for HuggingFace Large Language Models},
  year = {2023},
  howpublished = {\url{https://github.com/chujiezheng/chat_templates}}
}

Folders and files:

data
data_alpaca
data_harmless
eval_results/sampling
eval_results_alpaca/sampling
eval_results_harmless/sampling
outputs_alpaca
trained_prompts
trained_prompts_unlikelihood
README.md