Data repository for our ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"
Official repository (for introduction and code): https://github.com/chujiezheng/LLM-Safeguard
If you find this repository useful or our work is related to your research, please cite our paper:
@inproceedings{
llm-safeguard,
title={On Prompt-Driven Safeguarding for Large Language Models},
author={Chujie Zheng and Fan Yin and Hao Zhou and Fandong Meng and Jie Zhou and Kai-Wei Chang and Minlie Huang and Nanyun Peng},
booktitle={International Conference on Machine Learning},
year={2024}
}
If you find the chat templates used in this project useful, please also cite:
@misc{zheng-2023-chat-templates,
author = {Zheng, Chujie},
title = {Chat Templates for HuggingFace Large Language Models},
year = {2023},
howpublished = {\url{https://github.com/chujiezheng/chat_templates}}
}