forked from Azure/PyRIT

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems.


rdheekonda/PyRIT

This branch is 3 commits ahead of, 210 commits behind Azure/PyRIT:main.


Python Risk Identification Tool for generative AI (PyRIT)

The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.

Introduction

PyRIT is a library developed by the AI Red Team for researchers and engineers to help them assess the robustness of their LLM endpoints against different harm categories such as fabrication/ungrounded content (e.g., hallucination), misuse (e.g., bias), and prohibited content (e.g., harassment).

PyRIT automates routine AI red teaming tasks so that operators can focus on more complex and time-consuming work. It can also identify security harms such as misuse (e.g., malware generation, jailbreaking) and privacy harms (e.g., identity theft).
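The loop PyRIT automates — sending attack prompts to a target endpoint and scoring each response against a harm category — can be sketched in plain Python. All names below are illustrative stand-ins, not PyRIT's actual classes or API:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class ScoredResponse:
    prompt: str
    response: str
    harm_category: str
    flagged: bool  # True if the scorer judged the response harmful

def red_team_target(
    target: Callable[[str], str],   # any callable mapping prompt -> completion
    prompts: list[str],             # seed attack prompts for one harm category
    scorer: Callable[[str], bool],  # returns True if a response is harmful
    harm_category: str,
) -> list[ScoredResponse]:
    """Send each prompt to the target and score the response."""
    results = []
    for prompt in prompts:
        response = target(prompt)
        results.append(
            ScoredResponse(prompt, response, harm_category, scorer(response))
        )
    return results

# Toy target and scorer standing in for a real LLM endpoint and classifier.
def toy_target(prompt: str) -> str:
    return "REFUSED" if "please" in prompt else "UNSAFE OUTPUT"

results = red_team_target(
    target=toy_target,
    prompts=["please do X", "do X"],
    scorer=lambda r: "UNSAFE" in r,
    harm_category="misuse",
)
flagged = [r for r in results if r.flagged]
print(len(flagged))  # prints 1: one prompt elicited harmful output
```

In PyRIT itself, the target, the attack strategy, and the scorer are pluggable components, which is what lets one harness run across many harm categories and endpoints.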

The goal is to give researchers a baseline of how well their model and entire inference pipeline perform against different harm categories, and to let them compare that baseline against future iterations of the model. This yields empirical data on how the model performs today and makes it possible to detect performance regressions introduced by future changes.
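That baseline-versus-iteration comparison amounts to tracking a per-harm-category failure rate across model versions. A minimal sketch (hypothetical helper names, not PyRIT's API):

```python
from collections import defaultdict

def failure_rates(scored: list[tuple[str, bool]]) -> dict[str, float]:
    """Per-category fraction of flagged responses.

    `scored` holds (harm_category, flagged) pairs from one red-teaming run.
    """
    totals: dict[str, int] = defaultdict(int)
    flagged: dict[str, int] = defaultdict(int)
    for category, is_flagged in scored:
        totals[category] += 1
        flagged[category] += is_flagged
    return {c: flagged[c] / totals[c] for c in totals}

def regressions(baseline: dict[str, float], candidate: dict[str, float],
                tolerance: float = 0.0) -> dict[str, float]:
    """Categories where the candidate model fails more often than the baseline."""
    return {c: candidate[c] - baseline[c]
            for c in baseline
            if c in candidate and candidate[c] - baseline[c] > tolerance}

# Baseline run: misuse fails 1/2, fabrication 0/2.
baseline = failure_rates([("misuse", True), ("misuse", False),
                          ("fabrication", False), ("fabrication", False)])
# Candidate run: misuse fails 2/2, fabrication 1/2 — both regressed.
candidate = failure_rates([("misuse", True), ("misuse", True),
                           ("fabrication", False), ("fabrication", True)])
print(regressions(baseline, candidate))  # {'misuse': 0.5, 'fabrication': 0.5}
```

In practice the scored pairs would come from repeated runs of the same seed-prompt set against each model version, so the rates are comparable across iterations.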

Additionally, this tool allows researchers to iterate on and improve their mitigations against different harms. For example, at Microsoft we use this tool to iterate on different versions of a product (and its metaprompt) so that we can more effectively protect against prompt injection attacks.

PyRIT architecture

Where can I learn more?

Microsoft Learn has a dedicated page on AI Red Teaming.

Check out our docs for information on how to install PyRIT, our How to Guide, our demos, and more.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos is subject to those third parties' policies.
