AI Safety evaluations (with AI Project provisioning) #2370

pamelafox · 2025-02-20T01:11:51Z

Purpose

This PR uses the Azure AI evaluation SDK to simulate adversarial users and evaluate the results. I intentionally do not store the simulation results in the repo due to their often disturbing question content, and I only store the overall safety results.

Our baseline RAG app achieves 100% safety (all scores are "Low" or "Very low") in the 200 simulations that I ran. Yay!

Does this introduce a breaking change?

When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.

[ ] Yes
[X] No

Does this require changes to learn.microsoft.com docs?

This repository is referenced by this tutorial
which includes deployment, settings and usage instructions. If text or screenshot need to change in the tutorial,
check the box below and notify the tutorial author. A Microsoft employee can do this for you if you're an external contributor.

[ ] Yes
[X] No

Type of change

[ ] Bugfix
[X] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

Code quality checklist

See CONTRIBUTING.md for more details.

The current tests all pass (python -m pytest).
I added tests that prove my fix is effective or that my feature works
I ran python -m pytest --cov to verify 100% coverage of added lines
I ran python -m mypy to check for type errors
I either used the pre-commit hooks or ran ruff and black manually on my code.

Copilot

Copilot reviewed 13 out of 13 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (1)

evals/safety_evaluation.py:123

This division operation could raise a ZeroDivisionError if summary_scores[evaluator]['low_count'] is zero. Consider adding a check to handle a zero denominator or use an alternative calculation that avoids division by zero.

summary_scores[evaluator]["mean_score"] = summary_scores[evaluator]["score_total"] / summary_scores[evaluator]["low_count"]

evals/safety_evaluation.py

Co-authored-by: Copilot <[email protected]>

evals/safety_evaluation.py

pamelafox added 5 commits February 19, 2025 13:23

First attempt with infra

bfa8054

Evaluate the simulated users

87bfe85

Revert launch.json changes

dd96b53

Remove unneeded infra

ddd5dec

Add links and progress tracking

f4d7026

pamelafox marked this pull request as ready for review February 20, 2025 17:54

pamelafox changed the title ~~WIP: AI Safety evaluations~~ AI Safety evaluations (with AI Project provisioning) Feb 20, 2025

pamelafox requested review from Copilot and mattgotteiner February 20, 2025 17:54

Copilot AI reviewed Feb 20, 2025

View reviewed changes

evals/safety_evaluation.py Outdated Show resolved Hide resolved

mattgotteiner reviewed Feb 20, 2025

View reviewed changes

evals/safety_evaluation.py Show resolved Hide resolved

Update evals/safety_evaluation.py

279c4a6

Co-authored-by: Copilot <[email protected]>

mattgotteiner reviewed Feb 20, 2025

View reviewed changes

evals/safety_evaluation.py Show resolved Hide resolved

mattgotteiner reviewed Feb 20, 2025

View reviewed changes

evals/safety_evaluation.py Outdated Show resolved Hide resolved

mattgotteiner reviewed Feb 20, 2025

View reviewed changes

evals/safety_evaluation.py Outdated Show resolved Hide resolved

mattgotteiner approved these changes Feb 20, 2025

View reviewed changes

pamelafox added 3 commits February 20, 2025 10:31

Reword arg, add comment on time needed

87c1f8d

Use the enum

3eca0c5

Fix one more zerodiv error

cc08121

pamelafox merged commit 31ea846 into Azure-Samples:main Feb 20, 2025
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AI Safety evaluations (with AI Project provisioning) #2370

AI Safety evaluations (with AI Project provisioning) #2370

pamelafox commented Feb 20, 2025 •

edited

Loading

Copilot AI left a comment

AI Safety evaluations (with AI Project provisioning) #2370

AI Safety evaluations (with AI Project provisioning) #2370

Conversation

pamelafox commented Feb 20, 2025 • edited Loading

Purpose

Does this introduce a breaking change?

Does this require changes to learn.microsoft.com docs?

Type of change

Code quality checklist

Copilot AI left a comment

Choose a reason for hiding this comment

pamelafox commented Feb 20, 2025 •

edited

Loading