Add fuzzy filtering to the evaluation reports #6262

peterwald · 2025-04-08T21:02:59Z

The filter is OR'd with the tag filter, and is based on a fuzzy search of the scenario name, iteration name, and chat contents.

Microsoft Reviewers: Open in CodeFlow

shyamnamboodiripad

The UX looks great. The only other feedback I have is that it would be nice to be able to search for text within metrics, diagnostics and metric metadata etc. This would enable scenarios like "which scenarios contain an evaluation result for metric xyz", "which other scenarios failed because of the same exception" etc.

There are also some more advanced cases that would likely need some form of search language / natural language search support. Examples include "filter down to all scenarios where metric xyz has value < 3", "which scenarios contain code vulnerability metric where code injection attack was detected in the metric metadata". We could wait for more feedback around the more advanced cases.

peterwald · 2025-04-09T20:02:41Z

The UX looks great. The only other feedback I have is that it would be nice to be able to search for text within metrics, diagnostics and metric metadata etc. This would enable scenarios like "which scenarios contain an evaluation result for metric xyz", "which other scenarios failed because of the same exception" etc.

There are also some more advanced cases that would likely need some form of search language / natural language search support. Examples include "filter down to all scenarios where metric xyz has value < 3", "which scenarios contain code vulnerability metric where code injection attack was detected in the metric metadata". We could wait for more feedback around the more advanced cases.

Agree that functionality would be valuable. This is intended to be an initial first step. We'll probably need to create a more advanced query function to address those more complex scenarios.

Add filtering to the evaluation reports.

ffe3a7a

peterwald requested a review from a team as a code owner April 8, 2025 21:03

peterwald self-assigned this Apr 8, 2025

peterwald added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Apr 8, 2025

peterwald added 2 commits April 9, 2025 13:56

Convert to uFuzzy library.

4b6692d

Clean up unneeded import.

4736700

peterwald requested a review from shyamnamboodiripad April 9, 2025 19:21

shyamnamboodiripad approved these changes Apr 9, 2025

View reviewed changes

peterwald merged commit a3baf04 into dotnet:main Apr 9, 2025
6 checks passed

github-actions bot locked and limited conversation to collaborators May 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add fuzzy filtering to the evaluation reports #6262

Add fuzzy filtering to the evaluation reports #6262

Uh oh!

peterwald commented Apr 8, 2025 •

edited by dotnet-policy-service bot

Loading

Uh oh!

shyamnamboodiripad left a comment

Uh oh!

peterwald commented Apr 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add fuzzy filtering to the evaluation reports #6262

Add fuzzy filtering to the evaluation reports #6262

Uh oh!

Conversation

peterwald commented Apr 8, 2025 • edited by dotnet-policy-service bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Microsoft Reviewers: Open in CodeFlow

Uh oh!

shyamnamboodiripad left a comment

Choose a reason for hiding this comment

Uh oh!

peterwald commented Apr 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

peterwald commented Apr 8, 2025 •

edited by dotnet-policy-service bot

Loading