Is your feature request related to a problem? Please describe.
Currently, CounterfactualGenerator and AutoEval return a dictionary containing only metric values. In contrast, ToxicityMetrics and StereotypeMetrics give users the option to return response-level scores for a more detailed investigation into the assessment results. The request here is to give users the option to return analogous, response-level scores for CounterfactualGenerator, and return response-level toxicity, stereotype, and counterfactual scores with AutoEval.
Describe the solution you'd like
Give users the option to return response-level scores with CounterfactualGenerator and AutoEval in the returned dictionary. For consistency with ToxicityMetrics and StereotypeMetrics, this should be achieved by adding a new boolean argument called return_data to CounterfactualGenerator.evaluate and AutoEval.evaluate. The dictionary format returned should be consistent across these classes: a "metrics" key with metric values, and a "data" key with responses and response-level scores.
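A minimal sketch of what the requested return format could look like. The return_data flag and the "metrics"/"data" keys come from the request above; the specific field names inside "data" (e.g. "response", "score") and the toy metric are illustrative assumptions, not the library's actual implementation.

```python
def evaluate(responses, scores, return_data=False):
    """Compute aggregate metrics; optionally include response-level data.

    Hypothetical signature for illustration only -- the real
    CounterfactualGenerator.evaluate / AutoEval.evaluate signatures differ.
    """
    # Aggregate metric values (toy example: mean of response-level scores).
    metrics = {"mean_score": sum(scores) / len(scores)}
    result = {"metrics": metrics}
    if return_data:
        # Pair each response with its response-level score, mirroring the
        # proposed "data" key. Field names here are assumptions.
        result["data"] = [
            {"response": r, "score": s} for r, s in zip(responses, scores)
        ]
    return result


out = evaluate(["resp_a", "resp_b"], [0.1, 0.9], return_data=True)
# out["metrics"] holds aggregate values; out["data"] holds per-response scores.
```

With return_data=False (the default), only the "metrics" key would be present, preserving backward compatibility with the current dictionary-of-metrics return value.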
Additional context
Along with counterfactual.py and auto.py, the demo notebooks, unit tests, and README should also be updated.