You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[`intent_eval`](src/lightspeed_evaluation/core/metrics/custom.py) - Evaluates whether the response demonstrates the expected intent or purpose
89
+
-[`keywords_eval`](src/lightspeed_evaluation/core/metrics/custom/keywords_eval.py) - Keywords evaluation with alternatives (ALL keywords must match, case insensitive)
89
90
- Tool Evaluation
90
91
-[`tool_eval`](src/lightspeed_evaluation/core/metrics/custom.py) - Validates tool calls and arguments with regex pattern matching
91
92
-**Script-based**
@@ -149,6 +150,10 @@ metrics_metadata:
149
150
150
151
"custom:tool_eval":
151
152
description: "Tool call evaluation comparing expected vs actual tool calls (regex for arguments)"
- OpenShift Virtualization is an extension of the OpenShift ...
228
233
attachments: [] # Attachments (Optional)
234
+
expected_keywords: [["virtualization"], ["openshift"]] # For keywords_eval evaluation
229
235
expected_response: OpenShift Virtualization is an extension of the OpenShift Container Platform that allows running virtual machines alongside containers
230
236
expected_intent: "explain a concept"# Expected intent for intent evaluation
0 commit comments