-
Notifications
You must be signed in to change notification settings - Fork 22
Improve cluster id/name handling #188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
@@ -129,14 +129,14 @@ | |||||||||||||||||||||
| - conversation_group: cluster_info_conv | ||||||||||||||||||||||
| conversation: | ||||||||||||||||||||||
| - eval_id: cluster_info_tool_call | ||||||||||||||||||||||
| eval_query: Give me details about cluster abc123 | ||||||||||||||||||||||
| eval_query: Give me details about cluster named abc123 | ||||||||||||||||||||||
| eval_types: [tool_eval, response_eval:sub-string] | ||||||||||||||||||||||
| expected_tool_calls: | ||||||||||||||||||||||
| - - tool_name: cluster_info | ||||||||||||||||||||||
| arguments: | ||||||||||||||||||||||
| cluster_id: "abc123" | ||||||||||||||||||||||
| expected_keywords: ["cluster", "abc123", "error", "not found"] | ||||||||||||||||||||||
| description: Test error handling for non-existent cluster ID | ||||||||||||||||||||||
| # It should list the clusters to try to match up the name | ||||||||||||||||||||||
| - - tool_name: list_clusters | ||||||||||||||||||||||
| arguments: {} | ||||||||||||||||||||||
| expected_keywords: ["cluster", "abc123"] | ||||||||||||||||||||||
| description: Test error handling for non-existent cluster ID/Name | ||||||||||||||||||||||
|
|
||||||||||||||||||||||
| - conversation_group: error_handling_conv | ||||||||||||||||||||||
| description: Validate graceful handling of invalid SSH key format | ||||||||||||||||||||||
|
|
@@ -168,4 +168,22 @@ | |||||||||||||||||||||
| eval_query: Which platforms can you help me install on? | ||||||||||||||||||||||
| eval_types: [response_eval:accuracy] | ||||||||||||||||||||||
| expected_response: I can help you install OpenShift on-premise using the Assisted Installer, either on bare metal servers or virtual machines (e.g., vSphere, KVM, libvirt). I do not support public cloud platforms like AWS, Azure, or GCP. | ||||||||||||||||||||||
| description: Ensure the assistant concisely states supported on-prem baremetal/VM scope and references Assisted Installer | ||||||||||||||||||||||
| description: Ensure the assistant concisely states supported on-prem baremetal/VM scope and references Assisted Installer | ||||||||||||||||||||||
|
|
||||||||||||||||||||||
| - conversation_group: cluster_id_from_name | ||||||||||||||||||||||
| conversation: | ||||||||||||||||||||||
| - eval_id: create_single_node_cluser | ||||||||||||||||||||||
| eval_query: Create a multi-node cluster named 'eval-test-cluster-name' with OpenShift 4.18.22 and domain test.local. I do not have an SSH key to provide. | ||||||||||||||||||||||
| eval_types: [response_eval:sub-string] | ||||||||||||||||||||||
| expected_keywords: ["cluster", "eval-test-cluster-name", "cluster ID"] | ||||||||||||||||||||||
| - eval_id: cluster_name_tool_call | ||||||||||||||||||||||
|
Comment on lines
+175
to
+179
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Fix eval_id typo and mismatch with scenario.
- - eval_id: create_single_node_cluser
+ - eval_id: create_multinode_cluster📝 Committable suggestion
Suggested change
🤖 Prompt for AI Agents |
||||||||||||||||||||||
| eval_query: Show me information on cluster eval-test-cluster-name | ||||||||||||||||||||||
|
coderabbitai[bot] marked this conversation as resolved.
|
||||||||||||||||||||||
| eval_types: [tool_eval, response_eval:sub-string] | ||||||||||||||||||||||
| expected_tool_calls: | ||||||||||||||||||||||
| - - tool_name: list_clusters | ||||||||||||||||||||||
| arguments: {} | ||||||||||||||||||||||
| - - tool_name: cluster_info | ||||||||||||||||||||||
| arguments: | ||||||||||||||||||||||
| cluster_id: "[a-f0-9]{8}-[a-f0-9]{4}-[a-f0-9]{4}-[a-f0-9]{4}-[a-f0-9]{12}" | ||||||||||||||||||||||
| expected_keywords: ["cluster", "eval-test-cluster-name", "test.local", "4.18.22"] | ||||||||||||||||||||||
| description: Test handling requesting a cluster by name | ||||||||||||||||||||||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🛠️ Refactor suggestion
Strengthen “not found” assertion for name/ID miss.
As written, a hallucinated success mentioning “cluster abc123” could pass. Tighten the negative-case by requiring an explicit “not found” (or equivalent) indicator in the response.
Apply:
🤖 Prompt for AI Agents