-
Notifications
You must be signed in to change notification settings - Fork 144
chore: more accuracy test tweaks #675
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR refines accuracy test expectations to reduce false negatives by making assertions more flexible. The changes ensure that valid tool calls with optional or varying parameters are not incorrectly flagged as inaccurate.
Key changes:
- Updated test expectations to accept optional parameters like
limit,responseBytesLimit,sampleSize, andoperations - Added optional
list-databasesandlist-collectionscalls that models may make before main operations - Refined prompt wording to better elicit expected behavior (e.g., "exported COMPLETE list")
Reviewed Changes
Copilot reviewed 12 out of 12 changed files in this pull request and generated no comments.
Show a summary per file
| File | Description |
|---|---|
| tests/accuracy/find.test.ts | Added optional list calls helper and applied to all test cases; refined prompt wording |
| tests/accuracy/collectionSchema.test.ts | Added optional list-collections calls and flexible parameter matchers |
| tests/accuracy/export.test.ts | Made filter argument accept empty object or undefined |
| tests/accuracy/explain.test.ts | Added flexible matcher for responseBytesLimit parameter |
| tests/accuracy/logs.test.ts | Added flexible matcher for limit parameter |
| tests/accuracy/getPerformanceAdvisor.test.ts | Added flexible matcher for operations parameter array |
| tests/integration/tools/mongodb/read/find.test.ts | Changed string concatenation to template literal for consistency |
| tests/integration/tools/mongodb/read/aggregate.test.ts | Changed string concatenation to template literal for consistency |
| tests/integration/tools/mongodb/metadata/collectionSchema.test.ts | Whitespace-only change (no functional impact) |
| src/tools/mongodb/read/find.ts | Changed multi-line string to template literal for consistency |
| src/tools/mongodb/read/aggregate.ts | Changed multi-line string to template literal for consistency |
| src/tools/mongodb/metadata/collectionSchema.ts | Whitespace-only change (no functional impact) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM but would be good to trigger the accuracy test in the branch before merging!
|
Those changes don't fix all the false negatives, but I did run them locally and improved the accuracy of some tests. The annoying part is that those are quite unpredictable so we need multiple runs to identify the flakes. |
📊 Accuracy Test Results📈 Summary
📎 Download Full HTML Report - Look for the Report generated on: 10/23/2025, 12:44:24 PM |
Proposed changes
This tweaks a few more accuracy tests so that the expectations are more loosely defined and don't flag valid tool calls as inaccurate.