
chore: add verification hierarchy guidance #238

Merged
marcusquinn merged 2 commits into main from chore/add-verification-guidance
Jan 27, 2026

@marcusquinn (Owner) commented Jan 27, 2026

Summary

  • Adds a universal verification principle to build.txt (applies to all agents)
  • Enriches Build+ step 9 (Reflect and Validate) with a prioritised verification hierarchy

Verification Hierarchy

  1. Run available tools (tests, linters, type checkers, build commands)
  2. Use browser tools to visually verify UI changes
  3. Check primary sources (official docs, API responses, git log)
  4. Review the output yourself and provide user experience commentary
  5. If none of the above give confidence, ask the user how to verify
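
The ordering above can be read as an escalation chain: try each level in turn and stop at the first one that yields a verdict. A minimal Python sketch of that reading follows. The names `VerificationStep` and `verify`, and the convention that a check returns `None` when it is unavailable or inconclusive, are illustrative assumptions and not part of build.txt or build-plus.md.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class VerificationStep:
    """One level of the hierarchy (hypothetical type, for illustration only)."""
    name: str
    # Returns True or False when the step reaches a verdict,
    # or None when it is unavailable or does not give confidence.
    check: Callable[[], Optional[bool]]


def verify(steps: List[VerificationStep]) -> Tuple[str, Optional[bool]]:
    """Try steps in priority order and return the first conclusive result."""
    for step in steps:
        verdict = step.check()
        if verdict is not None:
            return step.name, verdict
    # Level 5: nothing gave confidence, so escalate to the user.
    return "ask user", None


# Example: no test suite and no UI, but the official docs confirm the change.
steps = [
    VerificationStep("run tools", lambda: None),
    VerificationStep("browser", lambda: None),
    VerificationStep("primary sources", lambda: True),
    VerificationStep("self-review", lambda: True),
]
print(verify(steps))
```

Returning the name of the step alongside the verdict keeps a record of how the work was verified, which matters when the only conclusive level was self-review or when the chain falls through to the user.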

Motivation

Agents sometimes skip verification entirely or only verify through tests. This ensures they always find some way to confirm their work is correct, escalating to the user when automated options are uncertain.

Summary by CodeRabbit

  • Documentation
    • Added a "Verification hierarchy" checklist in the Reflect & Validate guidance, outlining five verification steps: run tools, verify UI in a browser, consult primary sources, review outputs with UX commentary, and prompt the user for further verification when needed.

Agents should always find a way to verify their work rather than
assuming correctness. Adds a prioritised verification hierarchy:
tools > browser > primary sources > self-review > ask user.

@gemini-code-assist

Summary of Changes

Hello @marcusquinn, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a standardized and prioritized approach to verification for agents. The changes aim to address instances where agents might skip verification or rely on limited methods, by providing a clear hierarchy of steps to confirm their work. This ensures a more robust and consistent quality assurance process across all agent outputs, ultimately leading to more reliable results.

Highlights

  • Universal Verification Principle: A new universal verification principle has been added to build.txt, which applies to all agents, ensuring they always seek to confirm their work's correctness.
  • Enhanced Verification Hierarchy: Build+ step 9, 'Reflect and Validate', has been enriched with a prioritized 5-step verification hierarchy, guiding agents through a structured process from automated tools to user consultation.

@coderabbitai (bot, Contributor) commented Jan 27, 2026

Caution

Review failed

The pull request is closed.

Walkthrough

Adds verification guidance to two agent configuration files: a five-step "Verification hierarchy" checklist in .agent/build-plus.md and a corresponding line in .agent/prompts/build.txt. No functional/code changes.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| Agent verification docs: .agent/build-plus.md, .agent/prompts/build.txt | Introduces a "Verification hierarchy" checklist: (1) run tools, (2) verify UI with browser, (3) consult primary sources, (4) review output with UX commentary, (5) consult user if further verification needed; adds prompt guidance to ask user how to verify when required. |

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Poem

✨ Five checks line up, a careful verification song,
Tools, browsers, sources — each step rights a wrong,
UX eyes review, then ask the user true,
A tidy path to trust in what we do. 🎯

🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately describes the main change: adding verification hierarchy guidance to the codebase as documented in both build.txt and build-plus.md. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |

@augmentcode (bot) commented Jan 27, 2026

🤖 Augment PR Summary

Summary: Adds agent-facing guidance to consistently verify work using a prioritized hierarchy.

Changes: Updates .agent/build-plus.md and .agent/prompts/build.txt to codify “always verify” and outline escalation steps from automated tools to user confirmation.

@augmentcode (bot) left a comment

Review completed. No suggestions at this time.

@gemini-code-assist (bot) left a comment

Code Review

This pull request introduces a verification hierarchy for the agent, adding it to the build-plus.md documentation and as a new principle in the build.txt prompt. The documentation change is clear and well-structured. However, the corresponding prompt added to build.txt is an incomplete and somewhat inconsistent summary of the full hierarchy. I've suggested a more concise, yet complete, version for the prompt to ensure the agent has the full context and follows the intended verification process.

- Do not add code comments unless asked. Focus on why, not what.
- After code changes, run lint/typecheck commands if available (check README or configs).
- When making changes, read surrounding context first to ensure idiomatic integration.
- Always find a way to verify your work. Prefer in order: run tests/linters, use browser tools to check UI, read primary sources (docs, APIs), or review the output yourself. If none of these give confidence, ask the user how to verify.

medium

This summary of the verification hierarchy is inconsistent with the full version documented in .agent/build-plus.md and the PR description. It omits key verification tools like 'type checkers' and 'build commands', as well as 'git log' as a primary source. Additionally, the mention of 'tests/linters' is slightly redundant and inconsistent with line 106 which specifies 'lint/typecheck'. To ensure the agent has clear and complete instructions, consider a more structured and comprehensive summary that aligns with the documented hierarchy.

- Always verify your work. Hierarchy: 1. Tools (tests, lint, typecheck, build). 2. Browser UI. 3. Docs/APIs/git-log. 4. Self-review. 5. Ask user.
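
The gap this comment describes, a one-line summary drifting from the documented hierarchy, can be checked mechanically. A rough Python sketch, assuming a hand-picked keyword list per level: `REQUIRED_TERMS` and `missing_terms` are hypothetical names, the keywords are a guess at what "complete" means here, and the check is a plain substring match rather than anything the repository actually runs.

```python
from typing import Dict, List

# Hypothetical keywords per hierarchy level, taken from the wording
# of the five-step list in the PR description.
REQUIRED_TERMS: Dict[str, List[str]] = {
    "tools": ["test", "lint", "typecheck", "build"],
    "browser": ["browser"],
    "primary sources": ["docs", "api", "git"],
    "self-review": ["review"],
    "ask user": ["ask"],
}


def missing_terms(prompt_line: str) -> Dict[str, List[str]]:
    """Return, per level, the keywords the prompt line fails to mention."""
    lowered = prompt_line.lower()
    gaps: Dict[str, List[str]] = {}
    for level, terms in REQUIRED_TERMS.items():
        absent = [term for term in terms if term not in lowered]
        if absent:
            gaps[level] = absent
    return gaps


original = (
    "Always find a way to verify your work. Prefer in order: run tests/linters, "
    "use browser tools to check UI, read primary sources (docs, APIs), or review "
    "the output yourself. If none of these give confidence, ask the user how to verify."
)
suggested = (
    "Always verify your work. Hierarchy: 1. Tools (tests, lint, typecheck, build). "
    "2. Browser UI. 3. Docs/APIs/git-log. 4. Self-review. 5. Ask user."
)

print(missing_terms(original))
print(missing_terms(suggested))
```

Under these assumed keywords, the original line is flagged for the same three omissions named above (type checking, build commands, git log), and the suggested line is not flagged at all.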

@github-actions

🔍 Code Quality Report

[MONITOR] Code Review Monitoring Report

[INFO] Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 457 code smells

[INFO] Recent monitoring activity:
Tue Jan 27 01:58:11 UTC 2026: Code review monitoring started
Tue Jan 27 01:58:12 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 457
Tue Jan 27 01:58:12 UTC 2026: Qlty - 0 issues found, auto-formatting applied
Tue Jan 27 01:58:14 UTC 2026: Codacy analysis completed with auto-fixes

📈 Current Quality Metrics

  • BUGS: 0
  • CODE SMELLS: 457
  • VULNERABILITIES: 0

Generated on: Tue Jan 27 01:59:38 UTC 2026


Generated by AI DevOps Framework Code Review Monitoring

Address Gemini review feedback: include type checkers, build commands,
and git log to match the documented hierarchy in build-plus.md.

@sonarqubecloud

@github-actions

🔍 Code Quality Report

[MONITOR] Code Review Monitoring Report

[INFO] Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 457 code smells

[INFO] Recent monitoring activity:
Tue Jan 27 02:01:17 UTC 2026: Code review monitoring started
Tue Jan 27 02:01:17 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 457
Tue Jan 27 02:01:17 UTC 2026: Qlty - 0 issues found, auto-formatting applied
Tue Jan 27 02:01:19 UTC 2026: Codacy analysis completed with auto-fixes

📈 Current Quality Metrics

  • BUGS: 0
  • CODE SMELLS: 457
  • VULNERABILITIES: 0

Generated on: Tue Jan 27 02:02:47 UTC 2026


Generated by AI DevOps Framework Code Review Monitoring

@marcusquinn marcusquinn merged commit d98de39 into main Jan 27, 2026
8 of 9 checks passed
