Skip to content

Conversation

@marcusquinn
Copy link
Owner

@marcusquinn marcusquinn commented Feb 5, 2026

Summary

Comprehensive audit of TODO.md and PLANS.md to fix conflicts, update stale briefs, and add new plans.

Changes

Bug Fixes

  • Fix git merge conflict in TOON block (lines 294-328) - renumbered colliding task IDs (t109-t111 → t121-t123)
  • Resolve t104 ID collision - Tirith renumbered to t124 with cross-reference to t104 plan

Task Brief Updates

  • t024 (Dexter): Added GitHub URL, 10.6k stars, evaluation criteria
  • t029 (@penberg): Rewrote to focus on Weave project (deterministic execution for AI agents)
  • t030 (@irl_danB): Rewrote to focus on progressive-memory and clawdbot projects
  • t031 (Company orchestration): Added scope extending t109 parallel agents
  • t074 (DocStrange): Added 1.3k stars, built-in MCP server, local GPU processing
  • t080 (Voice): Updated with GPU reality (Nemotron cloud-only), MiniCPM-o + GPT-4o-Realtime
  • t103 (Pi agent review): Confirmed done with summary notes

Declined

  • t099 (Neural-Chromium): Declined - 4 stars, Windows-only, no releases
  • Removed .agent/tools/browser/neural-chromium.md subagent (266 lines)
  • Removed Neural-Chromium from browser-automation.md decision tree and quick reference

New Tasks

  • t125: Add browser-use subagent (77.8k stars, MIT) - most popular AI browser automation
  • t126: Add Skyvern subagent (20.3k stars, AGPL-3.0) - computer vision browser automation

New Plans (PLANS.md)

  • p019: Voice Integration Pipeline (6 phases: Transcription → TTS/STT → Local Pipecat → Cloud S2S → OpenCode Bridge → Shortcuts)
  • p020: SEO Tool Subagents Sprint (3 phases batching t083-t094, with corrections for deprecated/unavailable APIs)

Housekeeping

  • Updated TOON summary counts (183 total, 97 pending, 84 done, 2 declined)
  • Updated PLANS.md active_plans TOON block (12 → 14 plans)

Summary by CodeRabbit

Release Notes

  • Documentation

    • Removed experimental Neural-Chromium agent-native browser automation feature documentation and all related references from tool selection guides.
  • Chores

    • Reorganized project backlog with consolidated task entries and improved task structure.
    • Introduced SEO Tool Subagents Sprint as new active initiative to project roadmap.

- Fix git merge conflict in TOON block (renumber t109-t111 to t121-t123)
- Resolve t104 ID collision (Tirith renumbered to t124)
- Update stale research briefs (t024 Dexter, t029 Weave, t030 progressive-memory, t031 company orchestration)
- Decline t099 Neural-Chromium (4 stars, Windows-only) and remove subagent
- Remove Neural-Chromium from browser-automation.md decision tree
- Add t125 browser-use (77.8k stars) and t126 Skyvern (20.3k stars) as replacements
- Update t074 DocStrange brief (1.3k stars, built-in MCP server)
- Update t080 voice brief (GPU reality, MiniCPM-o + GPT-4o-Realtime)
- Confirm t103 Pi agent review as done
- Add Voice Integration Pipeline plan (p019) to PLANS.md
- Add SEO Tool Subagents Sprint plan (p020) to PLANS.md
- Update TOON summary counts (183 total, 97 pending, 84 done, 2 declined)
@coderabbitai
Copy link
Contributor

coderabbitai bot commented Feb 5, 2026

Walkthrough

The pull request removes experimental Neural-Chromium browser automation references from documentation and quick-reference tables, resolves merge conflicts in the TODO backlog with task reconciliation, and adds a new SEO Tool Subagents Sprint plan entry to the active planning roster.

Changes

Cohort / File(s) Summary
Browser Tool Documentation Cleanup
.agent/tools/browser/browser-automation.md, .agent/tools/browser/neural-chromium.md
Removes experimental Neural-Chromium references from the Tool Selection tree and Quick Reference table; deletes entire neural-chromium.md documentation file (265 lines).
Backlog & Planning Updates
TODO.md, todo/PLANS.md
Resolves merge-conflict markers in TODO.md with task reconciliation (t099–t126 blocks reorganized); adds new SEO Tool Subagents Sprint plan (p020) with duplicated content block in PLANS.md (+291 lines).

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

Poem

🧹 Neural-Chromium fades to dust,
Merge conflicts resolved—we must!
New SEO dreams take their flight,
Backlog trimmed, our docs feel light. ✨

🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title accurately summarizes the main objective—auditing and improving TODO.md and PLANS.md quality—and aligns with the changeset's primary focus on conflict resolution, task updates, and plan additions.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch chore/todo-audit-improvements

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@gemini-code-assist
Copy link

Summary of Changes

Hello @marcusquinn, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request represents a significant audit and improvement of the project's TODO.md and PLANS.md files. The primary goal was to enhance the quality, accuracy, and clarity of task and plan documentation. This involved resolving merge conflicts, updating detailed briefs for various AI agent and tool evaluations, declining outdated or unviable tasks, and introducing new strategic tasks and plans to guide future development. The changes ensure that the project's roadmap is current and well-defined.

Highlights

  • Bug Fixes: Resolved merge conflicts by renumbering tasks t109-t111 to t121-t123 and task t104 to t124 in TODO.md.
  • Task Brief Updates: Enhanced descriptions and evaluation criteria for existing tasks including Dexter (t024), Weave (t029), progressive-memory/clawdbot (t030), Company orchestration (t031), DocStrange (t074), and Voice agents (t080). Task t103 (Pi agent review) was marked as completed with summary notes.
  • Task Declination and Removal: Task t099 (Neural-Chromium) was declined due to its experimental, Windows-only nature and low adoption, leading to the removal of its dedicated subagent file and references in browser-automation.md.
  • New Task Additions: Introduced new tasks t125 (browser-use subagent) and t126 (Skyvern subagent) to explore more mature and cross-platform AI browser automation tools.
  • New Strategic Plans: Added two new plans: p019 for a comprehensive Voice Integration Pipeline and p020 for an SEO Tool Subagents Sprint, outlining multi-phase development for each.
  • Housekeeping: Updated the TOON summary counts in TODO.md (total, pending, done, declined) and the active plans count in PLANS.md.
Changelog
  • .agent/tools/browser/browser-automation.md
    • Removed the "EXPERIMENTAL (agent-native browser, VLM vision)" section and the "Neural-Chromium" entry from the quick reference table.
  • .agent/tools/browser/neural-chromium.md
    • This file, detailing the Neural-Chromium subagent, was entirely removed.
  • TODO.md
    • Renumbered tasks t109, t110, t111 to t121, t122, t123 respectively to resolve a merge conflict.
    • Updated notes for t024 (Dexter) to include GitHub URL, star count, and evaluation criteria.
    • Rewrote notes for t029 (Penberg) to focus on the Weave project and its evaluation for reproducibility.
    • Rewrote notes for t030 (irl_danB) to focus on progressive-memory and clawdbot projects.
    • Added notes to t031 (Company orchestration) extending its scope and linking to t109.
    • Updated notes for t074 (DocStrange) to reflect its current features like built-in MCP server and local GPU processing.
    • Updated notes for t080 (Voice) to include details on GPT-4o-Realtime, MiniCPM-o, and Nemotron, reflecting GPU realities.
    • Marked t103 (Pi agent review) as completed and updated its notes with key findings.
    • Marked t099 (Neural-Chromium) as declined with a detailed rationale and suggested alternatives.
    • Added new tasks t125 (browser-use subagent) and t126 (Skyvern subagent) with detailed notes.
    • Renumbered t104 to t124 to resolve an ID collision and updated its notes.
    • Updated the TOON summary counts at the end of the file to reflect the new totals (183 total, 97 pending, 84 done, 2 declined).
    • Resolved git merge conflict markers in the TOON block.
  • todo/PLANS.md
    • Updated the TOON:active_plans count from 12 to 14.
    • Added a new plan p019: Voice Integration Pipeline with a detailed purpose, context, 6 phases, milestones, decision log, and discoveries.
    • Added a new plan p020: SEO Tool Subagents Sprint with a detailed purpose, context including corrections for deprecated APIs, 3 phases, milestones, and decision log.
Activity
  • The author performed a comprehensive audit of the TODO.md and PLANS.md files.
  • Merge conflicts within the TOON blocks were identified and resolved.
  • Numerous task descriptions were updated to reflect current information, project status, or refined scope.
  • An outdated task (Neural-Chromium) was formally declined and removed.
  • New tasks and strategic plans were introduced to expand the project's future development roadmap.
  • Overall task and plan summary counts were meticulously updated.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@github-actions
Copy link

github-actions bot commented Feb 5, 2026

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 3 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 5 23:55:18 UTC 2026: Code review monitoring started
Thu Feb 5 23:55:18 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 3
Thu Feb 5 23:55:19 UTC 2026: Qlty - 0 issues found, auto-formatting applied
Thu Feb 5 23:55:21 UTC 2026: Codacy analysis completed with auto-fixes

📈 Current Quality Metrics

  • BUGS: 0
  • CODE SMELLS: 3
  • VULNERABILITIES: 0

Generated on: Thu Feb 5 23:55:23 UTC 2026


Generated by AI DevOps Framework Code Review Monitoring

@sonarqubecloud
Copy link

sonarqubecloud bot commented Feb 5, 2026

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)
todo/PLANS.md (1)

1974-1982: ⚠️ Potential issue | 🟠 Major

Fix duplicate plan ID and align active_plans index.

p016 is already used for “Install Script Integrity Hardening”; reusing it for “Parallel Agents & Headless Dispatch” will break TOON uniqueness. Also ensure this plan is listed in the active_plans TOON block if it’s meant to be active.

TODO.md (1)

76-77: ⚠️ Potential issue | 🟠 Major

Sync human vs TOON status for t004.

t004 is marked done in the human list but still pending in the TOON backlog block, which will desync automation and tooling. Please update the TOON entry to match.

Also applies to: 347-348

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request provides a comprehensive audit and update of the TODO.md and PLANS.md files. The changes are well-documented in the PR description and align with the project's goals of improving task tracking and planning. I've identified a few minor inconsistencies and redundancies in the markdown files that could be addressed to improve clarity. Overall, this is a great housekeeping effort that improves the quality of the project's documentation.

Comment on lines +219 to 220
- Notes: Review complete. See todo/research/pi-agent-review.md. Key findings: Pi's minimal 4-tool core validates aidevops's on-demand MCP loading. Session trees (branching/rewinding) are interesting but require agent-level support. Recommended: document "remix" skill pattern, add desktop notification pattern for long tasks.
- Notes: Review complete. See todo/research/pi-agent-review.md. Key findings: Pi's minimal 4-tool core validates aidevops's on-demand MCP loading. Session trees (branching/rewinding) are the most interesting feature aidevops lacks but can't implement without agent-level support. Extension hot-reload is powerful but different architecture from aidevops's markdown subagents. Recommended: document "remix" skill pattern in build-agent.md, add desktop notification pattern for long tasks. Skip: removing MCP, rewriting in TypeScript.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

There appear to be duplicate notes for task t103. The first note on line 219 is a subset of the more detailed note on line 220. To improve clarity and avoid redundancy, consider keeping only the more comprehensive note.

Suggested change
- Notes: Review complete. See todo/research/pi-agent-review.md. Key findings: Pi's minimal 4-tool core validates aidevops's on-demand MCP loading. Session trees (branching/rewinding) are interesting but require agent-level support. Recommended: document "remix" skill pattern, add desktop notification pattern for long tasks.
- Notes: Review complete. See todo/research/pi-agent-review.md. Key findings: Pi's minimal 4-tool core validates aidevops's on-demand MCP loading. Session trees (branching/rewinding) are the most interesting feature aidevops lacks but can't implement without agent-level support. Extension hot-reload is powerful but different architecture from aidevops's markdown subagents. Recommended: document "remix" skill pattern in build-agent.md, add desktop notification pattern for long tasks. Skip: removing MCP, rewriting in TypeScript.
- Notes: Review complete. See todo/research/pi-agent-review.md. Key findings: Pi's minimal 4-tool core validates aidevops's on-demand MCP loading. Session trees (branching/rewinding) are the most interesting feature aidevops lacks but can't implement without agent-level support. Extension hot-reload is powerful but different architecture from aidevops's markdown subagents. Recommended: document "remix" skill pattern in build-agent.md, add desktop notification pattern for long tasks. Skip: removing MCP, rewriting in TypeScript.

- [ ] t104 Add Tirith terminal security guard for homograph/injection attacks #security #tools #terminal ~2h (ai:1.5h test:20m read:10m) logged:2026-02-03 ref:https://github.com/sheeki03/tirith
- Notes: Tirith (740 stars, Rust, AGPL-3.0) - terminal security tool that catches attacks browsers block but terminals don't. **30 rules across 7 categories:** 1) Homograph attacks (Cyrillic/Greek lookalikes, punycode, mixed-script). 2) Terminal injection (ANSI escapes, bidi overrides, zero-width chars). 3) Pipe-to-shell (`curl|bash`, `wget|sh`, `eval $(wget ...)`). 4) Dotfile attacks (downloads targeting ~/.bashrc, ~/.ssh/authorized_keys). 5) Insecure transport (HTTP piped to shell, `curl -k`). 6) Ecosystem threats (git clone typosquats, untrusted Docker registries, pip/npm URL installs). 7) Credential exposure (userinfo tricks, shortened URLs). **Integration options:** 1) Add to aidevops setup/onboarding as recommended install. 2) Create tirith.md subagent at tools/security/. 3) Document shell hook setup (`eval "$(tirith init)"`). 4) Consider MCP wrapper for `tirith check` command validation. **Key features:** Sub-millisecond overhead, local-only (no network calls), YAML policy config, bypass with `TIRITH=0` prefix. Install: `brew install sheeki03/tap/tirith` or `npm install -g tirith` or `cargo install tirith`.
- [ ] t124 Add Tirith terminal security guard for homograph/injection attacks #security #tools #terminal ~2h (ai:1.5h test:20m read:10m) logged:2026-02-03 ref:https://github.com/sheeki03/tirith
- Notes: Tirith (1,300 stars, Rust, AGPL-3.0) - also see t104 plan: Tirith's `tirith run` command provides verified download-then-execute as the implementation vehicle for curl|sh hardening. - terminal security tool that catches attacks browsers block but terminals don't. **30 rules across 7 categories:** 1) Homograph attacks (Cyrillic/Greek lookalikes, punycode, mixed-script). 2) Terminal injection (ANSI escapes, bidi overrides, zero-width chars). 3) Pipe-to-shell (`curl|bash`, `wget|sh`, `eval $(wget ...)`). 4) Dotfile attacks (downloads targeting ~/.bashrc, ~/.ssh/authorized_keys). 5) Insecure transport (HTTP piped to shell, `curl -k`). 6) Ecosystem threats (git clone typosquats, untrusted Docker registries, pip/npm URL installs). 7) Credential exposure (userinfo tricks, shortened URLs). **Integration options:** 1) Add to aidevops setup/onboarding as recommended install. 2) Create tirith.md subagent at tools/security/. 3) Document shell hook setup (`eval "$(tirith init)"`). 4) Consider MCP wrapper for `tirith check` command validation. **Key features:** Sub-millisecond overhead, local-only (no network calls), YAML policy config, bypass with `TIRITH=0` prefix. Install: `brew install sheeki03/tap/tirith` or `npm install -g tirith` or `cargo install tirith`.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The note for task t124 seems to have a fragment from another plan inserted mid-sentence, which makes it difficult to read. Consider rephrasing for clarity.

Suggested change
- Notes: Tirith (1,300 stars, Rust, AGPL-3.0) - also see t104 plan: Tirith's `tirith run` command provides verified download-then-execute as the implementation vehicle for curl|sh hardening. - terminal security tool that catches attacks browsers block but terminals don't. **30 rules across 7 categories:** 1) Homograph attacks (Cyrillic/Greek lookalikes, punycode, mixed-script). 2) Terminal injection (ANSI escapes, bidi overrides, zero-width chars). 3) Pipe-to-shell (`curl|bash`, `wget|sh`, `eval $(wget ...)`). 4) Dotfile attacks (downloads targeting ~/.bashrc, ~/.ssh/authorized_keys). 5) Insecure transport (HTTP piped to shell, `curl -k`). 6) Ecosystem threats (git clone typosquats, untrusted Docker registries, pip/npm URL installs). 7) Credential exposure (userinfo tricks, shortened URLs). **Integration options:** 1) Add to aidevops setup/onboarding as recommended install. 2) Create tirith.md subagent at tools/security/. 3) Document shell hook setup (`eval "$(tirith init)"`). 4) Consider MCP wrapper for `tirith check` command validation. **Key features:** Sub-millisecond overhead, local-only (no network calls), YAML policy config, bypass with `TIRITH=0` prefix. Install: `brew install sheeki03/tap/tirith` or `npm install -g tirith` or `cargo install tirith`.
- Notes: Tirith (1,300 stars, Rust, AGPL-3.0) is a terminal security tool that catches attacks browsers block but terminals don't. Also see the t104 plan: Tirith's `tirith run` command provides verified download-then-execute as the implementation vehicle for curl|sh hardening. **30 rules across 7 categories:** 1) Homograph attacks (Cyrillic/Greek lookalikes, punycode, mixed-script). 2) Terminal injection (ANSI escapes, bidi overrides, zero-width chars). 3) Pipe-to-shell (`curl|bash`, `wget|sh`, `eval $(wget ...)`). 4) Dotfile attacks (downloads targeting ~/.bashrc, ~/.ssh/authorized_keys). 5) Insecure transport (HTTP piped to shell, `curl -k`). 6) Ecosystem threats (git clone typosquats, untrusted Docker registries, pip/npm URL installs). 7) Credential exposure (userinfo tricks, shortened URLs). **Integration options:** 1) Add to aidevops setup/onboarding as recommended install. 2) Create tirith.md subagent at tools/security/. 3) Document shell hook setup (`eval "$(tirith init)"`). 4) Consider MCP wrapper for `tirith check` command validation. **Key features:** Sub-millisecond overhead, local-only (no network calls), YAML policy config, bypass with `TIRITH=0` prefix. Install: `brew install sheeki03/tap/tirith` or `npm install -g tirith` or `cargo install tirith`.

| 1 | t072 Transcription | None | Foundation - most broadly useful |
| 2 | t071 TTS/STT Models | None (parallel with Phase 1) | Model catalog for other phases |
| 3 | t081 Local Pipecat | t071, t072 | Local voice agent pipeline |
| 4 | t080 NVIDIA Nemotron | t081 | Cloud voice agent with open models |

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The description for task t080 in this table (NVIDIA Nemotron) is inconsistent with its updated title and scope elsewhere in the PR (Set up cloud voice agents and S2S models (GPT-4o-Realtime, MiniCPM-o, Nemotron)). To maintain consistency, please update this entry to reflect the broader scope of the task.

Suggested change
| 4 | t080 NVIDIA Nemotron | t081 | Cloud voice agent with open models |
| 4 | t080 Cloud voice agents and S2S models | t081 | Cloud voice agent setup |

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant