docs: update browser-automation guide with agent-browser as default by marcusquinn · Pull Request #61 · marcusquinn/aidevops

marcusquinn · 2026-01-12T01:04:24Z

Summary

Change default tool from dev-browser to agent-browser
Add visual debugging section for AI self-diagnosis
Add comprehensive session persistence guide
Add debugging checklist

Key Changes

Agent-Browser as Default

agent-browser is now the recommended first choice because:

Zero setup (no daemon to start)
AI-optimized snapshot + ref pattern
Multi-session isolation built-in
CLI-first for shell scripts and CI/CD

Visual Debugging (Don't Ask User)

New section emphasizing AI should check itself before asking user:

agent-browser screenshot /tmp/debug.png  # See current state
agent-browser errors                      # Check for errors
agent-browser console                     # View console messages
agent-browser is visible @e5              # Check element state

Session Persistence

Comprehensive guide for:

Saving/loading auth state
Cookie management
LocalStorage/SessionStorage
Multi-session with shared auth
Injecting tokens from environment variables

Debugging Checklist

Ordered checklist for self-diagnosis before asking user:

Screenshot
Errors
Console
URL check
Snapshot
Visibility
Headed mode

Summary by CodeRabbit

Documentation
- Enhanced browser automation guides with expanded coverage of session management, authentication state persistence, and cookie handling.
- Improved workflow examples for multi-page interactions and form handling.
- Added ethical guidelines section.
- Updated command references and quick reference materials.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

- Change default tool from dev-browser to agent-browser - Add visual debugging section (screenshots, errors, console) - Add session persistence guide (cookies, localStorage, auth state) - Add multi-session with shared auth patterns - Add debugging checklist for self-diagnosis - Emphasize checking yourself before asking user

gemini-code-assist · 2026-01-12T01:04:37Z

Summary of Changes

Hello @marcusquinn, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly updates the browser automation guide by establishing agent-browser as the primary tool, highlighting its advantages for AI agents and CLI-based workflows. It introduces new, crucial sections designed to enhance the autonomy and debugging capabilities of AI agents, specifically focusing on visual self-diagnosis, robust session management, and a structured troubleshooting checklist. These changes aim to streamline automation processes and reduce the need for human intervention during common issues.

Highlights

Default Tool Change: The recommended default browser automation tool has been switched from dev-browser to agent-browser. This change emphasizes agent-browser's benefits, including zero setup, AI-optimized snapshot/ref patterns, multi-session isolation, and a CLI-first approach.
Visual Debugging Guide for AI: A new section titled 'Visual Debugging (Don't Ask User - Check Yourself)' has been added. It provides AI agents with commands for self-diagnosis, such as taking screenshots, checking errors, viewing console messages, and inspecting element states, to reduce reliance on user input for troubleshooting.
Comprehensive Session Persistence: Detailed documentation on session persistence has been introduced, covering how to save and load authentication states, manage cookies, handle LocalStorage and SessionStorage, utilize multi-session with shared authentication, and inject tokens from environment variables.
Debugging Checklist: An ordered checklist is now included to guide AI agents through a structured self-diagnosis process when automation tasks fail, promoting efficient troubleshooting before escalating issues.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

sonarqubecloud · 2026-01-12T01:05:11Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

github-actions · 2026-01-12T01:05:33Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 309 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Mon Jan 12 01:04:50 UTC 2026: Code review monitoring started
Mon Jan 12 01:04:51 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 309
Mon Jan 12 01:04:51 UTC 2026: Qlty - 0 issues found, auto-formatting applied
Mon Jan 12 01:04:53 UTC 2026: Codacy analysis completed with auto-fixes

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 309
VULNERABILITIES: 0

Generated on: Mon Jan 12 01:05:32 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

gemini-code-assist

Code Review

This pull request does a great job of updating the browser automation guide to establish agent-browser as the new default tool. The addition of detailed sections on visual debugging, session persistence, and a debugging checklist significantly improves the documentation's utility for developers. The refactoring of the alternative tools sections also enhances clarity and consistency. My review includes a few suggestions to improve the code examples for consistency and clarity.

gemini-code-assist · 2026-01-12T01:05:39Z

.agent/tools/browser/browser-automation.md

 ```bash
 # Setup (one-time)
-bash ~/.aidevops/agents/scripts/dev-browser-helper.sh setup
+~/.aidevops/agents/scripts/agent-browser-helper.sh setup


For consistency with other script executions in this document (e.g., on lines 307, 350, 370) and to ensure the script runs correctly even if it doesn't have execute permissions, it's best to explicitly call it with bash.

Suggested change

~/.aidevops/agents/scripts/agent-browser-helper.sh setup

bash ~/.aidevops/agents/scripts/agent-browser-helper.sh setup

gemini-code-assist · 2026-01-12T01:05:39Z

.agent/tools/browser/browser-automation.md

-# 1. Start server (if not running)
-bash ~/.aidevops/agents/scripts/dev-browser-helper.sh start
+# 1. Setup (one-time)
+~/.aidevops/agents/scripts/agent-browser-helper.sh setup


Similar to other script calls in this file, it's recommended to explicitly use bash to execute this setup script for consistency and robustness.

Suggested change

~/.aidevops/agents/scripts/agent-browser-helper.sh setup

bash ~/.aidevops/agents/scripts/agent-browser-helper.sh setup

gemini-code-assist · 2026-01-12T01:05:39Z

.agent/tools/browser/browser-automation.md

 ```bash
-# Quick setup
-bash .agent/scripts/stagehand-python-helper.sh setup
+# Setup
+bash ~/.aidevops/agents/scripts/stagehand-helper.sh setup

-# MCP integration
-bash .agent/scripts/setup-mcp-integrations.sh stagehand-python
+# Natural language actions
+await stagehand.act("click the login button")
+await stagehand.act("fill in the email field with user@example.com")

-# Run examples
-source ~/.aidevops/stagehand-python/.venv/bin/activate
-python examples/basic_example.py
-python examples/ecommerce_automation.py "wireless headphones"
+# Structured extraction
+const data = await stagehand.extract("get product prices", z.array(z.number()))
 ```


The code block is marked as bash, but lines 353, 354, and 357 contain JavaScript syntax (await, const). This is confusing as these are not valid shell commands. To improve clarity, I suggest splitting this into two separate code blocks: a bash block for the setup command, and a javascript block to showcase the API usage examples.

coderabbitai · 2026-01-12T01:06:15Z

Caution

Review failed

The pull request is closed.

Walkthrough

Documentation overhaul for browser automation tooling, promoting agent-browser as the default tool. Adds comprehensive sections on session persistence, authentication state management, cookies, and multi-session workflows while updating all setup commands and example code blocks accordingly.

Changes

Cohort / File(s)	Summary
Browser Automation Documentation `\.agent/tools/browser/browser-automation.md`	Shifts default tool from dev-browser to agent-browser; restructures Visual Debugging and Self-diagnosis workflows; expands auth state management (save/load), cookies, LocalStorage/SessionStorage, and multi-session usage; replaces sample commands and code blocks with agent-browser equivalents; adds Ethical Guidelines section and multi-page workflow examples with sanitized forms/navigation patterns

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~22 minutes

Possibly related PRs

feat: add agent-browser support for headless browser automation CLI #59 — Directly related agent-browser tooling enhancement, including complementary documentation and script updates for the same browser automation framework

Poem

🌐 A browser reborn, agent-wise and keen,
Sessions persist through auth's green screen,
Cookies and storage, now first in line,
From dev to agent, the docs align. ✨

📜 Recent review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between ca4d3c6 and f17cce8.

📒 Files selected for processing (1)

.agent/tools/browser/browser-automation.md

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

augmentcode · 2026-01-12T01:11:35Z

🤖 Augment PR Summary

Summary: Updates the browser automation guide to make agent-browser the default tool for most automation workflows.

Changes:

Switches the default recommendation from dev-browser to agent-browser and updates the decision tree + quick reference accordingly.
Adds a “Visual Debugging” section encouraging AI self-diagnosis via screenshots, errors, console logs, and element state checks.
Introduces a new “Session Persistence” guide covering auth state save/load, cookies, and storage usage (including multi-session patterns).
Reorganizes examples to emphasize the snapshot + ref workflow and adds a step-by-step debugging checklist.

Technical Notes: Examples use --session for isolation and assume helper scripts are available under ~/.aidevops/agents/scripts/.

_{🤖 Was this summary useful? React with 👍 or 👎}

augmentcode

Review completed. 3 suggestions posted.

Comment augment review to trigger a new review at any time.

augmentcode · 2026-01-12T01:11:36Z

.agent/tools/browser/browser-automation.md

+agent-browser cookies set "session_id" "abc123"
+
+# Set cookie with options
+agent-browser cookies set "auth_token" "xyz789" --domain ".example.com" --path "/" --secure


This example uses agent-browser cookies set with --domain/--path/--secure flags, but elsewhere in our docs it’s shown as cookies set <name> <val> only; if these flags aren’t supported they’ll be ignored and make debugging auth harder (also applies to the env-var cookie example below).

_{🤖 Was this useful? React with 👍 or 👎}

augmentcode · 2026-01-12T01:11:36Z

.agent/tools/browser/browser-automation.md

+```bash
+# Start new session with saved auth
+agent-browser open https://app.example.com
+agent-browser state load ~/.aidevops/.agent-workspace/auth/example-com.json


agent-browser state load is presented as applying to the current session after open + reload; if state loading actually needs to happen at session/browser creation, this workflow won’t restore auth as written (also applies to the multi-session examples).

_{🤖 Was this useful? React with 👍 or 👎}

augmentcode · 2026-01-12T01:11:36Z

.agent/tools/browser/browser-automation.md

-#### **Python Version** 🐍 **NEW**
+**AI-powered browser automation with natural language control**

 ```bash


This fenced block is labeled bash but contains JS/TS (await stagehand.act(...)), so copying it into a shell will fail; consider using a javascript/ts fence to match the other Stagehand docs.

_{🤖 Was this useful? React with 👍 or 👎}

gemini-code-assist bot reviewed Jan 12, 2026

View reviewed changes

marcusquinn merged commit fb83249 into main Jan 12, 2026
10 of 12 checks passed

augmentcode bot reviewed Jan 12, 2026

View reviewed changes

This was referenced Jan 24, 2026

docs: update browser automation section with all tools in tested preference order #162

Merged

docs: update browser tool docs with benchmarks and add benchmark agent #163

Merged

feat(browser): add playwright-cli subagent for AI agent automation #196

Merged

	~/.aidevops/agents/scripts/agent-browser-helper.sh setup
	bash ~/.aidevops/agents/scripts/agent-browser-helper.sh setup

Conversation

marcusquinn commented Jan 12, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Changes

Agent-Browser as Default

Visual Debugging (Don't Ask User)

Session Persistence

Debugging Checklist

Summary by CodeRabbit

Uh oh!

gemini-code-assist bot commented Jan 12, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

sonarqubecloud bot commented Jan 12, 2026

Quality Gate passed

Uh oh!

github-actions bot commented Jan 12, 2026

🔍 Code Quality Report

📈 Current Quality Metrics

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

augmentcode bot commented Jan 12, 2026

Uh oh!

augmentcode bot left a comment

Choose a reason for hiding this comment

Uh oh!

augmentcode bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

augmentcode bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

augmentcode bot Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

marcusquinn commented Jan 12, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 12, 2026 •

edited

Loading