📝 Walkthrough

This PR adds a new AGENTS.md documentation file providing developer guidance for Codex interactions, and substantially rewrites README.md to reposition the project from MCP-centric messaging toward a broader agent-building narrative, including architecture details and updated use-case descriptions.

**Estimated code review effort**: 🎯 2 (Simple) | ⏱️ ~12 minutes
🚥 Pre-merge checks: ✅ 2 passed | ❌ 1 failed (inconclusive)
Actionable comments posted: 4
🧹 Nitpick comments (1)

AGENTS.md (1)

**80-85: Platform-specific prerequisites.**

The macOS-specific setup instructions are clear. Consider noting whether these are optional or required depending on the target platform.

Optional clarification:

```diff
 **Prerequisites for local strategy (one-time setup):**
+*(Required only when building for Linux on macOS)*
 brew install zig
 rustup target add aarch64-unknown-linux-gnu   # Apple Silicon
 rustup target add x86_64-unknown-linux-gnu    # Intel Mac
```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@AGENTS.md` around lines 80-85: Update the "Prerequisites for local strategy (one-time setup)" section to explicitly state whether the macOS-specific commands are required or optional depending on the target platform; mention that `brew install zig` and the two `rustup target add ...` commands apply only to macOS (Apple Silicon vs Intel) and are unnecessary on other OSes, and add a short note indicating which targets are required for cross-compilation versus
optional developer convenience.

---

ℹ️ Review info

⚙️ Run configuration

**Configuration used**: defaults
**Review profile**: CHILL
**Plan**: Pro
**Run ID**: `ae269e45-1f7a-4199-8038-6e5fb1294643`

📥 Commits

Reviewing files that changed from the base of the PR and between 3dda278e18284fd8cbbb1744e9bb1821ccc24e76 and 7868c240c1aaed3265309c820361c30e68275b0b.

📒 Files selected for processing (2)

* `AGENTS.md`
* `README.md`
> ## Project Overview
>
> Wren Engine (OSS) is an open source semantic engine for MCP clients and AI agents. It translates SQL queries through a semantic layer (MDL - Modeling Definition Language) and executes them against 22+ data sources (PostgreSQL, BigQuery, Snowflake, Spark, etc.). The engine is powered by Apache DataFusion (Canner fork).
**Inconsistent data source count.**

Line 7 claims "22+ data sources" but README.md lines 162-178 only list 17 sources. Verify the correct count and ensure consistency across documentation.

Suggested verification:

```bash
#!/bin/bash
# Description: Count actual connector implementations
echo "Connectors in ibis-server:"
fd -t f -e py . ibis-server/app/model/metadata/ --exec basename {} .py | grep -v '__' | grep -v 'factory' | grep -v 'base' | sort | wc -l
echo -e "\nListed in README:"
echo "17"
```

🧰 Tools

🪛 LanguageTool

[grammar] ~7-~7: Use a hyphen to join words.
Context: ...t Overview Wren Engine (OSS) is an open source semantic engine for MCP clients a...
(QB_NEW_EN_HYPHEN)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@AGENTS.md` at line 7, The AGENTS.md claim "22+ data sources" is inconsistent
with README.md's list of 17 connectors; verify the actual connector count by
counting connector implementations (e.g., files under
ibis-server/app/model/metadata) and either update AGENTS.md to the correct
number or expand the README list to match; specifically edit the phrase "Wren
Engine (OSS) ... executes them against 22+ data sources" in AGENTS.md and the
connector list in README.md (lines enumerating sources) so both reflect the
verified count and ensure any summary header (e.g., "22+ data sources") and the
detailed list stay consistent.
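A check like the one this prompt describes can be scripted so the claim and the list cannot drift apart again. The sketch below is a hypothetical helper with illustrative data only (the real values live in AGENTS.md and README.md); it flags an "N+" claim that exceeds the number of sources actually listed:

```python
import re

def claim_is_consistent(claim: str, listed_sources: list[str]) -> bool:
    """True when an 'N+ data sources' claim does not exceed the listed count."""
    match = re.search(r"(\d+)\+\s+data sources", claim)
    if match is None:
        raise ValueError("no 'N+ data sources' claim found")
    return len(listed_sources) >= int(match.group(1))

# Illustrative values only; README.md actually enumerates 17 sources.
claim = "executes them against 22+ data sources"
sources = ["PostgreSQL", "BigQuery", "Snowflake"]
print(claim_is_consistent(claim, sources))  # False: 3 listed, 22+ claimed
```

Wired into CI, this turns the "verify the count" step into a failing check rather than a manual review comment.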
> `just format` # ruff auto-fix + taplo
>
> Available test markers: `postgres`, `mysql`, `mssql`, `bigquery`, `snowflake`, `clickhouse`, `trino`, `oracle`, `athena`, `duckdb`, `athena_spark`, `databricks`, `spark`, `doris`, `local_file`, `s3_file`, `gcs_file`, `minio_file`, `functions`, `profile`, `cache`, `unit`, `enterprise`, `beta`.
🧩 Analysis chain

🏁 Script executed:

```bash
#!/bin/bash
# Description: Extract registered pytest markers from pyproject.toml
rg -A 50 '^\[tool\.pytest\.ini_options\]' ibis-server/pyproject.toml | rg 'markers\s*=' -A 30
```

Repository: Canner/wren-engine
Length of output: 1142

**Update test marker list to match pytest configuration.**

The listed markers contain inaccuracies: `athena_spark`, `profile`, `cache`, `unit`, and `enterprise` are not registered in the pytest configuration, while `canner` and `redshift` are missing from the list. Update to: `postgres`, `mysql`, `mssql`, `bigquery`, `snowflake`, `clickhouse`, `trino`, `oracle`, `athena`, `duckdb`, `databricks`, `spark`, `doris`, `local_file`, `s3_file`, `gcs_file`, `minio_file`, `functions`, `beta`, `canner`, `redshift`.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@AGENTS.md` at line 57, Update the test marker list string that starts with
"Available test markers:" to match the pytest config by replacing the inaccurate
entries and adding the missing ones so it reads: postgres, mysql, mssql,
bigquery, snowflake, clickhouse, trino, oracle, athena, duckdb, databricks,
spark, doris, local_file, s3_file, gcs_file, minio_file, functions, beta,
canner, redshift; modify the marker list text in the AGENTS.md line containing
that sentence to exactly reflect this corrected set.
> **wren-core internals** (`wren-core/core/src/`):
> - `mdl/` — Core MDL processing: `WrenMDL` (manifest + symbol table), `AnalyzedWrenMDL` (with lineage), function definitions (scalar/aggregate/window per dialect), type planning
> - `logical_plan/analyze/` — DataFusion analyzer rules: `ModelAnalyzeRule` (TableScan → ModelPlanNode), scope tracking, access control (RLAC/CLAC), view expansion, relationship chain resolution
> - `logical_plan/optimize/` — Optimization passes: type coercion, timestamp simplification
> - `sql/` — SQL parsing and analysis
>
> **ibis-server internals** (`ibis-server/app/`):
> - `routers/v3/connector.py` — Main API endpoints (query, validate, dry-plan, metadata)
> - `model/metadata/` — Per-connector implementations (22 connectors), each with its own metadata handling
> - `model/metadata/factory.py` — Connector instantiation
> - `mdl/` — MDL processing: `core.py` (session context), `rewriter.py` (query rewriting), `substitute.py` (model substitution)
> - `custom_ibis/`, `custom_sqlglot/` — Ibis and SQLGlot extensions for Wren-specific behavior
>
> **Manifest types** (`wren-core-base/src/mdl/`):
> - `manifest.rs` — `Manifest`, `Model`, `Column`, `Metric`, `Relationship`, `View`, `RowLevelAccessControl`, `ColumnLevelAccessControl`
> - `builder.rs` — Fluent `ManifestBuilder` API
> - Uses `wren-manifest-macro` for auto-generating Pydantic-compatible Python classes
🧩 Analysis chain

🏁 Script executed:

```bash
#!/bin/bash
# Description: Verify key architecture paths mentioned in documentation
paths=(
  "wren-core/core/src/mdl"
  "wren-core/core/src/logical_plan/analyze"
  "wren-core/core/src/logical_plan/optimize"
  "wren-core/core/src/sql"
  "ibis-server/app/routers/v3/connector.py"
  "ibis-server/app/model/metadata/factory.py"
  "ibis-server/app/mdl/core.py"
  "wren-core-base/src/mdl/manifest.rs"
  "wren-core-base/src/mdl/builder.rs"
)
for path in "${paths[@]}"; do
  if [ -e "$path" ]; then
    echo "✓ $path"
  else
    echo "✗ Missing: $path"
  fi
done
```

Repository: Canner/wren-engine
Length of output: 394

🏁 Script executed:

```bash
# Find sql-related directories and files in wren-core/core/src
find wren-core/core/src -type d -name "*sql*" 2>/dev/null | head -20
```

Repository: Canner/wren-engine
Length of output: 44

🏁 Script executed:

```bash
# Search for sql parser/analyzer in the codebase
rg -l "sql.*pars|parser.*sql" wren-core/core/src --type rust | head -20
```

Repository: Canner/wren-engine
Length of output: 514

🏁 Script executed:

```bash
# Check the directory structure of wren-core/core/src to understand module organization
ls -la wren-core/core/src/
```

Repository: Canner/wren-engine
Length of output: 338

🏁 Script executed:

```bash
# Check the module structure in lib.rs
cat wren-core/core/src/lib.rs
```

Repository: Canner/wren-engine
Length of output: 384

🏁 Script executed:

```bash
# Search for sql-related code across wren-core/core/src
rg -i "sql" wren-core/core/src --type rust -l | head -20
```

Repository: Canner/wren-engine
Length of output: 989

**Remove inaccurate path reference for SQL parsing.**

The `wren-core/core/src/sql/` module does not exist. SQL parsing is handled via DataFusion's `sqlparser` library (re-exported in `lib.rs`), not a dedicated wren-core module. Remove this bullet point or replace it with a clarification that SQL parsing is delegated to DataFusion.

Current (incorrect) snippet:

- `sql/` — SQL parsing and analysis

All other referenced paths are accurate and verified.
🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@AGENTS.md` around lines 110-126: Remove or update the incorrect "`sql/` — SQL parsing and analysis" bullet in the list of wren-core internals: delete that line and replace it with a brief clarification that SQL parsing is delegated to DataFusion's sqlparser (re-exported in `lib.rs`) rather than handled by a `wren-core/core/src/sql/` module, so references to SQL parsing should point to the DataFusion/sqlparser re-export in `lib.rs`.
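Stale path bullets like this one are easy to catch mechanically. The sketch below is a hypothetical docs check (the bullet syntax mirrors the AGENTS.md list; a temporary directory stands in for `wren-core/core/src/`) that flags any `` `dir/` `` bullet whose directory does not exist:

```python
import re
import tempfile
from pathlib import Path

def missing_path_bullets(doc_text: str, src_root: Path) -> list[str]:
    """Return bullet paths such as `sql/` that do not exist under src_root."""
    flagged = []
    for line in doc_text.splitlines():
        # Match bullets of the form: - `some/dir/` — description
        m = re.match(r"-\s+`([\w/]+/)`", line.strip())
        if m and not (src_root / m.group(1)).is_dir():
            flagged.append(m.group(1))
    return flagged

# Demo against a throwaway tree standing in for wren-core/core/src/.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "mdl").mkdir()
    doc = "- `mdl/` — Core MDL processing\n- `sql/` — SQL parsing and analysis"
    print(missing_path_bullets(doc, root))  # ['sql/']
```

Run against the real tree, this would have surfaced the nonexistent `sql/` bullet the same way the verification scripts above did.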
> ## Supported Data Sources
>
> Wren Engine is built to work across modern data stacks, including warehouses, databases, and file-based sources.
>
> Current open source support includes connectors such as:
>
> - Amazon S3
> - Apache Spark
> - Athena
> - BigQuery
> - Databricks
> - DuckDB
> - Google Cloud Storage
> - Local files
> - MinIO
> - MySQL
> - Oracle
> - PostgreSQL
> - SQL Server
> - Snowflake
> - Trino
> - Redshift
🧩 Analysis chain

🏁 Script executed:

```bash
#!/bin/bash
# Description: Check for connector implementations in ibis-server
fd -t f -e py . ibis-server/app/model/metadata/ --exec basename {} .py | sort | head -20
```

Repository: Canner/wren-engine
Length of output: 216

**Remove DuckDB from the list or add a DuckDB connector implementation.**

The README lists DuckDB as a supported data source, but no DuckDB connector exists in `ibis-server/app/model/metadata/`. Either remove it from the supported sources list or ensure the connector implementation is present. Additionally, ClickHouse and Doris connectors exist but are missing from the documentation.
🧰 Tools
🪛 LanguageTool
[grammar] ~160-~160: Use a hyphen to join words.
Context: ...s, and file-based sources. Current open source support includes connectors such ...
(QB_NEW_EN_HYPHEN)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@README.md` around lines 156 - 178, The README's "Supported Data Sources"
section mentions DuckDB but there is no DuckDB connector implementation under
ibis-server/app/model/metadata/, so either remove "DuckDB" from that list or add
a proper connector module (implement the DuckDB connector in
ibis-server/app/model/metadata/ with the same interface/exports as existing
connectors) and ensure it is registered where other connectors are discovered;
additionally add missing documented entries for ClickHouse and Doris to the same
"Supported Data Sources" list to reflect actual connectors present.
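The DuckDB, ClickHouse, and Doris findings are the same doc-drift problem in both directions, and both can be diffed at once. A sketch with illustrative subsets only (the real module names live under `ibis-server/app/model/metadata/`; the `aliases` map is a hypothetical helper for display names that differ from module names, e.g. "SQL Server" vs `mssql`):

```python
def doc_vs_connectors(readme_sources: set[str],
                      connector_modules: set[str],
                      aliases: dict[str, str]) -> tuple[set[str], set[str]]:
    """Diff documented source names against connector module names."""
    normalized = {aliases.get(name, name).lower().replace(" ", "_")
                  for name in readme_sources}
    undocumented = connector_modules - normalized   # implemented, not in README
    unimplemented = normalized - connector_modules  # in README, no module
    return undocumented, unimplemented

# Illustrative subsets only.
readme = {"DuckDB", "PostgreSQL", "MySQL"}
modules = {"postgres", "mysql", "clickhouse"}
undoc, unimpl = doc_vs_connectors(readme, modules, {"PostgreSQL": "postgres"})
print(sorted(undoc))   # ['clickhouse']
print(sorted(unimpl))  # ['duckdb']
```

The `unimplemented` set captures the DuckDB case (documented with no connector) and `undocumented` captures ClickHouse/Doris (connectors missing from the README).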