feat: add markdown file indexing (headings + cross-links)#399
Merged
Conversation
Parse .md/.mdx files using regex (no tree-sitter dependency) to extract: - Section nodes from headings (h1-h6) with hierarchy via CONTAINS edges - Cross-file IMPORTS edges from markdown links to other repo files Ported from #286 to resolve conflicts with kuzu→lbug rename. Co-Authored-By: Dennis Palatov <dp-web4@users.noreply.github.com> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
Contributor
CI Report❌ Some checks failed Pipeline
Coverage
📋 Full run · Coverage from Ubuntu · Generated by CI |
1 task
dp-web4
added a commit
to dp-web4/GitNexus
that referenced
this pull request
Mar 20, 2026
…→lbug migration The kuzu→lbug migration (abhigyanpatwari#275) didn't carry forward three pieces from the markdown indexing PR (abhigyanpatwari#399): 1. 'Section' missing from NODE_TABLES constant — LadybugDB type system doesn't recognize Section as a valid node type 2. SECTION_SCHEMA missing from NODE_SCHEMA_QUERIES — Section table never created in the database (already fixed in abhigyanpatwari#399 merge, confirming) 3. getCopyQuery falls through to 7-column multi-lang default for Section, but Section CSV has 8 columns (includes 'level'). Causes: "Binder exception: Number of columns mismatch. Expected 7 but got 8" Reproduces on any repo with .md files. Tested fix against a 2K+ markdown file repo (40K nodes, 37K edges) — indexes in 38s with no crashes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abhigyanpatwari
pushed a commit
that referenced
this pull request
Mar 21, 2026
Release includes markdown indexing (#399) and the Section table registration fix (#401), making PR #403 unnecessary. https://claude.ai/code/session_015WxcTDYrGi4sWY8iY8gDZP
icodebuster
pushed a commit
to icodebuster/GitNexus
that referenced
this pull request
Mar 22, 2026
* main: (67 commits) fix(server): allow private/LAN network origins in CORS (abhigyanpatwari#390) fix(ingestion): calculate confidence per resolution tier for heritage/MRO edges (abhigyanpatwari#412) fix(lbug): retry on DB lock with session-safe cleanup (abhigyanpatwari#325) fix(analyze): address review — rename --no-git to --skip-git, fix stale cache fix(analyze): address Copilot review — ESM import, CLI option, .gitignore guard docs(schema): add Community and Process node properties to cypher tool description (abhigyanpatwari#411) fix(analyze): allow indexing folders without a .git directory (abhigyanpatwari#384) token trunking updated mahalanobis threshold to be multi-dim aware fix: clarify that ORT CUDA binaries are linux/x64 only updating workflow verbage fixed prop cutoff issue for pr/issue filtering fix: update symbol and relationship counts in documentation fix: address PR abhigyanpatwari#409 review findings (P0-P3) and simplify import resolution API refactor: unify language dispatch with compile-time exhaustive tables feat: implement cross-file binding propagation for multiple languages fix: hydrate worker DB in server mode + fix LadybugDB getAll API mismatch (abhigyanpatwari#398) (abhigyanpatwari#404) docs: add gitnexus-stable-ops to community integrations fix: register Section in NODE_TABLES and NODE_SCHEMA_QUERIES (abhigyanpatwari#401) feat: add markdown file indexing (headings + cross-links) (abhigyanpatwari#399) ...
motolese
pushed a commit
to motolese/datamoto-gitnexus
that referenced
this pull request
Apr 23, 2026
…twari#399) feat: add markdown file indexing (headings + cross-links) Ports abhigyanpatwari#286 by @dp-web4 onto current main, resolving conflicts from kuzu→lbug rename. Closes abhigyanpatwari#286 Co-Authored-By: Dennis Palatov <dp-web4@users.noreply.github.com> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
kuzu→lbugrename (refactor: migrate from KuzuDB to LadybugDB v0.15 #275).md/.mdxfiles using regex (no tree-sitter dependency) to extract Section nodes from headings and IMPORTS edges from cross-file linksSectionnode type with full schema, CSV generation, and LadybugDB relation table entriesSupersedes #286 — all review feedback from that PR has been incorporated (endLine spans, dedup, level property, .mdx support, unused code removed).
What it does
# h1through###### h6) becomeSectionnodes with hierarchy viaCONTAINSedges (File→Section, Section→Section for nested headings)[text](relative/path.md)) that resolve to files within the repo becomeIMPORTSedges between File nodesFiles changed (5 modified, 1 new)
src/core/graph/types.ts— AddSectiontoNodeLabelunion +levelpropertysrc/core/lbug/schema.ts— AddSectionnode table + relation entriessrc/core/lbug/csv-generator.ts— AddSectionto CSV writerssrc/core/ingestion/pipeline.ts— Insert markdown processing stepsrc/core/ingestion/markdown-processor.ts— New file, ~157 linesTest plan
tscclean) ✅ verified locally🤖 Generated with Claude Code
Co-Authored-By: Dennis Palatov dp-web4@users.noreply.github.com