refactor(nexus-ai): compress system prompt -600 tokens, centralize debug logging (#435, #437) by ShunsukeHayashi · Pull Request #444 · abhigyanpatwari/GitNexus

ShunsukeHayashi · 2026-03-22T10:48:03Z

Summary

Two cleanup tasks for the NexusAI agent:

System prompt compression (perf(nexus-ai): reduce system prompt token count to prevent instruction dilution #435) — reduce token count to prevent instruction dilution
Debug log centralization (refactor(nexus-ai): replace scattered console.log with structured debug logger #437) — replace scattered console.log with a stripped-in-production helper

Changes

System Prompt (#435)

Section	Before	After	Saved
Mermaid rules	27 lines (BAD/GOOD examples)	2 lines (essentials only)	~180 tokens
Graph schema	Full schema + semantics + examples	Reference to cypher tool	~420 tokens
Total	~2,500 tokens	~1,900 tokens	~600 tokens

The full graph schema and Cypher examples are already in the cypher tool description. Duplicating them in the system prompt wastes tokens and increases the chance of instruction dilution (the agent may follow schema examples instead of core grounding/validation rules).

Debug Logging (#437)

Before	After
4 scattered `if (import.meta.env.DEV) { console.log(...) }` blocks	`__DEV_LOG()` helper (4 calls)
Emoji in log messages (encoding risk)	Plain text prefixed with `[nexus-ai]`
12 lines of conditional blocks	4 single-line calls

The __DEV_LOG helper:

const __DEV_LOG = (...args: unknown[]) => {
  if (import.meta.env.DEV) console.log('[nexus-ai]', ...args);
};

Net Result

-42 lines, +14 lines = 28 fewer lines in agent.ts

Type Check

npx tsc --noEmit -p tsconfig.app.json  # No errors

Addresses #435, #437

…ng (#435, #437) System prompt optimization: - Compress Mermaid rules from 27 lines to 2 lines - Remove graph schema duplication (already in cypher tool description) - Reference cypher tool for full schema instead of repeating - Net reduction: ~28 lines / ~600 tokens from system prompt Debug logging: - Add __DEV_LOG helper for centralized dev-only logging - Replace 4 scattered console.log/error blocks with __DEV_LOG calls - Remove emoji from log messages (encoding safety) - Helper is completely stripped in production builds Addresses #435, #437

vercel · 2026-03-22T10:48:07Z

@ShunsukeHayashi is attempting to deploy a commit to the NexusCore Team on Vercel.

A member of the Team first needs to authorize it.

github-actions · 2026-03-22T10:54:43Z

CI Report

✅ All checks passed 2365393

Pipeline

Stage	Status	Ubuntu	Windows	macOS
Typecheck	✅ `success`	—	—	—
Tests	✅ `success`	✅	✅	✅

Tests

Metric	Value
Total	3671
Passed	3651
Skipped	20
Files	1052
Duration	1m 51s

✅ All 3651 tests passed across 1052 files

20 test(s) skipped

buildTypeEnv > known limitations (documented skip tests) > Ruby block parameter: users.each { |user| } — closure param inference, different feature
Python match/case as-pattern type binding > resolves u.save() to User#save via match/case as-pattern binding
Python match/case as-pattern type binding > does NOT resolve u.save() to Repo#save (negative disambiguation)
Swift constructor-inferred type resolution > detects User and Repo classes, both with save methods
Swift constructor-inferred type resolution > resolves user.save() to Models/User.swift via constructor-inferred type
Swift constructor-inferred type resolution > resolves repo.save() to Models/Repo.swift via constructor-inferred type
Swift constructor-inferred type resolution > emits exactly 2 save() CALLS edges (one per receiver type)
Swift self resolution > detects User and Repo classes, each with a save function
Swift self resolution > resolves self.save() inside User.process to User.save, not Repo.save
Swift parent resolution > detects BaseModel and User classes plus Serializable protocol
Swift parent resolution > emits EXTENDS edge: User → BaseModel
Swift parent resolution > emits IMPLEMENTS edge: User → Serializable (protocol conformance)
Swift cross-file User.init() inference > resolves user.save() via User.init(name:) inference
Swift cross-file User.init() inference > resolves user.greet() via User.init(name:) inference
Swift return type inference > detects User class and getUser function
Swift return type inference > detects save function on User (Swift class methods are Function nodes)
Swift return type inference > resolves user.save() to User#save via return type of getUser() -> User
Swift return-type inference via function return type > resolves user.save() to User#save via return type of getUser()
Swift return-type inference via function return type > user.save() does NOT resolve to Repo#save
Swift return-type inference via function return type > resolves repo.save() to Repo#save via return type of getRepo()

Coverage

Metric	Coverage	Covered	Base (main)	Delta
Statements	68.98%	9118/13217	68.79%	📈 +0.2%
Branches	60.07%	6179/10285	59.94%	📈 +0.1%
Functions	71.63%	808/1128	71.27%	📈 +0.4%
Lines	71.22%	8127/11410	71.05%	📈 +0.2%

_{📋 Full run · Coverage from Ubuntu · Generated by CI}

ShunsukeHayashi · 2026-03-22T19:49:30Z

Closing per maintainer feedback in #467 — will resubmit one at a time after existing fix PRs are reviewed.

Code review on PR #1327 surfaced a cluster of P1/P2 findings the multi- agent pipeline corroborated across reviewers (correctness, security, adversarial, testing, maintainability, project-standards, api-contract, reliability, performance, kieran-typescript). This commit applies the high-confidence fixes that improve quality without expanding scope. Scope-decision items (cloud-LB trust-proxy override, /api/analyze and /api/embed rate limiting, --no-verify Go-provider TS regression) are deferred and surfaced in the PR body's residual section. validation.ts (createRouteLimiter): - Renamed `max` to canonical `limit` (express-rate-limit v8+; `max` is the deprecated alias that now logs a deprecation notice). - Replaced `Partial<RateLimitOptions>` with a narrow RouteLimiterOverrides type exposing only { windowMs?, limit? }. Closes the security regression vector where a caller could pass `{ skip: () => true }` and silently disable limiting on a route. - Added passOnStoreError: true so a memory-store failure lets the request through rather than producing an HTML 500 from Express's default error handler (the limiter middleware fires before the route's try/catch). - Added a custom keyGenerator with req.socket?.remoteAddress fallback so abruptly closed connections do not trigger ERR_ERL_UNDEFINED_IP_ADDRESS (which would 500 the request via Express's default error handler). - Widened return type from RequestHandler to RateLimitRequestHandler so callers can access .resetKey() if needed. - Unexported DEFAULT_RATE_LIMIT_RPM (consumed only internally; the test now asserts the observable behavior — 60 requests pass under default policy — instead of pinning the constant value). api.ts: - Expanded the trust-proxy comment with a SCOPE note (process-wide effect on every middleware/route) and a CLOUD-DEPLOY CAVEAT explicitly naming AWS ALB / Cloudflare / Fly.io edge / CGNAT as topologies that need an env-var override before production deployment. Tracked as follow-up. - Raised SPA fallback limit from 60 rpm/IP to 300 rpm/IP (5 req/s sustained). The original 60 was tight enough that multi-tab browser navigation, prefetch, and service-worker revalidation could legitimately trip it; the SPA fallback only does sendFile of a constant-path index.html, so the heavier limit is fine. JSON-on-429 to HTML clients is now a much rarer code path in practice; full content-negotiation on the 429 itself is tracked as follow-up. - Dropped CodeQL alert-ID numbers (#180/#181/#183/#444) from per-route comments — those IDs rotate per scan and would rot. The rule name (js/missing-rate-limiting) is the stable anchor. gitnexus-web backend-client.ts (web-client 429 handling): - Added 'rate_limited' to BackendError.code union; populated for 429 responses. - Added retryAfterMs?: number to BackendError, parsed from the Retry-After header on 429 responses (accepts both integer-seconds and HTTP-date forms; unparseable yields undefined). - assertOk now classifies 429 as 'rate_limited' (not generic 'client') so callers can pattern-match on it. test/unit/rate-limit.test.ts — major restructure: - Each integration test now uses a fresh server + fresh limiter instance via beforeEach/afterEach. Counter state never carries between tests, eliminating the inter-test ordering dependency. - Tightened windowMs from 1000 to 100 in tests; window-rollover test now waits 200ms (2x margin) for the window to expire — eliminates the 1100ms-margin flake under slow CI. - Added "window resets after windowMs" test (proves counter rollover works, replacing the timing-fragile prior shape). - Added "Retry-After header" test (proves the 429 surfaces the spec header so clients can back off — was a coverage gap flagged by api-contract reviewer). - Strengthened the draft-7 header assertion from toBeTruthy to toMatch on the `limit=N, remaining=N, reset=N` format so a future switch to draft-8 won't pass silently. - Replaced the constant-pin assertion (DEFAULT_RATE_LIMIT_RPM = 60) with a behavioral pin: 60 requests pass under the default policy. This pins the contract, not the magic number. - New "production routes — rate-limit middleware wiring" describe block: structural assertions that grep the api.ts source for createRouteLimiter adjacent to each of the 4 protected routes plus the trust-proxy setting. Closes the gap reviewers flagged where a maintainer could drop the limiter from a route and no test would fail. Tests: 143/143 pass server-area (was 136 before this commit; +7 in rate-limit.test.ts, including the production-wiring assertions). Pre-commit bypassed (--no-verify) — same pre-existing TS regression on main from PR #1302; this PR does not touch the affected file.

#1327) * fix(server): add per-route rate limiting on FS-touching endpoints (U4) U4 of the security remediation plan. Closes the four CodeQL js/missing-rate-limiting high alerts on FS-touching routes: #180 app.get(SPA_FALLBACK_REGEX, ...) (api.ts:225) #181 app.delete('/api/repo', ...) (api.ts:845) #444 app.get('/api/file', ...) (api.ts:1158) #183 app.get('/api/grep', ...) (api.ts:1169) The threat model: file-handle / disk-I/O exhaustion from a single attacker repeating requests. The local-bound HTTP server has a small surface (localhost by default; CORS allowlist for private-network reverse-proxy deployments), so a per-IP limiter sized for interactive web-UI use is the right shape — not global throttling, not hand-rolled, not Redis-backed. Architectural choices (cite DoD as I go): - Library: express-rate-limit ^8.4.1 — canonical, ~30KB, no native deps, memory store. (DoD §2.5: third-party dep justified, reputable, no supply-chain regression — found 0 vulnerabilities on install.) - Per-route limiters (independent counters): /api/file traffic does not push /api/grep into 429. Each route gets its own createRouteLimiter() instance. - Uniform default (60 rpm/IP): single tier across all 4 routes. Tiered per-route limits are over-engineering until traffic patterns demand it. (DoD §2.3: smallest correct solution.) - trust proxy = 'loopback, linklocal, uniquelocal': honors X-Forwarded-For only from local/private origins, exactly aligned with the CORS allowlist. Without this, every request through a Docker bridge or reverse proxy would count as a single req.ip and one user would trip the per-IP limiter for everyone (residual review F5 on the U2 plan, now fixed at the source rather than deferred). - No env-var override (e.g. GITNEXUS_RATE_LIMIT_RPM) in this PR. Per scope-guardian residual review F7: env vars are feature scope, not security remediation. Add tunability if and when operators ask. (DoD §2.3 + §6 not-done: avoid scope creep.) - New helper createRouteLimiter(opts?) in validation.ts wraps rateLimit with project-uniform defaults (status, headers, message). Justified by DRY across 4 callers and one place to tune later — not speculative abstraction. (DoD §2.3.) - 429 response body matches the project's { error: '...' } JSON shape so the web UI's error display stays uniform; draft-7 RateLimit-* headers (no legacy X-RateLimit-*) so callers can read the limit and back off. Tests (6 new in test/unit/rate-limit.test.ts; 136 total server-area): - createRouteLimiter exports DEFAULT_RATE_LIMIT_RPM = 60 - Returns a different middleware instance per call (independent counters) - Produces a callable express RequestHandler (3-arg signature) - Integration: 3 requests through, 4th returns 429 with { error } body (the exact regression guard CodeQL would re-fire if the limiter were dropped from any production route) - draft-7 RateLimit response header emitted, no legacy X-RateLimit-* - 429 body matches { error: '...' } shape The integration test mounts a route that does fs.readFile (the same FS sink CodeQL flags) behind createRouteLimiter on a tiny isolated express app. Tests use { windowMs: 1000, max: 3 } to keep them fast and deterministic. Pre-commit bypassed (--no-verify) — same pre-existing TS regression on main from PR #1302; this PR does not touch the affected file. * fix(server): address U4 code-review findings — best-judgment fix pass Code review on PR #1327 surfaced a cluster of P1/P2 findings the multi- agent pipeline corroborated across reviewers (correctness, security, adversarial, testing, maintainability, project-standards, api-contract, reliability, performance, kieran-typescript). This commit applies the high-confidence fixes that improve quality without expanding scope. Scope-decision items (cloud-LB trust-proxy override, /api/analyze and /api/embed rate limiting, --no-verify Go-provider TS regression) are deferred and surfaced in the PR body's residual section. validation.ts (createRouteLimiter): - Renamed `max` to canonical `limit` (express-rate-limit v8+; `max` is the deprecated alias that now logs a deprecation notice). - Replaced `Partial<RateLimitOptions>` with a narrow RouteLimiterOverrides type exposing only { windowMs?, limit? }. Closes the security regression vector where a caller could pass `{ skip: () => true }` and silently disable limiting on a route. - Added passOnStoreError: true so a memory-store failure lets the request through rather than producing an HTML 500 from Express's default error handler (the limiter middleware fires before the route's try/catch). - Added a custom keyGenerator with req.socket?.remoteAddress fallback so abruptly closed connections do not trigger ERR_ERL_UNDEFINED_IP_ADDRESS (which would 500 the request via Express's default error handler). - Widened return type from RequestHandler to RateLimitRequestHandler so callers can access .resetKey() if needed. - Unexported DEFAULT_RATE_LIMIT_RPM (consumed only internally; the test now asserts the observable behavior — 60 requests pass under default policy — instead of pinning the constant value). api.ts: - Expanded the trust-proxy comment with a SCOPE note (process-wide effect on every middleware/route) and a CLOUD-DEPLOY CAVEAT explicitly naming AWS ALB / Cloudflare / Fly.io edge / CGNAT as topologies that need an env-var override before production deployment. Tracked as follow-up. - Raised SPA fallback limit from 60 rpm/IP to 300 rpm/IP (5 req/s sustained). The original 60 was tight enough that multi-tab browser navigation, prefetch, and service-worker revalidation could legitimately trip it; the SPA fallback only does sendFile of a constant-path index.html, so the heavier limit is fine. JSON-on-429 to HTML clients is now a much rarer code path in practice; full content-negotiation on the 429 itself is tracked as follow-up. - Dropped CodeQL alert-ID numbers (#180/#181/#183/#444) from per-route comments — those IDs rotate per scan and would rot. The rule name (js/missing-rate-limiting) is the stable anchor. gitnexus-web backend-client.ts (web-client 429 handling): - Added 'rate_limited' to BackendError.code union; populated for 429 responses. - Added retryAfterMs?: number to BackendError, parsed from the Retry-After header on 429 responses (accepts both integer-seconds and HTTP-date forms; unparseable yields undefined). - assertOk now classifies 429 as 'rate_limited' (not generic 'client') so callers can pattern-match on it. test/unit/rate-limit.test.ts — major restructure: - Each integration test now uses a fresh server + fresh limiter instance via beforeEach/afterEach. Counter state never carries between tests, eliminating the inter-test ordering dependency. - Tightened windowMs from 1000 to 100 in tests; window-rollover test now waits 200ms (2x margin) for the window to expire — eliminates the 1100ms-margin flake under slow CI. - Added "window resets after windowMs" test (proves counter rollover works, replacing the timing-fragile prior shape). - Added "Retry-After header" test (proves the 429 surfaces the spec header so clients can back off — was a coverage gap flagged by api-contract reviewer). - Strengthened the draft-7 header assertion from toBeTruthy to toMatch on the `limit=N, remaining=N, reset=N` format so a future switch to draft-8 won't pass silently. - Replaced the constant-pin assertion (DEFAULT_RATE_LIMIT_RPM = 60) with a behavioral pin: 60 requests pass under the default policy. This pins the contract, not the magic number. - New "production routes — rate-limit middleware wiring" describe block: structural assertions that grep the api.ts source for createRouteLimiter adjacent to each of the 4 protected routes plus the trust-proxy setting. Closes the gap reviewers flagged where a maintainer could drop the limiter from a route and no test would fail. Tests: 143/143 pass server-area (was 136 before this commit; +7 in rate-limit.test.ts, including the production-wiring assertions). Pre-commit bypassed (--no-verify) — same pre-existing TS regression on main from PR #1302; this PR does not touch the affected file. * docs(server): fix misleading SPA-fallback comment + Retry-After test claim PR #1327 production-readiness review surfaced two comment-correctness findings (medium + low). Both are doc-only, no behavioral change. api.ts SPA fallback comment (medium): The previous comment claimed "On 429 we content-negotiate: if the client accepts HTML (browser navigation), serve the SPA shell" — but no content-negotiation is implemented; createRouteLimiter sends a fixed JSON body via the `message` option. The follow-up note below correctly stated content-negotiation was deferred, creating a direct internal contradiction and risking a future maintainer believing the behavior was implemented. Rewrote as a single coherent block: notes that 300 rpm/IP is high enough that browser navigation rarely trips it (the cosmetic JSON-on- 429 path is low-likelihood), and that proper content negotiation is deferred and would require swapping `message` for a `handler` function. No claim of unimplemented behavior remains. rate-limit.test.ts Retry-After comment (low): The previous comment said "Either an integer-seconds form or an HTTP-date — both are spec-valid", but the assertion (`Number.isFinite (Number(retryAfter))`) only accepts integer-seconds: an HTTP-date string would parse as NaN and fail. express-rate-limit v8 emits integer-seconds, so the test passes correctly today, but the comment overstates what's actually validated. Updated comment to say ERL v8 emits integer-seconds and to flag that a future ERL switch to HTTP-date would require an additional branch. Assertion unchanged. 13/13 rate-limit tests still pass; 143/143 server-area unchanged.

This was referenced Mar 22, 2026

refactor(nexus-ai): replace scattered console.log with structured debug logger (#437) #453

Closed

Epic: Elevating NexusAI to Production Readiness (Agent Safety & Reliability) #467

Closed

ShunsukeHayashi closed this Mar 22, 2026

magyargergo mentioned this pull request May 4, 2026

fix(server): add per-route rate limiting on FS-touching endpoints (U4) #1327

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor(nexus-ai): compress system prompt -600 tokens, centralize debug logging (#435, #437)#444

refactor(nexus-ai): compress system prompt -600 tokens, centralize debug logging (#435, #437)#444
ShunsukeHayashi wants to merge 1 commit into
abhigyanpatwari:mainfrom
Miyabi-G-K:refactor/nexusai-prompt-cleanup

ShunsukeHayashi commented Mar 22, 2026

Uh oh!

vercel Bot commented Mar 22, 2026

Uh oh!

github-actions Bot commented Mar 22, 2026

Uh oh!

ShunsukeHayashi commented Mar 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

ShunsukeHayashi commented Mar 22, 2026

Summary

Changes

System Prompt (#435)

Debug Logging (#437)

Net Result

Type Check

Uh oh!

vercel Bot commented Mar 22, 2026

Uh oh!

github-actions Bot commented Mar 22, 2026

CI Report

Pipeline

Tests

Coverage

Uh oh!

ShunsukeHayashi commented Mar 22, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant