test(server): harden gateway, MCP, REST, orchestration, connections, auth, proxy by buremba · Pull Request #684 · lobu-ai/lobu

buremba · 2026-05-13T12:57:49Z

Tests-only. Bundles 5 hardening passes from parallel teammates into one PR by layer.

Coverage added

Branch merged	Area	Tests
`test/harden-proxy-egress` (cherry-pick)	worker proxy, egress judge, SSRF, CRLF	71
`test/harden-mcp`	MCP proxy, tool approval, SSRF guard, cross-agent JWT isolation	149
`test/harden-rest-api`	REST CRUD auth/org-scoping, Slack OAuth replay, `/lobu` prefix	45
`test/harden-orchestration`	worker spawn/queue, watcher-run-race regression	56
`test/harden-connections`	Chat SDK adapters, platform isolation, webhook idempotency	76
`test/harden-auth`	JWT, settings-cookie, redirect-URI, PKCE, policy-store	106

Non-test changes

None — purely new __tests__/*.test.ts files.

Suspected real bugs (filed as issues, not fixed here)

GET /internal/connections requires no auth — anonymous enumeration of every connection. (rest-api)
MCP tool-approval falls back to allow when tool annotations can't be fetched (e.g. SSRF block). (mcp)
NAT64 prefix 64:ff9b::/96 not decoded → SSRF reach into private space. (proxy)
Cross-platform leakage if a caller omits connectionId from an InteractionService event. (connections)
OpenClawProgressProcessor.processEvent null-derefs on malformed message_update. (worker)
Dead 24h TOKEN_EXPIRATION_MS guard in api-auth-middleware (real window is 2h). (auth)

Brittle existing tests called out (recommend follow-up)

secret-proxy.test.ts — one assertion is expect(true).toBe(true) (dead test).
connections/__tests__/multi-tenant-integration.test.ts — all vitest .todo() stubs, never runs.
mcp-proxy.test.ts "caches tools" / "retries on Server not initialized" — count exact fetch calls, brittle.
agent-routes.test.ts — only covers happy path.
config.test.ts — asserts getAuthConfig returns a Promise and nothing else.

🤖 Generated with Claude Code

45 deterministic bun:test tests covering missing/expired auth (401), cross-org agent and connection isolation (403/404), Slack OAuth callback with missing/expired/replayed state, stale-timestamp rejection, /lobu prefix routing, agentId input validation, and connection webhook 404 for unknown connections. Documents the unauthenticated /internal/connections endpoint as a known security gap.

…tests 56 new bun:test cases covering: watcher-run-race regression (classifyQueue never emits connector lanes), spawn-error/crash cleanup, stale worker reconciliation, maxDeployments concurrency limit, workspace dir creation + agentId validation, WORKER_ENV_* prefix-stripping, nix package injection prevention, killWorker double-exit safety, grant sync cache, deployment name determinism, and backoff correctness.

…ation Add 76 deterministic bun:test tests covering: - isSecretField heuristic (true/false for 17 field names) - isTelegramConfig / isSlackConfig narrowing type guards - InteractionService URL scheme guard (rejects javascript:, data:, file:) - InteractionService platform field on all 4 emitted event types - registerInteractionBridge: events with non-matching connectionId are ignored - registerInteractionBridge cleanup: all 4 event-type listeners removed - Duplicate event id idempotency (5-minute dedup window blocks second emit) - Two bridges on different connections don't cross-contaminate - Telegram polling-mode selection (auto/webhook/polling x publicGatewayUrl) - parseSlackTeamJoinEvent edge cases (bot, deleted, missing fields, bad JSON) - registerSlackPlatformHandlers DM vs group channel detection - Unique event ids per postStatusMessage / postLinkButton - beforeCreateHook ordering guarantee - postOauthLink delegates with linkType=oauth

New test files: - mcp-proxy-edge-cases.test.ts: SSRF guard (7 IPv4 private CIDRs + IPv6 loopback), cross-agent JWT isolation, tool-registry collision routing, wildcard grant coverage, onToolBlocked callback metadata, body-size 413, SSE-framed JSON-RPC parsing, in-memory session TTL/isolation, executeToolDirect happy+error paths, requiresToolApproval annotation logic - mcp-tool-call.test.ts: callMcpTool happy path, isError wrapping, 403/404/502 forwarding, empty-content fallback, fetch-throws, header correctness, askUserQuestion/uploadUserFile/getChannelHistory error propagation - tool-policy-edge-cases.test.ts: isDirectPackageInstallCommand (detected + non-detected + documented conservative over-detections), wildcard prefix patterns in buildToolPolicy, normalizeToolList coercion, deny-over-allow priority, case-insensitive allow/deny matching

New test files: - mcp-proxy-edge-cases.test.ts: SSRF guard (7 IPv4 private CIDRs + IPv6 loopback contract), cross-agent JWT isolation, tool-registry collision routing, wildcard grant (/mcp/id/tools/*), onToolBlocked metadata, body-size 413, SSE-framed JSON-RPC parsing, in-memory session TTL, executeToolDirect happy+error, requiresToolApproval annotation logic - mcp-tool-call.test.ts: callMcpTool happy path, isError wrapping, 403/404/502 forwarding, empty-content fallback, fetch-throws, header correctness, AskUser/UploadFile/GetChannelHistory error propagation - tool-policy-edge-cases.test.ts: isDirectPackageInstallCommand (detected + non-detected + conservative over-detections documented), wildcard prefix patterns, normalizeToolList coercion, deny-over-allow priority, case-insensitive matching

71 new deterministic bun tests across 6 describe blocks: - isBlockedIpAddress: private IPv4 ranges, IPv6 loopback/ULA/multicast, IPv4-mapped ::ffff: (dotted + hex forms), zone-ID stripping; documents NAT64 gap (64:ff9b::/96 not currently decoded) - Domain-blocking edge cases: wildcard boundary (evilexample.com vs .example.com), exact-vs-subdomain, case-insensitive, blocklist in unrestricted mode, IP-literal CONNECT (0.0.0.0, 127.0.0.1, 192.168.1.1), IPv6-bracket CONNECT returns 400 (bug documented), DNS rebinding mix - CRLF injection: verifies escapeHeaderValue() prevents \r\n in judge reason from creating extra HTTP header lines; documents that CRLF is collapsed to a space in the status-line reason - VerdictCache: method/path/CONNECT key independence, policyHash invalidation, extraPolicy change → hash change, cross-agent hash isolation - CircuitBreaker: success resets failure counter, re-trips after reset, per-policy isolation, unknown-hash default (closed), half-open exclusivity - PolicyStore.resolve: exact > wildcard, longer wildcard > shorter, agentId isolation, no-bundle, implicit 'default' judge, missing judge → undefined (fail closed), clear(), root-domain wildcard, case-insensitive, cross-agent policyHash independence - EgressJudge: default Haiku model, per-rule model override, circuit-open short-circuit, cache metadata on cached decision, single-failure source distinction (judge-error vs circuit-open)

codecov-commenter · 2026-05-13T12:59:25Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

❌ Patch coverage is 61.53846% with 35 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
packages/cli/src/commands/eval.ts	0.00%	15 Missing ⚠️
packages/core/src/utils/retry.ts	47.82%	12 Missing ⚠️
packages/core/src/logger.ts	0.00%	4 Missing ⚠️
packages/cli/src/commands/chat.ts	0.00%	2 Missing ⚠️
packages/core/src/worker/auth.ts	50.00%	2 Missing ⚠️

📢 Thoughts on this report? Let us know!

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b7306f6ca7

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-13T13:02:29Z

+
+  test("wrong-key token → null (key rotation / leakage scenario)", () => {
+    // Encrypt with ALT_KEY, then switch back to TEST_KEY: AES-GCM tag fails.
+    process.env.ENCRYPTION_KEY = ALT_KEY;


Reset encryption cache before rotating settings keys

When this wrong-key test runs after any earlier encrypt/decrypt call, @lobu/core has already memoized the first ENCRYPTION_KEY, so assigning process.env.ENCRYPTION_KEY = ALT_KEY here does not actually change the key used by encrypt(). The token is still produced with the cached test key and verifySettingsToken() will accept it instead of returning null, making the new suite order-dependent/CI-failing unless the test calls __resetEncryptionKeyCacheForTests() around key changes.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-13T13:02:29Z

+    process.env.ENCRYPTION_KEY =
+      "fedcba9876543210fedcba9876543210fedcba9876543210fedcba9876543210";


Reset encryption cache before expecting worker-key rejection

freshToken() encrypts with the current key and primes the memoized key in @lobu/core; changing process.env.ENCRYPTION_KEY afterward does not affect verifyWorkerToken() in this process. In this scenario the middleware still decrypts the token with the cached original key and returns 200, so the new key-rotation test fails unless the encryption cache is reset before/after mutating the env var.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-13T13:02:29Z

+   * registered for that connection. The `platform` field on the event payload
+   * is metadata, not a routing key — isolation comes from `connectionId`.


Preserve platform as an explicit renderer gate

These tests now document platform as mere metadata and assert that connectionId alone provides isolation, but the repo-level platform-isolation rule says every platform renderer must filter on the event's explicit platform field. With the behavior encoded this way, a same-connection event carrying the wrong platform would still be considered valid by the test suite instead of being rejected, so this should add/expect a same-connection platform-mismatch no-op rather than codifying connection-only routing.

Useful? React with 👍 / 👎.

…test - bun-security-tests job: switch from cd-into-core to `make build-packages` so transitive @lobu/connector-sdk + agent-worker dist are present when the gateway tests import them. - packages/server/vitest.config.ts: exclude src/auth/__tests__/oauth-utils.test.ts (a bun:test unit file landed via #684 in a vitest-scanned directory; vitest can't resolve the bun:test specifier and the test doesn't need a DB anyway).

…ch (WS9) Final piece of the WS1–WS8 security sweep. Adds a dedicated CI job and regression-prevention guards, plus extends the WS5 token-revocation check to call sites that previously bypassed it. PIECE A — security-tests workflow - New .github/workflows/security-tests.yml runs the security suites in isolation so a regression on them shows up under a clear name rather than as collateral in the broader CI job. Three jobs: - static-guards: runs scripts/check-security-patterns.sh - bun-security-tests: encryption, worker-auth, nix-attr-ref, secret-proxy lifecycle/harden, egress-judge timeout/circuit, manifest-enforcement, proxy SSRF hardening, revoked-token-store, proxy-hardening - vitest-security-tests: webhook-signatures (Slack HMAC) under Node + Postgres Out of scope per repo memory (vitest CI gap): the ~30 stale vitest integration files. Documented in the workflow header. PIECE B — static regression guards - scripts/check-security-patterns.sh greps the tree for: 1. window.confirm/alert/prompt outside node_modules (banned by packages/web/DESIGN_GUIDELINES.md — inline confirms only). 2. SQL string-concatenation onto literal SELECT/WHERE fragments. Findings can be suppressed with a same-line `// security-allowed: <reason>` comment. 3. The loose Nix charset regex [A-Za-z0-9._-]+ re-introduced inside packages/server/src/gateway/orchestration/. WS3 replaced this with a strict attribute-ref validator. - One pre-existing hit in packages/server/src/tools/get_content.ts was annotated security-allowed (scopedQuery is an internally-built SQL fragment, every WHERE clause uses $N placeholders). PIECE C — WS5 revocation reach verifySettingsSession already enforces jti revocation internally, so every call site that awaits it inherits the check. For verifyWorkerToken, several non-middleware call sites still bypassed the revoked-token store: - gateway/auth/mcp/proxy.ts (authenticateRequest): now async, checks isRevoked() before returning a session. - gateway/auth/mcp/config-service.ts (buildWorkerMcpConfig): rejects a revoked jti and returns an empty MCP config. - gateway/gateway/index.ts (authenticateWorker): now async, denies the SSE/response/session-context endpoints when the jti is revoked. - gateway/routes/internal/middleware.ts (authenticateWorker): consults the singleton store and returns 401 on a revoked jti. - gateway/routes/public/agent.ts (requireAgentOwnership): checks jti before authorizing the worker-token branch. - gateway/proxy/http-proxy.ts (validateProxyAuth): uses a new sync fast-path RevokedTokenStore.isRevokedCached(jti) — the CONNECT path must stay sync, and same-process revokes hit the cache immediately. New test: revoked-token-store.test.ts "authenticateWorker (internal middleware) — revocation reach" — pins that a revoked jti returns 401 from the internal middleware path that was previously bypassed. Drive-by cleanup so the build passes: - restored getClientIp export in gateway/utils/rate-limiter.ts (removed in #684, broke secret-proxy.ts imports on main). - fixed pre-existing TS errors in packages/core/src/__tests__/ utils-json.test.ts and command-registry.test.ts introduced by #685. - fixed pre-existing biome noEmptyBlockStatements in command-registry.test.ts. - updated the stale NAT64 gap test in proxy-hardening.test.ts — the 64:ff9b::/96 prefix is now decoded and blocked, so the assertion flips from .toBe(false) to .toBe(true).

…ch (WS9) (#693) * ci(security): focused gate, static guards, and broader revocation reach (WS9) Final piece of the WS1–WS8 security sweep. Adds a dedicated CI job and regression-prevention guards, plus extends the WS5 token-revocation check to call sites that previously bypassed it. PIECE A — security-tests workflow - New .github/workflows/security-tests.yml runs the security suites in isolation so a regression on them shows up under a clear name rather than as collateral in the broader CI job. Three jobs: - static-guards: runs scripts/check-security-patterns.sh - bun-security-tests: encryption, worker-auth, nix-attr-ref, secret-proxy lifecycle/harden, egress-judge timeout/circuit, manifest-enforcement, proxy SSRF hardening, revoked-token-store, proxy-hardening - vitest-security-tests: webhook-signatures (Slack HMAC) under Node + Postgres Out of scope per repo memory (vitest CI gap): the ~30 stale vitest integration files. Documented in the workflow header. PIECE B — static regression guards - scripts/check-security-patterns.sh greps the tree for: 1. window.confirm/alert/prompt outside node_modules (banned by packages/web/DESIGN_GUIDELINES.md — inline confirms only). 2. SQL string-concatenation onto literal SELECT/WHERE fragments. Findings can be suppressed with a same-line `// security-allowed: <reason>` comment. 3. The loose Nix charset regex [A-Za-z0-9._-]+ re-introduced inside packages/server/src/gateway/orchestration/. WS3 replaced this with a strict attribute-ref validator. - One pre-existing hit in packages/server/src/tools/get_content.ts was annotated security-allowed (scopedQuery is an internally-built SQL fragment, every WHERE clause uses $N placeholders). PIECE C — WS5 revocation reach verifySettingsSession already enforces jti revocation internally, so every call site that awaits it inherits the check. For verifyWorkerToken, several non-middleware call sites still bypassed the revoked-token store: - gateway/auth/mcp/proxy.ts (authenticateRequest): now async, checks isRevoked() before returning a session. - gateway/auth/mcp/config-service.ts (buildWorkerMcpConfig): rejects a revoked jti and returns an empty MCP config. - gateway/gateway/index.ts (authenticateWorker): now async, denies the SSE/response/session-context endpoints when the jti is revoked. - gateway/routes/internal/middleware.ts (authenticateWorker): consults the singleton store and returns 401 on a revoked jti. - gateway/routes/public/agent.ts (requireAgentOwnership): checks jti before authorizing the worker-token branch. - gateway/proxy/http-proxy.ts (validateProxyAuth): uses a new sync fast-path RevokedTokenStore.isRevokedCached(jti) — the CONNECT path must stay sync, and same-process revokes hit the cache immediately. New test: revoked-token-store.test.ts "authenticateWorker (internal middleware) — revocation reach" — pins that a revoked jti returns 401 from the internal middleware path that was previously bypassed. Drive-by cleanup so the build passes: - restored getClientIp export in gateway/utils/rate-limiter.ts (removed in #684, broke secret-proxy.ts imports on main). - fixed pre-existing TS errors in packages/core/src/__tests__/ utils-json.test.ts and command-registry.test.ts introduced by #685. - fixed pre-existing biome noEmptyBlockStatements in command-registry.test.ts. - updated the stale NAT64 gap test in proxy-hardening.test.ts — the 64:ff9b::/96 prefix is now decoded and blocked, so the assertion flips from .toBe(false) to .toBe(true). * ci(security): block-level allowlist comments + 3-line lookback in guard The static guard's grep-only filter required an inline 'security-allowed:' comment, which is unreadable on multi-line template-literal concatenations in execute-data-sources.ts. Switch to a 3-line lookback so the marker can live on a preceding comment line, and annotate the legitimate validated- table-name CTE constructions in execute-data-sources.ts that the guard flagged on its first CI run (these were called out as safe in WS2's audit: table identifiers come from QUERYABLE_TABLE_NAMES whitelist, orgP is a $N placeholder). * ci: bump to re-trigger workflows

buremba added 10 commits May 13, 2026 06:09

Merge test/harden-mcp

7176b25

Merge test/harden-rest-api

907363a

Merge test/harden-orchestration

0435439

Merge test/harden-connections

edeff37

buremba merged commit 81e6d91 into main May 13, 2026
13 of 15 checks passed

buremba deleted the test/harden-server branch May 13, 2026 12:59

chatgpt-codex-connector Bot reviewed May 13, 2026

View reviewed changes

buremba mentioned this pull request May 13, 2026

ci(security): focused gate, static guards, and broader revocation reach (WS9) #693

Merged

This was referenced May 18, 2026

fix(server): remove unauthenticated GET /internal/connections #846

Merged

fix(interactions): require connectionId to prevent cross-platform leakage #847

Merged

fix: security + correctness bundle (#687 #688 #690 #691) #851

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(server): harden gateway, MCP, REST, orchestration, connections, auth, proxy#684

test(server): harden gateway, MCP, REST, orchestration, connections, auth, proxy#684
buremba merged 10 commits into
mainfrom
test/harden-server

buremba commented May 13, 2026

Uh oh!

Uh oh!

codecov-commenter commented May 13, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 13, 2026

Uh oh!

chatgpt-codex-connector Bot May 13, 2026

Uh oh!

chatgpt-codex-connector Bot May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		process.env.ENCRYPTION_KEY =
		"fedcba9876543210fedcba9876543210fedcba9876543210fedcba9876543210";

		* registered for that connection. The `platform` field on the event payload
		* is metadata, not a routing key — isolation comes from `connectionId`.

Conversation

buremba commented May 13, 2026

Coverage added

Non-test changes

Suspected real bugs (filed as issues, not fixed here)

Brittle existing tests called out (recommend follow-up)

Uh oh!

Uh oh!

codecov-commenter commented May 13, 2026

Codecov Report

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 13, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 13, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants