fix: address external review feedback (title-caps fact, testkit ZSet fidelity, mock-leak, doctor semver, replay seeding) by ComBba · Pull Request #6 · Two-Weeks-Team/vibe-mod

ComBba · 2026-05-12T08:10:15Z

Summary

Addresses the still-valid items from the external reviewers on PR #1 / PR #2 (gemini-code-assist, CodeRabbit, Codex).

Fixes

r_shouting_title starter rule was checking the wrong thing (Codex P2). It used content.upperCaseRatio, which buildPostFactBag derives from the post body — so an ALL-CAPS title on a link post (empty body) never matched, despite the rule's name/sourceNL. Added a new fact content.title.upperCaseRatio (rule-schema FactPaths; fact-bag computes it from the title via a shared upperCaseRatioOf; the system prompt picks it up automatically) and pointed the rule at it. New test: "shouty body, calm title ⇒ no match".
devvit-testkit ZSet fidelity (gemini): zAdd no longer keeps duplicate members — real Redis ZADD updates the existing member's score; zRange now honours reverse with by: 'score' and the branch structure is simplified.
devvit-doctor version check (gemini): node-engine comparison now covers major.minor.patch, not just major.minor.
system-prompt.test (CodeRabbit): clarification examples must have a non-empty suggestedAnswers, not merely an array.
test/setup.ts beforeEach (CodeRabbit): vi.clearAllMocks() only wipes call history, not implementations — a per-test mockResolvedValue/mockImplementation on getPostById/getCommentById would leak into the next test. Now mockReset() those two and explicitly re-establish every Reddit double's default implementation each run.
replay harness: fixtures gain redisHashes / redisZsets (previously only string keys could be seeded), and the runner installs a generic Reddit "thing" stub so rollback/remove/lock/flair flows complete in a replay. New fixtures/undo-action.json demonstrates a rollback round-trip replay.

Not changed (with reason)

The remaining "Codex usage limit reached" / "CodeRabbit rate-limited" notices on PR test+fix: vitest/route tests, acceptance gates, starter rules, @devvit SDK alignment, CI #1/chore(dev): lint/format/hooks, reusable test kit, replay & doctor CLIs, CI hardening #2 are bot status messages, not findings.

Verify

npx tsc --noEmit — clean
npx vitest run — 151 passed, 1 skipped (replay-runner)
npx eslint . --max-warnings 0 / npx prettier --check . — clean
npm run acceptance — 4/4
npm run replay fixtures/undo-action.json — shows the rollback succeeding
CI: green.

🤖 Generated with Claude Code

…t / codex) - starter rule r_shouting_title evaluated the *body* uppercase ratio, not the title — add a `content.title.upperCaseRatio` fact (rule-schema FactPaths + fact-bag computation; system-prompt picks it up automatically) and point the rule at it. New test covers "shouty body, calm title => no match". (codex P2) - devvit-testkit: zAdd no longer allows duplicate ZSet members (real ZADD updates the existing member's score); zRange honours `reverse` with `by: 'score'` and the branch structure is simplified. (gemini) - devvit-doctor: node engine comparison now covers major.minor.patch. (gemini) - system-prompt.test: clarification examples must have a non-empty `suggestedAnswers`, not just an array. (coderabbit) - test/setup beforeEach: clearAllMocks() doesn't reset mock *implementations* — mockReset() getPostById/getCommentById and re-establish every reddit double's default impl so a per-test mockResolvedValue/mockImplementation can't leak. (coderabbit) - replay: fixtures gain `redisHashes` / `redisZsets` (was string keys only) and the runner installs a generic Reddit "thing" stub so rollback/remove/lock/flair flows complete; add fixtures/undo-action.json (rollback round-trip replay). 152 tests pass (1 skipped); tsc/lint/format clean; acceptance 4/4. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

chatgpt-codex-connector · 2026-05-12T08:10:20Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

coderabbitai · 2026-05-12T08:10:23Z

Warning

Rate limit exceeded

@ComBba has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 1 minute and 31 seconds before requesting another review.

You’ve run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: f93a83af-8c25-4b69-a46d-ecdb32e45a9e

📥 Commits

Reviewing files that changed from the base of the PR and between 2610f71 and 76f8e2b.

📒 Files selected for processing (14)

HANDOFF.md
fixtures/undo-action.json
scripts/devvit-doctor.ts
scripts/replay.ts
src/server/evaluator.test.ts
src/server/fact-bag.test.ts
src/server/fact-bag.ts
src/shared/rule-schema.ts
src/shared/starter-rules.test.ts
src/shared/starter-rules.ts
src/shared/system-prompt.test.ts
test/devvit-testkit.ts
test/replay-runner.test.ts
test/setup.ts

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/review-feedback

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist

Code Review

This pull request implements several improvements and bug fixes, notably introducing a content.title.upperCaseRatio fact to correctly identify shouty titles independently of the post body. Other changes include enhancing the Redis testkit's zAdd and zRange behavior, refining Node.js version checks in devvit-doctor, and improving test mock isolation by resetting implementations in test/setup.ts. Feedback focuses on the ASCII-only limitation of the uppercase ratio calculation which may impact internationalization, a bug in handling negative indices in the rank-based zRange emulation, and potential state leakage in tests when using shared mock objects for Reddit entities.

gemini-code-assist · 2026-05-12T08:11:36Z

+function upperCaseRatioOf(s: string): number {
+  const letters = s.replace(/[^A-Za-z]/g, '');
+  return letters.length === 0 ? 0 : (letters.match(/[A-Z]/g)?.length ?? 0) / letters.length;
+}


The upperCaseRatioOf function is strictly limited to ASCII letters (A-Z, a-z). This means it will return a ratio of 0 for content written in non-Latin scripts (e.g., Cyrillic, Greek, or CJK), even if the text is in an uppercase equivalent. While this suffices for the current starter rules, it limits the tool's effectiveness for international subreddits. Consider using a Unicode-aware approach (e.g., checking Unicode categories) if global support is intended.

gemini-code-assist · 2026-05-12T08:11:36Z

+      const end = stop < 0 ? arr.length : stop + 1;
+      return arr.slice(start, end);


The current logic for handling negative stop indices in the rank-based zRange does not correctly emulate Redis behavior for values other than -1. In Redis, stop is an inclusive index where -1 is the last element and -2 is the second to last. The current implementation treats all negative values as the end of the array, which would make ZRANGE key 0 -2 return the same result as ZRANGE key 0 -1.

Suggested change

const end = stop < 0 ? arr.length : stop + 1;

return arr.slice(start, end);

const end = stop < 0 ? arr.length + stop + 1 : stop + 1;

return arr.slice(start, end);

gemini-code-assist · 2026-05-12T08:11:36Z

+    fakeReddit.getPostById.mockResolvedValue(stubThing);
+    fakeReddit.getCommentById.mockResolvedValue(stubThing);


Using mockResolvedValue(stubThing) causes all calls to getPostById and getCommentById to return the exact same object instance. If a single replay request interacts with multiple posts or comments, state changes (like removed or flair) on one will leak to all others. It is safer to use mockImplementation to return a fresh clone for each call.

Suggested change

fakeReddit.getPostById.mockResolvedValue(stubThing);

fakeReddit.getCommentById.mockResolvedValue(stubThing);

fakeReddit.getPostById.mockImplementation(async () => ({ ...stubThing }));

fakeReddit.getCommentById.mockImplementation(async () => ({ ...stubThing }));

feat(compose): clarify select + ban/mute helpText + toast summary (audit findings #1, #4, #6)

fix: address external review feedback (title-caps fact, testkit ZSet fidelity, mock-leak, doctor semver, replay seeding)

…Text + toast 1-line summary Audit-driven UX quick wins for the compose flow. Full audit + scenario rationale: - claudedocs/2026-05-14-compose-flow-audit.md (10 findings vs README promise matrix) - docs/demo-scenario.md (60s demo, README-aligned, Hypothesis 3) ## What changes (3 audit findings landed) ### Finding #1 (CRITICAL for demo) — Clarify modal renders LLM suggestedAnswers as a dropdown The LLM has *always* returned `{ needsClarification, question, suggestedAnswers: [...] }` (see system-prompt.ts:80-85), but the server discarded `suggestedAnswers` and showed only a free-text textarea. Moderators had to paraphrase the question. Now: when `suggestedAnswers` is present, render a `select` field with one click per option PLUS a `clarificationAnswerOther` paragraph as an override / escape hatch. When the LLM returns no suggestions, fall back to the original free-text path. `compose-rule-submit` defensively unwraps SELECTION-array values (PR #39 pattern) and prefers the "Other" override when non-empty. ### Finding #4 — ban/mute toggle gets explanatory helpText on both forms Mod mental model = "checkbox grants ban permission". Real behaviour = "checkbox lets compile succeed when the LLM emits ban/mute". Added a single shared `ALLOW_GUARDED_HELP` string used by the compose form and the clarify modal so both forms explain the toggle the same way. ### Finding #6 — Success toast carries a 1-line summary + an explicit menu pointer Previously: `Compiled rule "X". Dry-run started — check Dashboard in 30s.` The dashboard menu name is "vibe-mod: View rules + log" and Devvit toasts have no buttons, so "check Dashboard" is unreachable for a first-time mod. Now: `Compiled rule "X". → post: modqueue. Dry-run started — open the subreddit ⋯ menu → "vibe-mod: View rules + log" to see preview.` The 1-line `summarizeRule()` makes the deterministic JSON contract visible without leaving the toast (audit finding #2 alternative — the full preview form stays a post-publish item). ## Verification - `npm run check` 4/4 gates green (typecheck + lint + Prettier + 25 compose tests + 3 @devvit/test + acceptance G1–G4) - 6 new tests in routes-compose.test.ts cover: select rendering, free-text fallback, SELECTION-array unwrap, "Other" override precedence, helpText presence, summary in success toast. ## Why this PR (not bundled with module split) Demo video records this UI. UX changes must land before the README screenshot captures (Phase 3) and the Playwright video automation (Phase 4). Module split (Phase 2) is a separate, independent refactor. Refs: claudedocs/2026-05-14-compose-flow-audit.md (Phase 1.5/1.5b), docs/demo-scenario.md (Phase 2.5b)

feat(compose): clarify select + ban/mute helpText + toast summary (audit findings #1, #4, #6)

ComBba merged commit 1d1969b into main May 12, 2026
2 checks passed

ComBba deleted the fix/review-feedback branch May 12, 2026 08:11

gemini-code-assist Bot reviewed May 12, 2026

View reviewed changes

This was referenced May 12, 2026

feat(openai): tune compile call (reasoning_effort: none, verbosity: low), default gpt-5.4-mini #11

Merged

feat(compose): clarify select + ban/mute helpText + toast summary (audit findings #1, #4, #6) #43

Merged

ComBba added a commit that referenced this pull request May 14, 2026

Merge pull request #43 from Two-Weeks-Team/fix/compose-ux-quickwins

4f0192d

feat(compose): clarify select + ban/mute helpText + toast summary (audit findings #1, #4, #6)

ComBba added a commit that referenced this pull request May 15, 2026

Merge pull request #6 from Two-Weeks-Team/fix/review-feedback

312d481

fix: address external review feedback (title-caps fact, testkit ZSet fidelity, mock-leak, doctor semver, replay seeding)

ComBba added a commit that referenced this pull request May 15, 2026

Merge pull request #43 from Two-Weeks-Team/fix/compose-ux-quickwins

bdc1afc

feat(compose): clarify select + ban/mute helpText + toast summary (audit findings #1, #4, #6)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: address external review feedback (title-caps fact, testkit ZSet fidelity, mock-leak, doctor semver, replay seeding)#6

fix: address external review feedback (title-caps fact, testkit ZSet fidelity, mock-leak, doctor semver, replay seeding)#6
ComBba merged 1 commit into
mainfrom
fix/review-feedback

ComBba commented May 12, 2026

Uh oh!

chatgpt-codex-connector Bot commented May 12, 2026

Uh oh!

coderabbitai Bot commented May 12, 2026

Rate limit exceeded

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

gemini-code-assist Bot May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		const end = stop < 0 ? arr.length : stop + 1;
		return arr.slice(start, end);

		fakeReddit.getPostById.mockResolvedValue(stubThing);
		fakeReddit.getCommentById.mockResolvedValue(stubThing);

Conversation

ComBba commented May 12, 2026

Summary

Fixes

Not changed (with reason)

Verify

Uh oh!

chatgpt-codex-connector Bot commented May 12, 2026

Uh oh!

coderabbitai Bot commented May 12, 2026

Rate limit exceeded

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant