feat: add repository context and judge model features by GangGreenTemperTatum · Pull Request #2 · spaceraccoon/vulnerability-spoiler-alert-action

GangGreenTemperTatum · 2026-02-14T21:01:54Z

big fan, love the research and thank you for the awesome inspiration @spaceraccoon 🙏 i wanted to try tackle this problem after some self-experience with it and in particularly quoting:

However, there was still a lot of false positives, including “bugs-but-not-really-exploitable-vulnerabilities”. This time, I tuned the context further:

Problem

reduces false positives in vulnerability detection. currently claude only sees the diff (max 15KB) without understanding what the code does, leading to potential misclassifications (i've had a few)

Solution

adds two opt-in features to improve detection accuracy (below)

Features:

repository context: fetches up to 3 modified files (pre-patch) from parent commit to give claude understanding of what the code does, not just the diff
judge model: second claude call that reviews primary detections and can reject them. only runs when primary analysis flags a vulnerability, saving api calls

Improvements:

better detection accuracy through additional code context
reduced false positives via adversarial review pattern
graceful degradation on errors (missing files, judge failures)

Before & After Screenshots

test workflow:

Tests

24 unit tests pass (6 new tests added for new functions)
typescript type checking passes
new e2e test workflow (.github/workflows/test-action.yml) runs on PRs
tests both features enabled and disabled for baseline comparison
workflow analyzes last 3 commits from expressjs/express as integration test

to run tests locally:

npm test              # unit tests
npm run typecheck     # type checking
npm run build         # build dist/

Deploy Notes

new action inputs:

enable-repo-context : (optional, default: false) fetch modified files for context (max 3 files, 3KB each)
enable-judge : (optional, default: false) review detections with second model to reduce false positives
judge-model : (optional, default: same as primary model) claude model to use for judge

Copilot

Pull request overview

Adds optional “repo context” and a “judge” pass to improve vulnerability-patch detection accuracy for this GitHub Action by giving the LLM more pre-patch code and enabling a second review step to reduce false positives.

Changes:

Add new action inputs for repository context fetching and a judge model pass.
Fetch up to 3 modified files’ contents (pre-patch) and inject into the analysis prompt.
Add judgeAnalysis to validate positive detections and skip issue creation if the judge disagrees.

Reviewed changes

Copilot reviewed 9 out of 15 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`src/types.ts`	Extends `ActionInputs` with repo-context/judge flags and judge model.
`src/index.ts`	Wires new inputs, fetches repo context, and conditionally runs judge before creating issues.
`src/github.ts`	Implements parsing modified paths from diff and fetching file content for context.
`src/analyzer.ts`	Adds repoContext to the analysis prompt and introduces `judgeAnalysis`.
`src/__tests__/github.test.ts`	Adds unit tests for `getModifiedFilesContent`.
`src/__tests__/analyzer.test.ts`	Adds unit tests for `judgeAnalysis`.
`action.yml`	Defines new action inputs and defaults.
`README.md`	Documents the new inputs and usage example.
`.github/workflows/test-action.yml`	Adds an e2e workflow that runs the action with features on/off.
`.gitignore`	Ignores `CLAUDE.md`.
`dist/index.js`	Built output reflecting new features.
`dist/types.d.ts`	Built type output for new inputs.
`dist/github.d.ts`	Built type output for `getModifiedFilesContent`.
`dist/analyzer.d.ts`	Built type output for `judgeAnalysis` and updated `analyzeCommit` signature.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…cation

- Fix GitHub API parent commit fetch (doesn't support ~1 syntax) - Add code fence escaping to prevent prompt injection - Add test for initial commits (no parent) - Add verbose logging for analysis results - Add test mode to force vulnerability detection

Copilot

Pull request overview

Copilot reviewed 9 out of 15 changed files in this pull request and generated 12 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 10 out of 16 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 8 out of 14 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 8 out of 14 changed files in this pull request and generated 6 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 8 out of 14 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

spaceraccoon · 2026-02-16T08:39:31Z

Thanks for your contributions @GangGreenTemperTatum ! This looks amazing - I agree that full-file context will be helpful looking at how the OSS-Fuzz team also feeds this as part of their context. I'll target to finish my review over the Chinese New Year break and bring this in as soon as possible.

GangGreenTemperTatum · 2026-02-16T12:06:45Z

Thanks for your contributions @GangGreenTemperTatum ! This looks amazing - I agree that full-file context will be ..

Thanks so much @spaceraccoon ! Appreciate the kind words, of course no problem or rush at all! Again, thank you for the awesome work and inspiration, I'm heavily following your work and using this project, it's awesome.

Happy new year!! :) Enjoy the celebrations. Let me know if there's anything I can help with and have a great day!

spaceraccoon

Thanks for your patience @GangGreenTemperTatum ! I've added some comments, main points:

Stricter parsing of judge output to a fixed type we define earlier, similar to VulnerabilityAnalysis (use a JudgeAnalysis type?)
Handle edge case for spaces in git diff file paths
Use type: boolean for the new Action inputs rather than string, so we can save on the need to cast them to boolean manually.

I'm happy to merge these in a separate working branch and update these myself as well if you want!

GangGreenTemperTatum · 2026-02-18T13:20:52Z

hey @spaceraccoon , thanks for being awesome as always!

made some changes in chore: pr feedback:

stricter judge parsing: added a JudgeAnalysis type in types.ts and simplified judgeAnalysis() to parse directly to it using the same JSON.parse("{" + content.text) pattern as analyzeCommit(). on parse failure it returns agrees: true so it still fails gracefully (skips filtering).
boolean inputs: changed both enable-repo-context and enable-judge to type: boolean with default: false in action.yml, and switched to core.getBooleanInput() in index.ts to avoid manual string casting.
quoted paths in git diff: updated the extractModifiedPaths regex to handle both quoted ("a/path with spaces") and unquoted paths, with a test covering the quoted case.
readme: updated defaults to false and changed example values from strings to bare booleans.

i hope this looks good, let me know if there's anything i can help with and i appreciate your time - it's been great working with you on this!

spaceraccoon

Thanks @GangGreenTemperTatum! Will merge and push as a new minor version release.

spaceraccoon · 2026-02-18T16:09:49Z

Closing and re-opening to retry CodeQL ref: https://github.com/orgs/community/discussions/159026#discussioncomment-13122154

spaceraccoon · 2026-02-18T16:23:14Z

Thanks a ton @GangGreenTemperTatum! I've got it running at https://github.com/spaceraccoon/vulnerability-spoiler-alert and will monitor for issues. I noticed a weird bug at https://github.com/spaceraccoon/vulnerability-spoiler-alert/actions/runs/22147833025/job/64030138126 (404-ed fetching one file, although the rest were fine), but will get to it later.

GangGreenTemperTatum · 2026-02-18T18:27:40Z

Thanks a ton @GangGreenTemperTatum! I've got it running at https://github.com/spaceraccoon/vulnerability-spoiler-alert and will monitor for issues. I noticed a weird bug at https://github.com/spaceraccoon/vulnerability-spoiler-alert/actions/runs/22147833025/job/64030138126 (404-ed fetching one file, although the rest were fine), but will get to it later.

absolutely @spaceraccoon , thank you!! yes, i am also watching the RSS on that so will let you know if theres anything funky I will let you know! interesting, i did some digging and can see the commit exists in nodejs/node.:

Hash: d7a755153a1f3dd071d2ae8af8f577edbd717ad8
Message: doc: remove incorrect mention of module in typescript.md
Date: 2026-02-18
PR: docs: Remove incorrect mention of module nodejs/node#61839

The 404is because the file tools/dep_updaters/update-test426-fixtures.sh doesn't exist at that commit — not because the commit itself is missing. That commit only touches docs (typescript.md), so it wouldn't have that shell script at that pat which i think makes sense and im noodling on it atm

GangGreenTemperTatum · 2026-02-18T18:29:42Z

hey @spaceraccoon , sorry for the additional ping - whilst writing the above i had an outstanding notification for spaceraccoon/vulnerability-spoiler-alert#32 - did you push up to the live site prior to that? (im thinking this will help us with the above thesis)

spaceraccoon · 2026-02-18T18:40:58Z

hey @spaceraccoon , sorry for the additional ping - whilst writing the above i had an outstanding notification for spaceraccoon/vulnerability-spoiler-alert#32 - did you push up to the live site prior to that? (im thinking this will help us with the above thesis)

Yep! I’ve already enabled the new context + judge, as well as the fix for the 404 problem before this issue finding

GangGreenTemperTatum added 2 commits February 14, 2026 15:54

feat: add repository context and judge model features

5bd56c9

chore: set enable to true and docs

3195ee9

Copilot AI review requested due to automatic review settings February 14, 2026 21:01

Copilot started reviewing on behalf of GangGreenTemperTatum February 14, 2026 21:02 View session

GangGreenTemperTatum marked this pull request as draft February 14, 2026 21:02

GangGreenTemperTatum mentioned this pull request Feb 14, 2026

feat: ads/judge n context GangGreenTemperTatum/vulnerability-spoiler-alert-action#1

Open

fix: seed state file in test workflow to force commit analysis

468c41e

Copilot AI reviewed Feb 14, 2026

View reviewed changes

Comment thread action.yml Outdated

Comment thread .github/workflows/test-action.yml Outdated

Comment thread src/github.ts

Comment thread src/github.ts Outdated

Comment thread src/analyzer.ts

GangGreenTemperTatum added 2 commits February 14, 2026 16:10

chore: enhance bump test outputs

af7dd3b

feat: add test mode to force vulnerability detection for judge verifi…

32afe2b

…cation

Copilot AI review requested due to automatic review settings February 14, 2026 21:11

Copilot started reviewing on behalf of GangGreenTemperTatum February 14, 2026 21:12 View session

GangGreenTemperTatum added 2 commits February 14, 2026 16:14

fix: resolve parent SHA correctly and escape code fences

1008af7

Copilot AI reviewed Feb 14, 2026

View reviewed changes

fix: resolve parent SHA correctly and add judge verification

45c595c

Copilot AI review requested due to automatic review settings February 14, 2026 21:25

Copilot started reviewing on behalf of GangGreenTemperTatum February 14, 2026 21:25 View session

Copilot AI reviewed Feb 14, 2026

View reviewed changes

Comment thread .github/workflows/test-action.yml Outdated

Comment thread .github/workflows/test-action.yml Outdated

Comment thread src/github.ts

Comment thread src/analyzer.ts Outdated

GangGreenTemperTatum force-pushed the ads/judge-n-context branch from 37237a3 to 18e8492 Compare February 14, 2026 21:34

GangGreenTemperTatum requested a review from Copilot February 14, 2026 23:20

Copilot started reviewing on behalf of GangGreenTemperTatum February 14, 2026 23:21 View session

Copilot AI reviewed Feb 14, 2026

View reviewed changes

Comment thread README.md Outdated

Comment thread src/analyzer.ts Outdated

GangGreenTemperTatum force-pushed the ads/judge-n-context branch from 18e8492 to 2d975ac Compare February 14, 2026 23:29

Copilot AI review requested due to automatic review settings February 14, 2026 23:32

GangGreenTemperTatum force-pushed the ads/judge-n-context branch from 2d975ac to 4422f71 Compare February 14, 2026 23:32

GangGreenTemperTatum marked this pull request as ready for review February 14, 2026 23:32

Copilot started reviewing on behalf of GangGreenTemperTatum February 14, 2026 23:32 View session

Copilot AI reviewed Feb 14, 2026

View reviewed changes

Comment thread src/analyzer.ts

Comment thread src/index.ts Outdated

Comment thread README.md Outdated

Comment thread src/analyzer.ts

Comment thread src/index.ts

Comment thread README.md Outdated

feat: add repository context and judge model features

2f64de3

GangGreenTemperTatum force-pushed the ads/judge-n-context branch from 4422f71 to 2f64de3 Compare February 15, 2026 00:27

GangGreenTemperTatum requested a review from Copilot February 15, 2026 00:29

Copilot started reviewing on behalf of GangGreenTemperTatum February 15, 2026 00:29 View session

Copilot AI reviewed Feb 15, 2026

View reviewed changes

Comment thread action.yml Outdated

Comment thread action.yml

spaceraccoon requested changes Feb 18, 2026

View reviewed changes

Comment thread src/analyzer.ts Outdated

Comment thread src/github.ts Outdated

Comment thread action.yml Outdated

Comment thread action.yml Outdated

Comment thread src/index.ts Outdated

Comment thread README.md Outdated

chore: pr feedback

4c59da5

spaceraccoon self-requested a review February 18, 2026 16:06

spaceraccoon approved these changes Feb 18, 2026

View reviewed changes

spaceraccoon closed this Feb 18, 2026

spaceraccoon reopened this Feb 18, 2026

spaceraccoon merged commit a27cc86 into spaceraccoon:main Feb 18, 2026
2 checks passed

Conversation

GangGreenTemperTatum commented Feb 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Before & After Screenshots

Tests

Deploy Notes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

spaceraccoon commented Feb 16, 2026

Uh oh!

GangGreenTemperTatum commented Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

spaceraccoon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

GangGreenTemperTatum commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GangGreenTemperTatum commented Feb 14, 2026 •

edited

Loading

GangGreenTemperTatum commented Feb 16, 2026 •

edited

Loading

GangGreenTemperTatum commented Feb 18, 2026 •

edited

Loading