Feat: Add prompt injection detection settings UI + update logging #4651

dorien-koelemeijer · 2025-09-16T09:46:50Z

PR Description

UI configuration for prompt injection detection

Allow users to enable prompt injection detection and set the confidence threshold through the UI instead of having to manually update the goose config file.

Update security logging

Noticed that security findings weren't being assigned a unique ID (😱), so fixed that: each finding now gets an ID of the form SEC-{uuid}.
User decisions (allow/deny) are logged with this finding ID for easier log analysis.
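
A minimal sketch of the logging change described above, written in TypeScript purely for illustration. The names `SecurityFinding`, `newFinding`, and `decisionLogLine`, the JSON log shape, and the hyphenated UUID form are assumptions made for this example and are not taken from the goose codebase; only the `SEC-{uuid}` ID format and the allow/deny decisions come from this description.

```typescript
import { randomUUID } from "node:crypto";

// Hypothetical types for illustration; not goose's actual data model.
type Decision = "allow" | "deny";

interface SecurityFinding {
  findingId: string; // "SEC-" followed by a UUID
  confidence: number; // detector confidence, assumed to be in [0, 1]
  explanation: string;
}

// Create a finding with a unique ID so it can be correlated across log lines.
function newFinding(confidence: number, explanation: string): SecurityFinding {
  return { findingId: `SEC-${randomUUID()}`, confidence, explanation };
}

// Build one structured log line that ties the user's decision to the finding.
function decisionLogLine(finding: SecurityFinding, decision: Decision): string {
  return JSON.stringify({
    event: "security_decision", // assumed event name
    finding_id: finding.findingId,
    decision,
  });
}

const finding = newFinding(0.92, "tool call looks like an injected instruction");
console.log(decisionLogLine(finding, "deny"));
```

With an ID on every finding, a later search for one `SEC-...` value should return both the original finding and the decision the user made about it.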

Some general cleanup

Remove some unnecessary/unused code

DOsinga

I had a quick look, but I fear this feels rather vibe codey - there's some duplication, unrelated changes, I wonder if you could do a bit of a review more yourself?

crates/goose-server/src/routes/reply.rs

crates/goose/src/agents/agent.rs

ui/desktop/src/components/settings/security/SecurityToggle.tsx

dorien-koelemeijer · 2025-09-18T08:18:42Z

> I had a quick look, but I fear this feels rather vibe codey - there's some duplication, unrelated changes, I wonder if you could do a bit of a review more yourself?

You're right, I wanted to get it done too quickly. Having another look now and will do some cleanup. Thanks for the comments.

crates/goose-server/src/routes/reply.rs

ui/desktop/src/components/settings/security/SecuritySection.tsx

michaelneale · 2025-09-23T01:05:33Z

looks like linting issues now

michaelneale · 2025-09-24T03:23:59Z

so the config is now saved as:

security_threshold: 0.2
security_enabled: true

not in a security: section - is that expected? (Also, I noted it didn't save the threshold until I changed it, which I assume means it falls back to the default?)

@dorien-koelemeijer there are a few other changes not GUI related, what is the best way to check this is still healthy?

Looks OK otherwise. @DOsinga, happier with the shape of the code?

michaelneale · 2025-09-24T03:28:54Z

thread safety/oncelock stuff looks good too

dorien-koelemeijer · 2025-09-24T07:26:50Z

> so the config is now saved as:
>
> security_threshold: 0.2
> security_enabled: true
>
> not in a security: section - is that expected? (also I noted it didn't save the threshold until I changed it, which I assume means default?)
>
> @dorien-koelemeijer there are a few other changes not GUI related, what is the best way to check this is still healthy?
>
> looks ok otherwise, @DOsinga happier with shape of code?

It'll be security.threshold: xx and security.enabled: true/false. But I think most people will just go through the UI settings now, since it's a lot easier.

Re: non-GUI related changes, you can simply test the changes in goose CLI/desktop version locally. I've tested until the last lint commit and everything was still fine - will do some final testing in the next couple of hours, but it should all be good.

michaelneale · 2025-09-24T07:30:32Z

@dorien-koelemeijer but when I saw it in the config.yaml - it wasn't grouped under security (was an underscore) - just wanted to know if that was expected?

dorien-koelemeijer · 2025-09-24T08:58:46Z

> @dorien-koelemeijer but when I saw it in the config.yaml - it wasn't grouped under security (was an underscore) - just wanted to know if that was expected?

Really? Let me have a look as well 👀 In any case, I'll have to update instructions once this is merged. Would be great to get both this PR and the other one together in one release to prevent confusion

dorien-koelemeijer · 2025-09-24T09:43:02Z

> > @dorien-koelemeijer but when I saw it in the config.yaml - it wasn't grouped under security (was an underscore) - just wanted to know if that was expected?
>
> Really? Let me have a look as well 👀 In any case, I'll have to update instructions once this is merged. Would be great to get both this PR and the other one together in one release to prevent confusion

I think I tried to keep things backwards compatible at first and still had that config saved. You're right, it's definitely saved with underscores.

It seems to make sense to keep the underscores, since all other config is saved like that? I guess it depends on whether this PR gets released at the same time as the other one: if it does, we don't need to stay backwards compatible. Honestly, it's probably easiest to keep it in line with all the other config and use underscores?
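
A small sketch of how the flat, underscored keys discussed in this thread could be read, including the fall-back-to-default behaviour noted above when the threshold has never been saved. This is TypeScript for illustration only: `readSecuritySettings`, `FlatConfig`, the `DEFAULT_THRESHOLD` value of 0.7, and the choice to treat a missing `security_enabled` as disabled are all assumptions, not goose's actual implementation. The key names are the ones quoted in this thread and may have changed in a later commit.

```typescript
// Hypothetical default; the thread does not state the real default threshold.
const DEFAULT_THRESHOLD = 0.7;

interface SecuritySettings {
  enabled: boolean;
  threshold: number;
}

// A parsed config file, modelled as a flat key/value map.
type FlatConfig = Record<string, unknown>;

// Resolve settings from flat keys, falling back to defaults when a key is
// missing or has an unexpected type or out-of-range value.
function readSecuritySettings(config: FlatConfig): SecuritySettings {
  const enabled = config["security_enabled"];
  const threshold = config["security_threshold"];
  const thresholdValid =
    typeof threshold === "number" && threshold >= 0 && threshold <= 1;
  return {
    enabled: typeof enabled === "boolean" ? enabled : false, // assumed default: off
    threshold: thresholdValid ? (threshold as number) : DEFAULT_THRESHOLD,
  };
}

// Only the toggle has been saved, so the threshold comes from the default.
console.log(readSecuritySettings({ security_enabled: true }));
// Both keys saved, as in the config snippet quoted in the thread.
console.log(readSecuritySettings({ security_enabled: true, security_threshold: 0.2 }));
```

Under this reading, a config file with no threshold entry is not an error: the key simply has not been written yet, and the default applies until the user changes it.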

dorien-koelemeijer · 2025-09-24T10:10:39Z

> > so the config is now saved as:
> >
> > security_threshold: 0.2
> > security_enabled: true
> >
> > not in a security: section - is that expected? (also I noted it didn't save the threshold until I changed it, which I assume means default?)
> >
> > @dorien-koelemeijer there are a few other changes not GUI related, what is the best way to check this is still healthy?
> >
> > looks ok otherwise, @DOsinga happier with shape of code?
>
> It'll be security.threshold: xx and security.enabled: true/false. But I think most people will just go through the UI settings now, since it's a lot easier.
>
> Re: non-GUI related changes, you can simply test the changes in goose CLI/desktop version locally. I've tested until the last lint commit and everything was still fine - will do some final testing in the next couple of hours, but it should all be good.

I have tested both the desktop and CLI versions, and everything still seems fine. Testing is the same as for the other PR, if you want to give it a go as well to be safe.

DOsinga

Looks much better, yeah! I think you can delete even more LLM comments - I always do

crates/goose/src/security/security_inspector.rs

ui/desktop/src/components/settings/security/SecurityToggle.tsx

* main:
  docs: Change community page sections (block#4984)
  docs: remove temporary Hacktoberfest issue templates (block#4982)
  Create multi-channel researcher prompt (block#4947)
  docs: Add Community Content section to Community Page (block#4964)
  Allow empty API Key when registering custom provider (block#4977)
  Feat: Add prompt injection detection settings UI + update logging (block#4651)
  Make create_session work concurrently (block#4954)
  Lifei/create save recipe to file (block#4895)

dorien-koelemeijer added 2 commits September 15, 2025 18:06

Configure prompt injection scanning through settings in UI - initial commit

deade94

Make sure config carries over between sessions

9998609

dorien-koelemeijer marked this pull request as draft September 16, 2025 09:46

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch 2 times, most recently from 6c7866a to 82a0c31 Compare September 16, 2025 10:35

dorien-koelemeijer marked this pull request as ready for review September 16, 2025 10:37

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch from 8cd5566 to 82a0c31 Compare September 16, 2025 13:58

some style changes + log user decisions with security id for log analysis

6562601

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch from 8393cf5 to 6562601 Compare September 16, 2025 18:07

michaelneale self-assigned this Sep 16, 2025

lint

5b18b58

dorien-koelemeijer changed the title ~~Feat: Add prompt injection detection settings UI + update security logging~~ Feat: Add prompt injection detection settings UI + update logging Sep 17, 2025

DOsinga reviewed Sep 17, 2025

crates/goose-server/src/routes/reply.rs

crates/goose/src/agents/agent.rs

ui/desktop/src/components/settings/security/SecurityToggle.tsx

clean up code - pt1

6965560

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch 2 times, most recently from 4bd65ae to 1c89c0a Compare September 18, 2025 11:46

cleanup pt2

a79ec4e

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch from 1c89c0a to a79ec4e Compare September 18, 2025 11:47

fix

71db6c8

michaelneale assigned DOsinga and unassigned michaelneale Sep 18, 2025

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch from 56bad4b to f57511e Compare September 19, 2025 10:11

small cleanup

7a9d9d4

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch from 9b5c660 to 7a9d9d4 Compare September 19, 2025 10:13

clean up fetching security finding id for logging

0ae76d5

dorien-koelemeijer force-pushed the feat/add-prompt-injection-detection-settings-ui branch from bd7293a to 0ae76d5 Compare September 19, 2025 11:03

enable prompt injection settings cleanup

d28c448

michaelneale reviewed Sep 19, 2025

crates/goose-server/src/routes/reply.rs

michaelneale reviewed Sep 19, 2025

ui/desktop/src/components/settings/security/SecuritySection.tsx

dorien-koelemeijer added 2 commits September 20, 2025 08:16

fix

04b1de4

remove securitysection.tsx, and include in chatsettingssection.tsx directly

4396b0f

lint

bb9bea5

michaelneale mentioned this pull request Sep 24, 2025

[Security] Implement Security Option for Handling Tool-Call Chains #4691

Closed

DOsinga approved these changes Sep 24, 2025

fix: address PR comments - various fixes in different places

0557a02

DOsinga approved these changes Sep 25, 2025

update security config naming

b4fa07e

michaelneale self-assigned this Sep 30, 2025

fix

720398e

DOsinga merged commit e6a5692 into block:main Oct 3, 2025
10 checks passed

Itz-Agasta pushed a commit to Itz-Agasta/goose that referenced this pull request Oct 7, 2025

Feat: Add prompt injection detection settings UI + update logging (block#4651)

26470ed

Signed-off-by: Itz-Agasta <[email protected]>

This was referenced Oct 8, 2025

chore(release): release version 1.10.0 #5060

Closed

release/1.10.0 #5101

Closed
