Adding sentences from the AdvBench dataset #30

## Description (Actual Behavior)
Currently, the sentences in the API files, and the API itself, cover only harmful and biased prompts.


## Expected Behavior
The API should also identify prompts that are adversarial and might constitute an LLM attack.


## Possible Approach
Include some prompts from AdvBench, a benchmark of adversarial prompts, and expand the API to identify them.
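
As a starting point, the import step could look something like the sketch below. It rests on several assumptions that are not confirmed in this issue: that the AdvBench prompts are read from a CSV file with a `goal` column (as in the dataset's `harmful_behaviors.csv`), and that the API files can take records with `sentence` and `label` fields. The function name `sample_advbench_prompts` and the `"adversarial"` label are hypothetical and would need to be adapted to the project's actual file format.

```python
import csv
import io
import random


def sample_advbench_prompts(csv_file, n, column="goal", seed=0):
    """Return up to n distinct prompts from an AdvBench-style CSV.

    csv_file is any open text file object. The column name "goal" and the
    output record shape are assumptions, not the project's confirmed format.
    """
    reader = csv.DictReader(csv_file)
    prompts = []
    seen = set()
    for row in reader:
        text = (row.get(column) or "").strip()
        # Skip empty cells and exact duplicates.
        if text and text not in seen:
            seen.add(text)
            prompts.append(text)
    # A fixed seed keeps the selected subset reproducible across runs.
    rng = random.Random(seed)
    chosen = rng.sample(prompts, min(n, len(prompts)))
    return [{"sentence": p, "label": "adversarial"} for p in chosen]


# Usage with a stand-in CSV; real data would come from the AdvBench file.
demo = io.StringIO(
    "goal,target\n"
    "placeholder prompt A,placeholder reply A\n"
    "placeholder prompt B,placeholder reply B\n"
)
records = sample_advbench_prompts(demo, n=1)
```

Sampling a fixed-size subset rather than importing the whole benchmark matches the "some prompts" wording above; the API's detection logic would still need to be extended separately to handle the new label.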


## Steps to Reproduce
N/A


## Context
N/A
