FEAT Add scale scorer #274

romanlutz · 2024-07-08T23:48:18Z

Description

So far we don't have a scale scorer that can accommodate more flexible scales than Likert and which don't require level-specific descriptions.

TAP, PAIR, and Crescendo all uses some variation of this. It could help to have a bigger scale and not requiring detailed descriptions of each level which are hard to create and sometimes difficult to distinguish. One could argue that there's no point in distinct levels if there's no describable distinction them but that's a whole different can of worms 😄

Tests and Documentation

Added tests. Holding off on extensive documentation for now since it uses a slightly different signature on the scoring methods which includes the task. If we end up doing the same on all existing scorers we can do a larger adjustment of the docs.

pyrit/datasets/score/scales/scale_system_prompt.yaml

…scale_scorer

pyrit/score/self_ask_scale_scorer.py

…scale_scorer

…/PyRIT into romanlutz/scale_scorer

…scale_scorer

…lated feedback)

dlmgary

Not blocking the PR merge but wanted to make sure comments were read before merging! Feel free to reach out to chat, changes are pretty minor. :)

pyrit/score/self_ask_scale_scorer.py

dlmgary · 2024-07-21T12:57:22Z

pyrit/score/scorer.py

@@ -16,36 +17,39 @@ class Scorer(abc.ABC):
    scorer_type: ScoreType

    @abstractmethod
-    async def score_async(self, request_response: PromptRequestPiece) -> list[Score]:
+    async def score_async(self, request_response: PromptRequestPiece, *, task: Optional[str] = None) -> list[Score]:


Question. Shouldn't the definition be as follow?

Suggested change

async def score_async(self, request_response: PromptRequestPiece, *, task: Optional[str] = None) -> list[Score]:

async def score_async(self, *, request_response: PromptRequestPiece, task: Optional[str] = None) -> list[Score]:

That's the pattern we've been used throughout the code. It will force this PR to modify more files but will keep things consistent and more readable.

I didn't put it there for three reasons. One is what you mentioned: it will break backwards compatibility. Perhaps not a huge reason, granted!

The other is that request_response is not a great name IMO and if I had to bet we'll change that sooner or later at which point that would mean another breaking change (or deprecation over several versions which is incredibly painful).

Lastly, as long as there's only 1 positional arg it's still clear. As soon as you have multiple then the order could be either way and things can get confusing. The task is truly "optional" in the sense that most scorers don't currently use it, so treating it differently makes sense to me.

pyrit/score/self_ask_category_scorer.py

dlmgary · 2024-07-21T13:00:35Z

pyrit/score/scorer.py


        Returns:
            list[Score]: A list of Score objects representing the results.
        """
        raise NotImplementedError("score_async method not implemented")

    @abstractmethod
-    def validate(self, request_response: PromptRequestPiece):
+    def validate(self, request_response: PromptRequestPiece, *, task: Optional[str] = None):


Same suggestion as below.

Suggested change

def validate(self, request_response: PromptRequestPiece, *, task: Optional[str] = None):

def validate(self, *, request_response: PromptRequestPiece, task: Optional[str] = None):

see #274 (comment)

pyrit/score/self_ask_conversation_objective_scorer.py

pyrit/score/self_ask_scale_scorer.py

dlmgary · 2024-07-21T13:04:36Z

pyrit/score/self_ask_scale_scorer.py

+        chat_target: PromptChatTarget,
+        scale_path: Optional[Path] = None,
+        scale: Optional[Scale] = None,
+        memory: MemoryInterface = None,


Suggested change

memory: MemoryInterface = None,

memory: Optional[MemoryInterface] = None,

Pre-commit hooks should've complained about this! :P

dlmgary · 2024-07-21T13:07:29Z

pyrit/score/self_ask_scale_scorer.py

+        self._system_prompt = scoring_instructions_template.apply_custom_metaprompt_parameters(**system_prompt_kwargs)
+
+        self._chat_target: PromptChatTarget = chat_target


nit: remove extra line here.

Suggested change

self._system_prompt = scoring_instructions_template.apply_custom_metaprompt_parameters(**system_prompt_kwargs)

self._chat_target: PromptChatTarget = chat_target

self._system_prompt = scoring_instructions_template.apply_custom_metaprompt_parameters(**system_prompt_kwargs)

self._chat_target: PromptChatTarget = chat_target

I suppose the rule of thumb is that if flake8 / black etc. don't mind then neither do we (?)

pyrit/score/self_ask_scale_scorer.py

…async

…scale_scorer

@dlmgary

As discussed I'm resetting this one and will check with you offline @dlmgary ! If needed I'll start a separate follow-up PR

add scale scorer

70fae8c

romanlutz commented Jul 8, 2024

View reviewed changes

pyrit/datasets/score/scales/scale_system_prompt.yaml Show resolved Hide resolved

romanlutz added 4 commits July 11, 2024 16:31

expand scale scorer to accept task as a separate arg

14008ba

Merge branch 'main' of https://github.com/Azure/PyRIT into romanlutz/…

91eb07e

…scale_scorer

pre-commit linting

40b48c1

mypy, other simplifications, new tests

b11ab4e

romanlutz marked this pull request as ready for review July 12, 2024 02:17

Merge branch 'main' into romanlutz/scale_scorer

f7138b4

romanlutz mentioned this pull request Jul 12, 2024

[FEAT] Implement PAIR #255

Merged

rdheekonda reviewed Jul 12, 2024

View reviewed changes

pyrit/score/self_ask_scale_scorer.py Outdated Show resolved Hide resolved

rdheekonda reviewed Jul 12, 2024

View reviewed changes

pyrit/score/self_ask_scale_scorer.py Show resolved Hide resolved

rdheekonda approved these changes Jul 12, 2024

View reviewed changes

rlundeen2 reviewed Jul 12, 2024

View reviewed changes

pyrit/score/self_ask_scale_scorer.py Outdated Show resolved Hide resolved

rlundeen2 approved these changes Jul 12, 2024

View reviewed changes

rlundeen2 reviewed Jul 15, 2024

View reviewed changes

pyrit/score/self_ask_scale_scorer.py Outdated Show resolved Hide resolved

romanlutz mentioned this pull request Jul 15, 2024

FEAT: Add tree of attacks with pruning #210

Merged

romanlutz added 4 commits July 15, 2024 21:54

Merge branch 'main' of https://github.com/Azure/PyRIT into romanlutz/…

3284150

…scale_scorer

Merge branch 'romanlutz/scale_scorer' of https://github.com/romanlutz…

0fefefc

…/PyRIT into romanlutz/scale_scorer

Merge branch 'main' of https://github.com/Azure/PyRIT into romanlutz/…

f274b10

…scale_scorer

Expand signature of all scorers to accommodate task (and smaller unre…

8c31858

…lated feedback)

dlmgary previously requested changes Jul 21, 2024

View reviewed changes

rlundeen2 reviewed Jul 22, 2024

View reviewed changes

pyrit/score/self_ask_scale_scorer.py Outdated Show resolved Hide resolved

romanlutz added 5 commits July 22, 2024 16:28

remove to-str conversion of score value, remove duplicate score_text_…

1ffef3c

…async

types

babaa40

types

50f1465

Merge branch 'main' of https://github.com/Azure/PyRIT into romanlutz/…

f71c953

…scale_scorer

fix test outputs

0d02025

dlmgary self-requested a review July 23, 2024 01:08

Merge branch 'main' into romanlutz/scale_scorer

c05a1c9

romanlutz merged commit 61a7502 into Azure:main Jul 23, 2024
4 checks passed

romanlutz deleted the romanlutz/scale_scorer branch July 23, 2024 04:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT Add scale scorer #274

FEAT Add scale scorer #274

romanlutz commented Jul 8, 2024 •

edited

Loading

dlmgary left a comment

dlmgary Jul 21, 2024

romanlutz Jul 22, 2024

dlmgary Jul 21, 2024

romanlutz Jul 22, 2024

dlmgary Jul 21, 2024

romanlutz Jul 22, 2024

dlmgary Jul 21, 2024

romanlutz Jul 22, 2024

	async def score_async(self, request_response: PromptRequestPiece, *, task: Optional[str] = None) -> list[Score]:
	async def score_async(self, *, request_response: PromptRequestPiece, task: Optional[str] = None) -> list[Score]:

	def validate(self, request_response: PromptRequestPiece, *, task: Optional[str] = None):
	def validate(self, *, request_response: PromptRequestPiece, task: Optional[str] = None):

	memory: MemoryInterface = None,
	memory: Optional[MemoryInterface] = None,

		self._system_prompt = scoring_instructions_template.apply_custom_metaprompt_parameters(**system_prompt_kwargs)

		self._chat_target: PromptChatTarget = chat_target

FEAT Add scale scorer #274

FEAT Add scale scorer #274

Conversation

romanlutz commented Jul 8, 2024 • edited Loading

Description

Tests and Documentation

dlmgary left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

romanlutz commented Jul 8, 2024 •

edited

Loading