-
Notifications
You must be signed in to change notification settings - Fork 238
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
generator: vision nims #959
Conversation
…esn't work; handle misformed NIM headers explicity; handle oversized images
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Test look good.
arak LLM vulnerability scanner v0.9.0.16.post1 ( https://github.com/leondz/garak ) at 2024-10-28T16:17:57.248817
📜 logging to /home/jemartin/.local/share/garak/garak.log
🦜 loading generator: NIM: microsoft/phi-3-vision-128k-instruct
📜 reporting to /home/jemartin/.local/share/garak/garak_runs/garak.ba3fbd2c-70fd-459f-b938-890b14609e6c.report.jsonl
🕵️ queue of probes: visual_jailbreak.FigStep
visual_jailbreak.FigStep visual_jailbreak.FigStep: FAIL ok on 65/ 70 (failure rate: 7.14%)
📜 report closed :) /home/jemartin/.local/share/garak/garak_runs/garak.ba3fbd2c-70fd-459f-b938-890b14609e6c.report.jsonl
📜 report html summary being written to /home/jemartin/.local/share/garak/garak_runs/garak.ba3fbd2c-70fd-459f-b938-890b14609e6c.report.html
✔️ garak run complete in 135.21s
garak LLM vulnerability scanner v0.9.0.16.post1 ( https://github.com/leondz/garak ) at 2024-10-28T16:22:59.141907
📜 logging to /home/jemartin/.local/share/garak/garak.log
🦜 loading generator: NIM: microsoft/phi-3-vision-128k-instruct
📜 reporting to /home/jemartin/.local/share/garak/garak_runs/garak.df5ba817-adcb-4660-9c6b-e5507e601dab.report.jsonl
🕵️ queue of probes: encoding.InjectBase64
encoding.InjectBase64 encoding.DecodeMatch: PASS ok on 55/ 55
📜 report closed :) /home/jemartin/.local/share/garak/garak_runs/garak.df5ba817-adcb-4660-9c6b-e5507e601dab.report.jsonl
📜 report html summary being written to /home/jemartin/.local/share/garak/garak_runs/garak.df5ba817-adcb-4660-9c6b-e5507e601dab.report.html
✔️ garak run complete in 7.94s
DEFAULT_PARAMS = { | ||
"strict_modality_match": False, | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems reasonable to be in the harness for now, however I see some possible contention as noted in the description questions.
I suspect strict modality matching requirements may lean toward being a responsibility of the probe to define requirements. I suspect the technique employed in the probe will set the requirement for strict match as generators may often support various modalities.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Interested to hear more
Signed-off-by: Jeffrey Martin <[email protected]>
29f2574
to
a24577a
Compare
resolves #691
nim.NIMVision
, following thehuggingface.LLaVa
patternattempt
to allow non-string promptsOpenAICompatible
Verification
garak -m nim.NIMVision -n microsoft/phi-3-vision-128k-instruct -p visual_jailbreak.FigStep -g 1 --parallel_attempts 16
Questions:
NIMVision._prepare_prompt()
?huggingface.LLaVa
now needs strict matching. Tagging Enable more complexprompts
#602 for good and Prompt Architecture Enhancement for Better Multi-modal Red Teaming #658.