McGill NLP
Research group within McGill University and Mila focusing on various topics in natural language processing.
Pinned Loading
Repositories
Showing 10 of 52 repositories
- llmsafety Public Forked from JailbreakBench/jailbreakbench
A fork of JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
McGill-NLP/llmsafety’s past year of commit activity - mcgill-nlp.github.io Public
McGill-NLP/mcgill-nlp.github.io’s past year of commit activity - DIVERS-Bench Public
McGill-NLP/DIVERS-Bench’s past year of commit activity - weblinx-browsergym Public
McGill-NLP/weblinx-browsergym’s past year of commit activity - mSTEB Public
McGill-NLP/mSTEB’s past year of commit activity - bias-bench Public
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
McGill-NLP/bias-bench’s past year of commit activity - AdversarialTriggers Public
TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
McGill-NLP/AdversarialTriggers’s past year of commit activity - agent-reward-bench Public
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
McGill-NLP/agent-reward-bench’s past year of commit activity - meaning-change Public
McGill-NLP/meaning-change’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…