Repository for the EMNLP 2024 Findings paper “On Evaluating Explanation Utility for Human-AI Decision Making in NLP” by Fateme Hashemi Chaleshtori, Atreya Ghosal, Alexander Gill, Purbid Bambroo, and Ana Marasović
Create a new Python 3.10 environment:
conda create --name expl_utility python=3.10
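Activate the environment before installing anything into it:
conda activate expl_utility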
Install the dependencies from the requirements file:
pip3 install -r requirements.txt
To fine-tune Flan-T5-3B on the ContractNLI, SciFact-Open, ILDC, and EvidenceInference-v2 datasets:
cd model_development/data
sh download_data.sh
cd ..
sh run_ft.sh <dataset_name> <batch_size> <epochs> <learning_rate> <save_dir>
Here <dataset_name> must be one of scifact-open, contract-nli, evidenceinference, or ildc; the remaining arguments are optional. Example:
sh run_ft.sh scifact-open 2 10 0.00005 ./ftmodels/scifact_open/
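After fine-tuning, the checkpoint in <save_dir> can be loaded for inference. The following is a minimal sketch with Hugging Face Transformers, assuming run_ft.sh saves a standard Transformers seq2seq checkpoint (the path matches the example above; the input string is a hypothetical placeholder):

from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Path from the example command above; change it to your <save_dir>.
checkpoint_dir = "./ftmodels/scifact_open/"

tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint_dir)

# Hypothetical input; format it the way the fine-tuning data was formatted.
inputs = tokenizer("Claim: ... Evidence: ...", return_tensors="pt", truncation=True)
outputs = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))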
Run the following commands to fine-tune Llama-2-13B on the deferral task:
cd deferral
sh run_def.sh
Prompts for zero-shot and few-shot learning with GPT-4 are provided in the deferral_model directory.
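To run one of these prompts against GPT-4, a call along the following lines works with the OpenAI Python SDK (the prompt file name below is a hypothetical placeholder; substitute an actual file from deferral_model):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical file name; replace with an actual prompt file from deferral_model.
with open("deferral_model/zero_shot_prompt.txt") as f:
    prompt = f.read()

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": prompt}],
)
print(response.choices[0].message.content)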
You can find Qualtrics templates for the different studies under the human_study_templates directory; they can be used to replicate the human studies according to our guidelines. To quickly preview the format of each study, follow the links provided in the same directory.