MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems

License: MIT

This is the implementation of the EMNLP 2020 paper:

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems. Zhaojiang Lin, Andrea Madotto, Genta Indra Winata, Pascale Fung [PDF]

Citation:

If you use any source code or datasets from this toolkit in your work, please cite the following paper. The BibTeX entry is listed below:

@article{lin2020mintl,
    title={MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems},
    author={Zhaojiang Lin and Andrea Madotto and Genta Indra Winata and Pascale Fung},
    journal={arXiv preprint arXiv:2009.12005},
    year={2020}
}

Abstract:

In this paper, we propose Minimalist Transfer Learning (MinTL) to simplify the system design process of task-oriented dialogue systems and alleviate the over-dependency on annotated data. MinTL is a simple yet effective transfer learning framework which allows us to plug-and-play pre-trained seq2seq models and jointly learn dialogue state tracking and dialogue response generation. Unlike previous approaches, which use a copy mechanism to "carryover" the old dialogue states to the new one, we introduce Levenshtein belief spans (Lev), which allow efficient dialogue state tracking with a minimal generation length. We instantiate our learning framework with two pre-trained backbones: T5 (Raffel et al., 2019) and BART (Lewis et al., 2019), and evaluate them on MultiWOZ. Extensive experiments demonstrate that: 1) our systems establish new state-of-the-art results on end-to-end response generation, 2) MinTL-based systems are more robust than baseline methods in the low-resource setting, achieving competitive results with only 20% of the training data, and 3) Lev greatly improves the inference efficiency.
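The key idea behind Lev is that, at each turn, the decoder generates only the edit to the previous belief state rather than regenerating the full state, so unchanged slots are carried over with no extra decoding cost. Below is a minimal sketch of that update scheme, assuming a flat dict-based belief state; the apply_lev helper and the edit format are hypothetical simplifications for illustration, not this repo's implementation.

# Illustrative sketch of the Levenshtein belief span idea (not the repo's code).
def apply_lev(prev_state, lev_edits):
    """Apply a (hypothetical) Lev edit: update slots, delete emptied ones."""
    new_state = dict(prev_state)      # carry over the old dialogue state
    for slot, value in lev_edits.items():
        if value is None:             # a deletion edit removes the slot
            new_state.pop(slot, None)
        else:                         # an insertion/update edit overwrites it
            new_state[slot] = value
    return new_state

# Turn t: the user asked for a cheap hotel in the north.
state = {"hotel-pricerange": "cheap", "hotel-area": "north"}

# Turn t+1: the user changes the area; the model generates only this small edit.
state = apply_lev(state, {"hotel-area": "east"})
print(state)  # {'hotel-pricerange': 'cheap', 'hotel-area': 'east'}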

Dependencies

Check the required packages in requirements.txt, or simply run the command

❱❱❱ pip install -r requirements.txt

Experiments Setup

We use the preprocessing script from DAMD. Please check setup.sh for data preprocessing.

Experiments

T5 End2End

❱❱❱ python train.py --mode train --context_window 2 --pretrained_checkpoint t5-small --cfg seed=557 batch_size=32

T5 DST

❱❱❱ python DST.py --mode train --context_window 3 --cfg seed=557 batch_size=32

BART End2End

❱❱❱ python train.py --mode train --context_window 2 --pretrained_checkpoint bart-large-cnn --gradient_accumulation_steps 8 --lr 3e-5 --back_bone bart --cfg seed=557 batch_size=8

BART DST

❱❱❱ python DST.py --mode train --context_window 3 --gradient_accumulation_steps 10 --pretrained_checkpoint bart-large-cnn --back_bone bart --lr 1e-5 --cfg seed=557 batch_size=4
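The BART runs above use gradient accumulation to keep the effective batch size large despite the small per-step batch_size: with batch_size=8 and --gradient_accumulation_steps 8, parameters are updated once per 64 examples. Below is a generic PyTorch sketch of the pattern on a toy model; it illustrates the technique only and is not this repo's training loop.

import torch
from torch import nn

# Toy stand-ins; in this repo the model is a T5/BART seq2seq on MultiWOZ.
model = nn.Linear(10, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)
data = [(torch.randn(8, 10), torch.randn(8, 1)) for _ in range(16)]

accumulation_steps = 8  # mirrors --gradient_accumulation_steps 8

optimizer.zero_grad()
for step, (x, y) in enumerate(data):
    # Scale the loss so accumulated gradients average over the effective batch.
    loss = nn.functional.mse_loss(model(x), y) / accumulation_steps
    loss.backward()                      # gradients accumulate across micro-batches
    if (step + 1) % accumulation_steps == 0:
        optimizer.step()                 # one update per effective batch of 64
        optimizer.zero_grad()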

Check run.py for more information.
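For reference, the plug-and-play backbones correspond to standard HuggingFace seq2seq classes. The snippet below is a rough sketch of loading them with a recent transformers release; note that newer Hub versions name the BART checkpoint facebook/bart-large-cnn rather than the bare bart-large-cnn used in the commands above.

from transformers import (AutoTokenizer, BartForConditionalGeneration,
                          T5ForConditionalGeneration)

# T5 backbone, as in the t5-small runs above.
t5_tok = AutoTokenizer.from_pretrained("t5-small")
t5 = T5ForConditionalGeneration.from_pretrained("t5-small")

# BART backbone; recent Hub releases use the "facebook/" prefix.
bart_tok = AutoTokenizer.from_pretrained("facebook/bart-large-cnn")
bart = BartForConditionalGeneration.from_pretrained("facebook/bart-large-cnn")

# Both expose the same seq2seq interface, which is what makes them
# interchangeable backbones for joint DST and response generation.
inputs = t5_tok("hello world", return_tensors="pt")
out = t5.generate(**inputs, max_length=16)
print(t5_tok.decode(out[0], skip_special_tokens=True))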
