
Structural Adapters in Pretrained Language Models for AMR-to-Text Generation

This repository contains the code for the EMNLP 2021 paper "Structural Adapters in Pretrained Language Models for AMR-to-Text Generation". Please cite the paper if you find it useful.

StructAdapt is an adapter-based method that encodes graph structures into pretrained language models. Contrary to prior work, StructAdapt effectively models interactions among the nodes based on the graph connectivity, only training graph structure-aware adapter parameters.

Figure 1(a) shows a standard adapter, whereas Figure 1(b) illustrates StructAdapt.
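In a structural adapter, the adapter's usual down-projection is replaced by a graph convolution, so node interactions in the bottleneck follow the graph connectivity. Below is a minimal sketch of this idea using PyTorch Geometric's GCNConv; it is an illustration, not the repository's actual implementation:

import torch.nn as nn
from torch_geometric.nn import GCNConv

class StructAdapterLayer(nn.Module):
    # Adapter bottleneck whose down-projection is a graph convolution,
    # so node interactions follow the input graph's connectivity.
    def __init__(self, hidden_size, bottleneck_size):
        super().__init__()
        self.graph_conv = GCNConv(hidden_size, bottleneck_size)
        self.up = nn.Linear(bottleneck_size, hidden_size)
        self.act = nn.ReLU()

    def forward(self, hidden_states, edge_index):
        # hidden_states: (num_nodes, hidden_size); edge_index: (2, num_edges)
        z = self.act(self.graph_conv(hidden_states, edge_index))
        return hidden_states + self.up(z)  # residual connection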

Datasets

In our experiments, we use the following datasets: AMR2017 (LDC2017T10) and AMR2020 (LDC2020T02).

Environment

The easiest way to proceed is to create a conda environment:

conda create -n structadapt python=3.6

Next, install PyTorch and PyTorch Geometric:

conda install -c pytorch pytorch=1.6.0
pip install torch-scatter==2.0.5 -f https://data.pyg.org/whl/torch-1.6.0+${CUDA}.html
pip install torch-sparse==0.6.8 -f https://data.pyg.org/whl/torch-1.6.0+${CUDA}.html
pip install torch-geometric==1.6.1

where ${CUDA} is your CUDA version (e.g., cpu or cu102). Please refer to the PyTorch Geometric installation page for more details.

Finally, install the required packages:

pip install -r requirements.txt
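To check that the environment is set up correctly, a quick (hypothetical) sanity check:

import torch, torch_geometric
print(torch.__version__)            # expect 1.6.0
print(torch_geometric.__version__)  # expect 1.6.1
print(torch.cuda.is_available())    # True if CUDA is set up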

Preprocess

Convert the dataset into the format required for the model:

./preprocess/preprocess_AMR.sh <dataset_folder>
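The script handles the conversion end to end; for intuition, parsing an AMR graph into nodes and edges looks roughly like this (a sketch using the penman library; the actual preprocessing may differ):

import penman

graph = penman.decode("(w / want-01 :ARG0 (b / boy) :ARG1 (g / go-02 :ARG0 b))")
nodes = {var: concept for var, _, concept in graph.instances()}
edges = [(src, role, tgt) for src, role, tgt in graph.edges()]
print(nodes)  # {'w': 'want-01', 'b': 'boy', 'g': 'go-02'}
print(edges)  # [('w', ':ARG0', 'b'), ('w', ':ARG1', 'g'), ('g', ':ARG0', 'b')]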

Finetuning

To train StructAdapt on the AMR dataset, execute:

./train.sh <model> <data_dir> <gpu_id> 

Options for <model> are t5-small, t5-base, and t5-large.

Example:

./train.sh t5-large data/processed_amr 0
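During finetuning, only the adapter parameters are trained; the pretrained T5 weights remain frozen. A minimal sketch of that selection (the substring "adapter" is illustrative, not necessarily the repository's module naming):

from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained("t5-large")
for name, param in model.named_parameters():
    param.requires_grad = "adapter" in name  # train adapters only, freeze the PLM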

Decoding

For decoding, run:

./test.sh <model> <checkpoint_folder> <data_dir> <gpu_id>

Example:

./test.sh t5-base ckpt-folder data/processed_amr 0
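Under the hood, decoding is standard seq2seq beam search with the finetuned model; a generic transformers sketch, leaving out the graph-specific adapter inputs that test.sh wires in (paths are hypothetical):

from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("ckpt-folder")  # hypothetical checkpoint path
inputs = tokenizer("(w / want-01 :ARG0 (b / boy))", return_tensors="pt")
output_ids = model.generate(inputs.input_ids, num_beams=5, max_length=100)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))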

Trained Model

A checkpoint of StructAdapt (T5-large) trained on the LDC2020T02 dataset can be found here. This model achieves a BLEU score of 48.21. The outputs can be downloaded here.
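To score the released outputs yourself, BLEU can be computed with, for example, sacrebleu (file names are hypothetical):

import sacrebleu

hypotheses = open("outputs.txt").read().splitlines()
references = [open("references.txt").read().splitlines()]
print(sacrebleu.corpus_bleu(hypotheses, references).score)  # the checkpoint above reports 48.21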

More

For more details regarding hyperparameters, please refer to the HuggingFace documentation.

Contact person: Leonardo Ribeiro, [email protected]

Citation

@inproceedings{ribeiro-etal-2021-structural,
    title = "Structural Adapters in Pretrained Language Models for {AMR}-to-{T}ext Generation",
    author = "Ribeiro, Leonardo F. R.  and
      Zhang, Yue  and
      Gurevych, Iryna",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2021",
    address = "Online and Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-main.351",
    pages = "4269--4282",
    abstract = "Pretrained language models (PLM) have recently advanced graph-to-text generation, where the input graph is linearized into a sequence and fed into the PLM to obtain its representation. However, efficiently encoding the graph structure in PLMs is challenging because such models were pretrained on natural language, and modeling structured data may lead to catastrophic forgetting of distributional knowledge. In this paper, we propose StructAdapt, an adapter method to encode graph structure into PLMs. Contrary to prior work, StructAdapt effectively models interactions among the nodes based on the graph connectivity, only training graph structure-aware adapter parameters. In this way, we incorporate task-specific knowledge while maintaining the topological structure of the graph. We empirically show the benefits of explicitly encoding graph structure into PLMs using StructAdapt, outperforming the state of the art on two AMR-to-text datasets, training only 5.1{\%} of the PLM parameters.",
}
