Hierarchical Visual Context Fusion Transformer

The source code for Multimodal Relation Extraction via a Mixture of Hierarchical Visual Context Learners.

Data preprocessing

MNRE dataset

Due to the large size of the MNRE dataset, please download the dataset from the original repository.

Unzip the data and rename the directory to mnre, then place it under the data directory:

mkdir data logs ckpt
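
A typical sequence for placing the dataset is sketched below. The archive and unzipped directory names are only illustrative; use whatever names the original MNRE release actually provides.

# Hypothetical archive/directory names; adjust them to match the downloaded release.
unzip mnre_data.zip
mv MNRE-main data/mnre
ls data/mnre    # verify the dataset files are in place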

We also use the detected visual objects provided in previous work, which can be downloaded with the following commands:

cd data/
wget 120.27.214.45/Data/re/multimodal/data.tar.gz
tar -xzvf data.tar.gz
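
After extraction, you can quickly check that everything landed under data/ where the training script expects it (the exact file names depend on the release):

ls data/    # should contain mnre/ plus the extracted visual-object files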

Dependencies

Install all necessary dependencies:

pip install -r requirements.txt
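
If you prefer an isolated environment, the dependencies can be installed into a virtual environment first. This step is optional and not part of the original setup; the package list itself still comes from requirements.txt.

python -m venv .venv          # or use conda, if preferred
source .venv/bin/activate
pip install -r requirements.txt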

Training the model

The best hyperparameters we found are written in the run_mre.sh file.

You can simply run the bash script for multimodal relation extraction:

bash run_mre.sh
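
For long training runs, you may want to detach the script and keep a log in the logs/ directory created earlier. The log file name here is just an example:

nohup bash run_mre.sh > logs/train.log 2>&1 &
tail -f logs/train.log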
