Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering

This repo contains the official implementation for the AAAI paper Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering

Abstract

Knowledge-based visual question answering (VQA) is a vision-language task that requires an agent to correctly answer image-related questions using knowledge that is not presented in the given image. It is not only a more challenging task than regular VQA but also a vital step towards building a general VQA system. Most existing knowledge-based VQA systems process knowledge and image information similarly and ignore the fact that the knowledge base (KB) contains complete information about a triplet, while the extracted image information might be incomplete as the relations between two objects are missing or wrongly detected. In this paper, we propose a novel model named dynamic knowledge memory enhanced multi-step graph reasoning (DMMGR), which performs explicit and implicit reasoning over a key-value knowledge memory module and a spatial-aware image graph, respectively. Specifically, the memory module learns a dynamic knowledge representation and generates a knowledge-aware question representation at each reasoning step. Then, this representation is used to guide a graph attention operator over the spatial-aware image graph. Our model achieves new stateof-the-art accuracy on the KRVQR and FVQA datasets. We also conduct ablation experiments to prove the effectiveness of each component of the proposed model.

Illustration of Our Method

Runing

sh run_graph.sh

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
experiment_configs		experiment_configs
images		images
model_zoo		model_zoo
testing		testing
x		x
README.md		README.md
run.py		run.py
run_graph.py		run_graph.py
run_graph.sh		run_graph.sh
run_ques_estimator.py		run_ques_estimator.py
test_cgrm_dataset.py		test_cgrm_dataset.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering

Abstract

Illustration of Our Method

Runing

About

Releases

Packages

Languages

Mingxiao-Li/DMMGR

Folders and files

Latest commit

History

Repository files navigation

Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering

Abstract

Illustration of Our Method

Runing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages