universal_attention

Experiments in meta-learning visual attention in convolutional neural networks.

I was curious to see whether neural networks could learn task-independent visual attention -- how to pick out what's important in a scene without necessarily knowing what it is. This repository contains the models and experiment code I used to play around with this idea. Written for the final project of Caltech's CNS186 (Vision) class.

Intuition

Some things are a lot more salient to humans than others, just by their visual characteristics (bright colors, sharp contrasts, ...). It's often possible to pick objects out from the background without knowing much about the objects themselves. Therefore, "universal attention" seems learnable.

Hopefully, once the network has a good sense of where things are in an image, it can use that to learn faster: cutting out noisy background pixels should give the network a cleaner signal to train on.

How it works

I train a variant of ResNet50 with attention (Jetley et al. 2018) and see what the model learns to focus on. To (hopefully) learn a broad notion of what's interesting in an image, I meta-train the network with Reptile (Nichol et al. 2018) on six datasets.
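
For intuition, here is a minimal sketch of attention in the style of Jetley et al. 2018: dot-product compatibility scores between local CNN features and a global descriptor are softmaxed into a spatial attention map, which then pools the local features. The names and shapes (attention_pool, local_feats, global_feat) are illustrative assumptions, not this repo's actual code.

```python
# Hedged sketch of attention in the style of Jetley et al. 2018.
# Assumes PyTorch; names and shapes are illustrative, not from this repo.
import torch
import torch.nn.functional as F

def attention_pool(local_feats, global_feat):
    """
    local_feats: (B, C, H, W) feature map from an intermediate ResNet stage.
    global_feat: (B, C) global descriptor from the end of the network.
    Returns the spatial attention map (B, H, W) and the attended feature (B, C).
    """
    B, C, H, W = local_feats.shape
    flat = local_feats.view(B, C, H * W)                    # (B, C, HW)
    # Dot-product compatibility between each location and the global feature.
    scores = torch.einsum("bcl,bc->bl", flat, global_feat)  # (B, HW)
    attn = F.softmax(scores, dim=1)                         # normalize over space
    attended = torch.einsum("bcl,bl->bc", flat, attn)       # attention-weighted sum
    return attn.view(B, H, W), attended
```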

Then, to see if knowing where things are helps the network learn what they are, I evaluate by transfer-learning the model to Caltech101.
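
The Reptile outer loop itself is simple enough to sketch in a few lines. The toy example below (plain NumPy, on synthetic linear-regression tasks; make_task and inner_sgd are hypothetical names, not from this repo) shows the core update: adapt to a sampled task with a few steps of SGD, then move the meta-parameters toward the adapted weights.

```python
# Minimal sketch of the Reptile meta-update (Nichol et al. 2018) on toy
# 1-D linear-regression tasks. Everything here is illustrative.
import numpy as np

rng = np.random.default_rng(0)

def make_task():
    """Sample a toy task: fit y = a*x + b for random (a, b)."""
    a, b = rng.uniform(-1, 1, size=2)
    x = rng.uniform(-1, 1, size=(32, 1))
    return x, a * x + b

def inner_sgd(theta, x, y, lr=0.1, steps=5):
    """Adapt to one task: a few SGD steps on mean-squared error."""
    w, c = theta
    for _ in range(steps):
        pred = x * w + c
        w -= lr * 2 * np.mean((pred - y) * x)
        c -= lr * 2 * np.mean(pred - y)
    return np.array([w, c])

theta = np.zeros(2)  # meta-parameters shared across tasks
meta_lr = 0.5
for _ in range(1000):
    x, y = make_task()
    adapted = inner_sgd(theta, x, y)
    # Reptile update: nudge meta-parameters toward the task-adapted weights.
    theta += meta_lr * (adapted - theta)
```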

Running the code

This project manages dependencies through Pipenv. To install dependencies, run pipenv install from the project directory.

Then, the project can be operated through the three executable scripts under scripts/:

  • run_reptile_experiment.py runs a single experiment with manually picked hyperparameters.
  • train_target.py evaluates an existing checkpoint on the target task (Caltech101).
  • hyperopt_attention.py runs hyperparameter optimization with Hyperopt.

Scripts are run through Pipenv, for example: pipenv run python scripts/some_script.py --run_name=my_run --other_flag=something_else.

All three scripts are configured with command-line flags; run scripts/some_script.py --helpfull to see the full list. They write their logs to --tb_log_dir (default: outputs/logs) and their checkpoints to --checkpoint_dir (default: outputs/checkpoints).

Project structure

├── data/: Default download / usage directory for all data.
├── outputs/
│   ├── checkpoints/: Default directory to save checkpoints in.
│   └── logs/: Default directory to save TensorBoard logs in.
├── images/: Examples of the meta-learned model's attention maps.
├── scripts/
│   ├── hyperopt_attention.py: Run hyperparameter optimization with Hyperopt.
│   ├── run_reptile_experiment.py: Run the meta-training --> evaluation pipeline.
│   └── train_target.py: Evaluate a pre-trained checkpoint on the target dataset.
└── universal_attention/: Holds project source code.
    └── ...
