Code for Paper "Transformers can navigate mazes with Multi-Step Prediction"

Installation

Install PyTorch
pip install pytest submitit hydra-core hydra-submitit-launcher loguru tqdm gitpython transformers lightning matplotlib datasets sortedcontainers maze-dataset pymongo numpy maze-dataset

If you want to run A* mazes (from https://github.com/facebookresearch/searchformer/)

Install mongodb
Download maze.gz and maze.vocabulary.gz from https://github.com/facebookresearch/searchformer/blob/main/doc/mongodb.md
add those to your mongodb
mongorestore --gzip --archive=maze.gz
mongorestore --gzip --archive=maze.vocabulary.gz

adjust locations: search for "TODO" and you will find them:

main.py --> code snapshot dir
train_defaults.yaml --> logs dir
train_defaults.yaml --> data dir

Run next token (AR) Baseline

Locally

python main.py -m mode=local model=gpt dataset=maze datamodule.grid_n=4

use_wandb=False or True to enable or disable debugging

Run MLM-U

python main.py -m mode=local model=past dataset=maze datamodule.grid_n=4

PAST is an encoder-decoder model that runs best with mlm-u (model.train_mode=absorbing). GPT is the best model for AR (left to right next token prediction)

Contributing

See the CONTRIBUTING file for how to help out.

License

This project is Apache 2.0 licensed, as found in the LICENSE file.

The stargraph dataset has been adapted from https://github.com/gregorbachmann/Next-Token-Failures/

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
configs		configs
notebooks		notebooks
recipe		recipe
website		website
.DS_Store		.DS_Store
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
main.py		main.py
test_all.py		test_all.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code for Paper "Transformers can navigate mazes with Multi-Step Prediction"

Installation

If you want to run A* mazes (from https://github.com/facebookresearch/searchformer/)

Run next token (AR) Baseline

Run MLM-U

Contributing

License

About

Releases

Packages

Languages

License

facebookresearch/maze_navigation_MLMU

Folders and files

Latest commit

History

Repository files navigation

Code for Paper "Transformers can navigate mazes with Multi-Step Prediction"

Installation

If you want to run A* mazes (from https://github.com/facebookresearch/searchformer/)

Run next token (AR) Baseline

Run MLM-U

Contributing

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages