
Attention Is All You Need: A PyTorch DirectML Implementation

This is a PyTorch DirectML implementation of the Transformer model from "Attention Is All You Need" (Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin, arXiv, 2017).

This sample is extracted from the PyTorch benchmark suite and has been lightly modified to use torch-directml.
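At a high level, the torch-directml change amounts to creating a DirectML device and moving the model and batches onto it, much as you would with a CUDA device. A minimal sketch of the pattern (the model and tensor here are illustrative, not the sample's actual code):

import torch
import torch_directml

# Select the default DirectML device (an integrated or discrete GPU).
device = torch_directml.device()

# Any nn.Module and tensor can be moved to it in the usual way.
model = torch.nn.Linear(512, 512).to(device)
batch = torch.randn(8, 512).to(device)
out = model(batch)
print(out.device)  # e.g. "privateuseone:0" on recent torch-directml builds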

Usage

1) Install the prerequisites and prepare the data

From inside the attention_is_all_you_need directory, run the following script:

python install.py
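If the script succeeds, it leaves the preprocessed Multi30k data at .data/m30k_deen_shr.pkl, the path the commands below expect. A quick sanity check, assuming the file is an ordinary pickle (its exact layout is an internal detail of the sample):

import pickle

# Confirm the preprocessed data file exists and deserializes.
with open(".data/m30k_deen_shr.pkl", "rb") as f:
    data = pickle.load(f)
print(type(data))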

2) Train the model

python train.py -data_pkl .data/m30k_deen_shr.pkl -log m30k_deen_shr -embs_share_weight -proj_share_weight -label_smoothing -save_model trained -b 128 -warmup 128000 -epoch 400 -use_dml
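This trains on the Multi30k German-to-English data with shared source/target embedding weights (-embs_share_weight), a tied output projection (-proj_share_weight), label smoothing, a batch size of 128 (-b), 128000 warmup steps, and 400 epochs; checkpoints are written under the trained prefix (-save_model). The -use_dml flag is what routes training onto DirectML. A plausible sketch of that wiring, assuming a standard argparse setup (the option and variable names are assumptions, not necessarily the sample's exact code):

import argparse
import torch
import torch_directml

parser = argparse.ArgumentParser()
parser.add_argument("-use_dml", action="store_true")
opt = parser.parse_args()

# Prefer DirectML when requested; otherwise fall back to CUDA or CPU.
if opt.use_dml:
    device = torch_directml.device()
elif torch.cuda.is_available():
    device = torch.device("cuda")
else:
    device = torch.device("cpu")
print(device)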

3) Test the model

python translate.py -data_pkl .data/m30k_deen_shr.pkl -model trained.chkpt -output prediction.txt -use_dml
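This decodes the test split with the saved checkpoint (trained.chkpt) and writes the translations to prediction.txt, one sentence per line. To peek at the first few outputs:

# Print the first few translated sentences from the output file.
with open("prediction.txt", encoding="utf-8") as f:
    for line in f.readlines()[:5]:
        print(line.rstrip())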