Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch
- Implement multiple targets.
- Compare results to those reported in paper.
Main script based on Pytorch Examples.
Attention models taken from the Annotated transformer.
Scripts for downloading the data from salesforce's awd-lstm-lm.