This repository was archived by the owner on Jul 7, 2023. It is now read-only.
Storing encoder-decoder attention history at Transformer's cache during fast decoding.#1602
Merged
lukaszkaiser merged 3 commits intotensorflow:masterfrom Jun 20, 2019
Merged