Skip to content

ysmiraak/tau

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

documentations

quick start

dependencies

to reproduce our results

mkdir data data/raw data/master pred ckpt log

download datasets

download the following parallel corpora from europarl and untar to the data/raw directory

prepare the data

cd src
./data.py

train a new model

  • run scripts train[0-5].py in succession, where [0-5] is the trial number
  • the checkpoints will be saved in ckpt
  • the tensorboard summaries will be saved in log
  • by the naming pattern m[0-5]_
  • a checkpoint number also is appended for the checkpoints

evaluate translation

  • run eval_all.py for translating between all language pairs
  • run eval_nl_da.py for translating only between dutch and danish
  • the trial number (C.trial) and the checkpoint number (C.ckpt or ckpt) needs to be set first
  • the translations for the evaluation set will be saved in pred
  • run sacrebleu --force -tok intl -b -i with the path to the predicted translation
  • and the path to the reference translation (saved as data/master/eval_*.txt by data.py)

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages