Skip to content

Latest commit

 

History

History
15 lines (6 loc) · 603 Bytes

parallel_text.md

File metadata and controls

15 lines (6 loc) · 603 Bytes

Parellel text / bitext Sample hello, how are you مرحبا، كيف الحال fine and you تمام وانت كيفك

Methods in MT

  1. Cancat source and target texts (with markers), use autoregressive RNN with the input sequence of the source language to generate the sequence of the target language

  2. use encoder-decoder (seq2seq): input sequence is provided to encoder (RNN or LSTM or GRU), generate context vector (last hidden state), use context vector as input to decoder (RNN or LSTM or GRU) to generate the sequence in the target language