
Can you detail the text preprocessing needed during finetuning with your pretrained model? #12

Open
wailoktam opened this issue Jul 16, 2020 · 0 comments

Comments

@wailoktam

The part about tokenizing with MeCab is clear, but what about the sub-word tokenization? And what about words that appear in the finetuning data but not in the pretraining data? Some guidance on using your pretrained model would be great.
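
For reference, here is a minimal sketch of the usual two-stage pipeline (MeCab word split, then WordPiece subword split), assuming the model follows the Hugging Face `transformers` conventions for Japanese BERT. The checkpoint name `cl-tohoku/bert-base-japanese` is an illustrative placeholder, not necessarily this repository's model:

```python
# A sketch only: assumes a transformers-compatible Japanese BERT checkpoint.
from transformers import BertJapaneseTokenizer

tokenizer = BertJapaneseTokenizer.from_pretrained(
    "cl-tohoku/bert-base-japanese",      # placeholder checkpoint name
    word_tokenizer_type="mecab",         # stage 1: word split with MeCab
    subword_tokenizer_type="wordpiece",  # stage 2: WordPiece subword split
)

tokens = tokenizer.tokenize("吾輩は猫である。")
print(tokens)
```

Under this scheme, words seen only in the finetuning data need no special handling: WordPiece breaks them into subword pieces from the pretraining vocabulary (marked with `##`), and anything it cannot cover falls back to the `[UNK]` token.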
