You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The serialized data wikipedia_segment ed_part_NN.bin refer WikiNBookCorpusPretrainingDataCreator which has been deleted in the latest code. Adding the following can avoid the issue.
class WikiNBookCorpusPretrainingDataCreator(PretrainingDataCreator):
pass
BERT_pretrain.ipynb
instructs to download https://bertonazuremlwestus2.blob.core.windows.net/public/bert_data.tar.gz for the preprocessed data. The tar file contains data in512/wikipedia_pretrain
, but it should be512/wiki_pretrain
.The text was updated successfully, but these errors were encountered: