[BERT] Distributed Training Support #478
Comments
@eric-haibin-lin Support in the official code would probably be better and more reliable. Thanks. |
Hi @ymjiang I have some code locally and I am still testing it. In general you need to:
|
@eric-haibin-lin Thank you very much for the tips. I have another question about changing the |
Hi @ymjiang, BucketSampler is a way to create data batches of similar lengths for a dataset: https://github.com/szha/KDD18-Gluon/blob/master/05_data_pipeline/2-data-pipeline.ipynb GluonNLP has the concept of "streams". In particular, a dataset stream is an iterator that loads one dataset (typically a file) at a time. If your training data consists of multiple files, you can pass a SplitSampler to DatasetStream so that each machine iterates through a subset of the files. |
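To make the two ideas above concrete, here is a minimal pure-Python sketch. It is not the GluonNLP implementation: `split_file_indices`, `bucket_batches`, the `bucket_width` parameter, and the `part-000.txt` style file names are all hypothetical names chosen for illustration. The first helper shows one way each machine could claim its own subset of files, and the second shows one way to group samples of similar length into batches.

```python
def split_file_indices(num_files, num_parts, part_index):
    """Return the contiguous range of file indices owned by one worker.

    Hypothetical helper: illustrates the idea of splitting files across
    machines, not the actual SplitSampler code.
    """
    if not 0 <= part_index < num_parts:
        raise ValueError("part_index must be in [0, num_parts)")
    part_len = num_files // num_parts
    start = part_index * part_len
    # The last worker also takes the remainder so that no file is dropped.
    end = num_files if part_index == num_parts - 1 else start + part_len
    return list(range(start, end))


def bucket_batches(lengths, batch_size, bucket_width=10):
    """Group sample indices into batches whose sequence lengths are similar.

    Hypothetical helper: samples whose length falls in the same
    `bucket_width`-wide interval share a bucket, and each bucket is then
    cut into batches of at most `batch_size` samples.
    """
    buckets = {}
    for idx, length in enumerate(lengths):
        buckets.setdefault(length // bucket_width, []).append(idx)
    batches = []
    for key in sorted(buckets):
        members = buckets[key]
        for i in range(0, len(members), batch_size):
            batches.append(members[i:i + batch_size])
    return batches


if __name__ == "__main__":
    # Assumed setup: 10 data files shared by 3 machines.
    files = ["part-%03d.txt" % i for i in range(10)]
    for rank in range(3):
        owned = [files[i] for i in split_file_indices(len(files), 3, rank)]
        print(rank, owned)
    # Sequence lengths of five samples, batched two at a time.
    print(bucket_batches([5, 12, 7, 15, 3], batch_size=2))
```

In a real distributed job, `num_parts` and `part_index` would typically come from the number of workers and the rank of the current worker, so every machine reads a disjoint set of files.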
@eric-haibin-lin I will take a look, thanks! |
Added in #665 |
As title.
Related issues:
apache/mxnet#14124
apache/incubator-mxnet#14073
apache/incubator-mxnet#14072
apache/mxnet#11061
apache/mxnet#14125
apache/mxnet#14126