[MODEL] BERT conversion scripts, SciBERT, BioBERT, ClinicalBERT#735
[MODEL] BERT conversion scripts, SciBERT, BioBERT, ClinicalBERT#735szha merged 18 commits intodmlc:masterfrom
Conversation
Codecov Report
|
Codecov Report
@@ Coverage Diff @@
## master #735 +/- ##
==========================================
+ Coverage 89.95% 90.61% +0.65%
==========================================
Files 64 64
Lines 6064 6064
==========================================
+ Hits 5455 5495 +40
+ Misses 609 569 -40
|
|
@eric-haibin-lin this PR only touches code in the script folder. All other changes are due to #732 and can be ignored while reviewing this. |
2c7a7ba to
501189c
Compare
|
Job PR-735/4 is complete. |
501189c to
d3a9c68
Compare
|
Job PR-735/5 is complete. |
Typically Optimizer parameters are not exported for Bert models (eg. https://github.com/google-research/bert). As this does not always hold, we have to handle the case that the parameters are still present (eg. https://github.com/allenai/scibert).
d3a9c68 to
20950a7
Compare
20950a7 to
c32f538
Compare
|
Job PR-735/7 is complete. |
|
Job PR-735/8 is complete. |
Confirmed that the scibert_scivocab_uncased loaded from PyTorch produces the same output as the tensorflow version (based on the compare_tf_gluon_model.py).
d229a41 to
5614e5b
Compare
|
Job PR-735/14 is complete. |
scripts/bert/conversion_tools/infer_pytorch_gluon_parameter_name_mapping.py
Outdated
Show resolved
Hide resolved
|
Job PR-735/15 is complete. |
|
Job PR-735/16 is complete. |
|
Job PR-735/17 is complete. |
This refactors the tf Bert conversion scripts based on #732.
@eric-haibin-lin