DistilBERT for token classification #1792

stefan-it · 2019-11-11T16:35:53Z

Hi,

this PR adds a DistilBertForTokenClassification implementation (mainly inspired by the BERT implementation) that allows to perform sequence labeling tasks like NER or PoS tagging.

Additionally, the run_ner.py example script was modified to fully support DistilBERT for NER tasks.

I did a small comparison between BERT (large, cased), RoBERTa (large, cased) and DistilBERT (base, uncased) with the same hyperparameters as specified in the example documentation (one run):

Model	F-Score Dev	F-Score Test
`bert-large-cased`	95.59	91.70
`roberta-large`	95.96	91.87
`distilbert-base-uncased`	94.34	90.32

Unit test for the DistilBertForTokenClassification implementation is also added.

LysandreJik · 2019-11-11T17:26:48Z

This is great, thanks @stefan-it!

thomwolf · 2019-11-14T21:40:58Z

This is great, thanks a lot @stefan-it.
I've added your quick benchmark in the readme.

codecov-io · 2019-11-14T21:48:13Z

Codecov Report

Merging #1792 into master will increase coverage by 0.03%.
The diff coverage is 97.29%.

@@            Coverage Diff             @@
##           master    #1792      +/-   ##
==========================================
+ Coverage   84.03%   84.07%   +0.03%     
==========================================
  Files          94       94              
  Lines       14032    14069      +37     
==========================================
+ Hits        11792    11828      +36     
- Misses       2240     2241       +1

Impacted Files	Coverage Δ
transformers/tests/modeling_distilbert_test.py	`99.18% <100%> (+0.08%)`	⬆️
transformers/modeling_distilbert.py	`95.87% <96.15%> (+0.02%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b5d330d...05db5bc. Read the comment docs.

tedgoddard · 2019-11-18T17:24:35Z

I just wanted to say thanks for this PR, it was just what I was looking for at the time.

stefan-it added 4 commits November 11, 2019 16:18

modeling: add DistilBertForTokenClassification implementation

1c7253c

module: add DistilBertForTokenClassification import

1806eab

examples: add DistilBert support for NER fine-tuning

2b07b9e

tests: add test case for DistilBertForTokenClassification implementation

94e5525

added small comparison between BERT, RoBERTa and DistilBERT

05db5bc

thomwolf merged commit 74ce8de into huggingface:master Nov 14, 2019

stefan-it mentioned this pull request Nov 25, 2019

Minor bug fixes on run_ner.py #1918

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DistilBERT for token classification #1792

DistilBERT for token classification #1792

Uh oh!

stefan-it commented Nov 11, 2019

Uh oh!

LysandreJik commented Nov 11, 2019

Uh oh!

thomwolf commented Nov 14, 2019

Uh oh!

codecov-io commented Nov 14, 2019 •

edited

Loading

Uh oh!

tedgoddard commented Nov 18, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

DistilBERT for token classification #1792

DistilBERT for token classification #1792

Uh oh!

Conversation

stefan-it commented Nov 11, 2019

Uh oh!

LysandreJik commented Nov 11, 2019

Uh oh!

thomwolf commented Nov 14, 2019

Uh oh!

codecov-io commented Nov 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

tedgoddard commented Nov 18, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

codecov-io commented Nov 14, 2019 •

edited

Loading