Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
58 commits
Select commit Hold shift + click to select a range
1142960
First commit
Oct 22, 2020
9bb0060
Added TapasForSequenceClassification
Oct 25, 2020
ac313f3
Improved documentation
Oct 25, 2020
e6eeed9
Improved loss calculation and documentation of TapasForQuestionAnswering
Oct 26, 2020
3383e79
Improved documentation
Oct 26, 2020
13cadf7
Some more improved documentation
Oct 26, 2020
13c7cda
Added fix for conversion of TapasForSequenceClassification
Oct 26, 2020
8f80607
First attempt at using nn.linear rather than nn.parameter for TapasFo…
Oct 26, 2020
3114c8c
Bug fix with TapasForSequenceClassification
Oct 26, 2020
da1d0e0
Simplified aggregation head using nn.Linear
Oct 26, 2020
78fc0f4
Improved testing
Oct 27, 2020
58ff811
Minor fix
Oct 27, 2020
86aa165
Fix numbering of models
Oct 27, 2020
eb99107
Added herbert since this was not included
Oct 27, 2020
e259580
Added data utilities which were also not included
Oct 27, 2020
42d5d27
Fixed README
Oct 28, 2020
538a67e
Fixed docs comments LysandreJik and replaced add_start_docstrings_to_…
Oct 30, 2020
4564135
Make TapasConfig inherit from PretrainedConfig instead of BertConfig …
Oct 30, 2020
2f44197
Added tapas utilities to modeling_tapas
Nov 1, 2020
9418015
Make TapasTokenizer inherit from PreTrainedTokenizer rather than Bert…
Nov 1, 2020
c236808
Added missing imports to tokenization_tapas
Nov 1, 2020
7ee992e
Added tokenization_tapas_utilities to tokenization_tapas
Nov 2, 2020
11c0a0d
Address PatrickVonPlaten's comments
Nov 2, 2020
37d8870
Address @LysandreJik's comments
Nov 4, 2020
da59db9
Fix important typo
Nov 4, 2020
e3e2304
Style
LysandreJik Nov 12, 2020
81d56b2
TAPAS tokenizer & tokenizer tests (#8482)
LysandreJik Nov 16, 2020
6407254
Merge pull request #2 from huggingface/tapas-style
NielsRogge Nov 16, 2020
162c17d
Improve numeric value parsing
Nov 17, 2020
bd3ccd5
Add prev_label_ids logic
Nov 17, 2020
8889bb4
Major documentation improvements
Nov 18, 2020
d7c3f64
Implement correct padding of label_ids, numeric_values and numeric_va…
Nov 18, 2020
d4d1ec5
Further improve documentation
Nov 18, 2020
e802b39
Added [EMPTY] special token
Nov 19, 2020
bce3db4
Revert support of empty special token
Nov 19, 2020
1ce6526
Added initial implementation of [EMPTY] token
Nov 20, 2020
b95cef7
Use return_dict=True by default
Nov 25, 2020
a54aa90
Added conversion of TapasModel
Nov 25, 2020
0363264
Address some comments by @patrickvonplaten (mainly docs)
Nov 30, 2020
6b05d83
Do not pass entire config to calculate aggregation loss
Nov 30, 2020
9400277
Bug fix
Nov 30, 2020
4b4a1b4
Address most docs related comments by @sgugger
Dec 1, 2020
8a936f0
Some more docs improvements
Dec 1, 2020
40847a3
Truncation
LysandreJik Dec 2, 2020
cb0395b
Tokenization integration test
LysandreJik Dec 2, 2020
4b93499
TAPAS modeling integration test template
LysandreJik Dec 2, 2020
ec7761b
Fix tests
LysandreJik Dec 2, 2020
50c8b8c
Various fixes
LysandreJik Dec 3, 2020
5cb3708
Minor docs improvements
Dec 4, 2020
bc42e80
Merge branch 'tapas_v3' into tapas-truncation
NielsRogge Dec 4, 2020
4200d40
Merge pull request #3 from huggingface/tapas-truncation
NielsRogge Dec 4, 2020
57b6205
Add model integration test first draft
Dec 4, 2020
545ed40
Added URLs of models in model hub
Dec 4, 2020
cd8f5a1
Fix bug, currently 28 tests passed, 6 failed, 7 skipped
Dec 4, 2020
f38c4c1
Improve testing
Dec 4, 2020
e4fe878
Fix URLs in configuration_tapas.py
Dec 6, 2020
38c567f
Improve integration tests
Dec 6, 2020
2b71201
Add integration test for loss calculation
Dec 6, 2020
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -188,6 +188,7 @@ Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih.
ultilingual BERT into [DistilmBERT](https://github.com/huggingface/transformers/tree/master/examples/distillation) and a German version of DistilBERT.
1. **[SqueezeBert](https://huggingface.co/transformers/model_doc/squeezebert.html)** released with the paper [SqueezeBERT: What can computer vision teach NLP about efficient neural networks?](https://arxiv.org/abs/2006.11316) by Forrest N. Iandola, Albert E. Shaw, Ravi Krishna, and Kurt W. Keutzer.
1. **[T5](https://huggingface.co/transformers/model_doc/t5.html)** (from Google AI) released with the paper [Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer](https://arxiv.org/abs/1910.10683) by Colin Raffel and Noam Shazeer and Adam Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu.
1. **[TAPAS](https://huggingface.co/transformers/master/model_doc/tapas.html)** released with the paper [TAPAS: Weakly Supervised Table Parsing via Pre-training](https://arxiv.org/abs/2004.02349) by Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller, Francesco Piccinno and Julian Martin Eisenschlos.
1. **[Transformer-XL](https://huggingface.co/transformers/model_doc/transformerxl.html)** (from Google/CMU) released with the paper [Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context](https://arxiv.org/abs/1901.02860) by Zihang Dai*, Zhilin Yang*, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov.
1. **[XLM](https://huggingface.co/transformers/model_doc/xlm.html)** (from Facebook) released together with the paper [Cross-lingual Language Model Pretraining](https://arxiv.org/abs/1901.07291) by Guillaume Lample and Alexis Conneau.
1. **[XLM-ProphetNet](https://huggingface.co/transformers/model_doc/xlmprophetnet.html)** (from Microsoft Research) released with the paper [ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training](https://arxiv.org/abs/2001.04063) by Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
Expand Down Expand Up @@ -222,4 +223,4 @@ We now have a [paper](https://arxiv.org/abs/1910.03771) you can cite for the
year={2019},
volume={abs/1910.03771}
}
```
```
16 changes: 10 additions & 6 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -145,22 +145,25 @@ conversion utilities for the following models:
27. :doc:`T5 <model_doc/t5>` (from Google AI) released with the paper `Exploring the Limits of Transfer Learning with a
Unified Text-to-Text Transformer <https://arxiv.org/abs/1910.10683>`__ by Colin Raffel and Noam Shazeer and Adam
Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu.
28. :doc:`Transformer-XL <model_doc/transformerxl>` (from Google/CMU) released with the paper `Transformer-XL:
28. :doc:`TAPAS <model_doc/tapas>` (from Google AI) released with the paper `TAPAS: Weakly Supervised Table Parsing via
Pre-training <https://arxiv.org/abs/2004.02349>`__ by Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller,
Francesco Piccinno and Julian Martin Eisenschlos.
29. :doc:`Transformer-XL <model_doc/transformerxl>` (from Google/CMU) released with the paper `Transformer-XL:
Attentive Language Models Beyond a Fixed-Length Context <https://arxiv.org/abs/1901.02860>`__ by Zihang Dai*,
Zhilin Yang*, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov.
29. :doc:`XLM <model_doc/xlm>` (from Facebook) released together with the paper `Cross-lingual Language Model
30. :doc:`XLM <model_doc/xlm>` (from Facebook) released together with the paper `Cross-lingual Language Model
Pretraining <https://arxiv.org/abs/1901.07291>`__ by Guillaume Lample and Alexis Conneau.
30. :doc:`XLM-ProphetNet <model_doc/xlmprophetnet>` (from Microsoft Research) released with the paper `ProphetNet:
31. :doc:`XLM-ProphetNet <model_doc/xlmprophetnet>` (from Microsoft Research) released with the paper `ProphetNet:
Predicting Future N-gram for Sequence-to-Sequence Pre-training <https://arxiv.org/abs/2001.04063>`__ by Yu Yan,
Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
31. :doc:`XLM-RoBERTa <model_doc/xlmroberta>` (from Facebook AI), released together with the paper `Unsupervised
32. :doc:`XLM-RoBERTa <model_doc/xlmroberta>` (from Facebook AI), released together with the paper `Unsupervised
Cross-lingual Representation Learning at Scale <https://arxiv.org/abs/1911.02116>`__ by Alexis Conneau*, Kartikay
Khandelwal*, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke
Zettlemoyer and Veselin Stoyanov.
32. :doc:`XLNet <model_doc/xlnet>` (from Google/CMU) released with the paper `​XLNet: Generalized Autoregressive
33. :doc:`XLNet <model_doc/xlnet>` (from Google/CMU) released with the paper `​XLNet: Generalized Autoregressive
Pretraining for Language Understanding <https://arxiv.org/abs/1906.08237>`__ by Zhilin Yang*, Zihang Dai*, Yiming
Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le.
33. `Other community models <https://huggingface.co/models>`__, contributed by the `community
34. `Other community models <https://huggingface.co/models>`__, contributed by the `community
<https://huggingface.co/users>`__.

.. toctree::
Expand Down Expand Up @@ -258,6 +261,7 @@ conversion utilities for the following models:
model_doc/roberta
model_doc/squeezebert
model_doc/t5
model_doc/tapas
model_doc/transformerxl
model_doc/xlm
model_doc/xlmprophetnet
Expand Down
Loading