Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
71 commits
Select commit Hold shift + click to select a range
450d9b9
First commit: adding all files from tapas_v3
Dec 7, 2020
5299d57
Fix multiple bugs including soft dependency and new structure of the …
Dec 7, 2020
fb4bf22
Improve testing by adding torch_device to inputs and adding dependenc…
Dec 7, 2020
71c87ae
Use Python 3 inheritance rather than Python 2
Dec 7, 2020
da846e6
First draft model cards of base sized models
Dec 7, 2020
086be40
Remove model cards as they are already on the hub
Dec 7, 2020
122923d
Fix multiple bugs with integration tests
Dec 7, 2020
eb99dc7
All model integration tests pass
Dec 7, 2020
b411f87
Remove print statement
Dec 7, 2020
65003b4
Add test for convert_logits_to_predictions method of TapasTokenizer
Dec 8, 2020
57a66b2
Incorporate suggestions by Google authors
Dec 8, 2020
dc6e866
Fix remaining tests
LysandreJik Dec 8, 2020
6956f2e
Change position embeddings sizes to 512 instead of 1024
Dec 9, 2020
2bc720a
Comment out positional embedding sizes
Dec 9, 2020
89a06d4
Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDING…
Dec 9, 2020
29af69a
Added more model names
Dec 9, 2020
2f1540d
Fix truncation when no max length is specified
LysandreJik Dec 9, 2020
4c976c5
Disable torchscript test
LysandreJik Dec 9, 2020
3092927
Make style & make quality
Dec 9, 2020
13e9e26
Quality
LysandreJik Dec 9, 2020
ee28e0d
Address CI needs
LysandreJik Dec 10, 2020
f4eae58
Test the Masked LM model
LysandreJik Dec 10, 2020
57aa4fe
Fix the masked LM model
LysandreJik Dec 10, 2020
c67071b
Truncate when overflowing
LysandreJik Dec 10, 2020
729cf9b
More much needed docs improvements
Dec 11, 2020
4e8df2d
Fix some URLs
Dec 11, 2020
d9ec541
Some more docs improvements
Dec 11, 2020
0e06a3d
Test PyTorch scatter
LysandreJik Dec 11, 2020
c05e5fd
Set to slow + minify
LysandreJik Dec 11, 2020
b6a955b
Merge pull request #8 from huggingface/tapas_v4_final
NielsRogge Dec 11, 2020
054f023
Calm flake8 down
LysandreJik Dec 11, 2020
0c66d17
Merge pull request #9 from huggingface/tapas_v4_final
NielsRogge Dec 11, 2020
48b7238
First commit: adding all files from tapas_v3
Dec 7, 2020
899cf12
Fix multiple bugs including soft dependency and new structure of the …
Dec 7, 2020
fbab808
Improve testing by adding torch_device to inputs and adding dependenc…
Dec 7, 2020
b791902
Use Python 3 inheritance rather than Python 2
Dec 7, 2020
a5b94fe
First draft model cards of base sized models
Dec 7, 2020
7aadce5
Remove model cards as they are already on the hub
Dec 7, 2020
2d086b4
Fix multiple bugs with integration tests
Dec 7, 2020
0f52b35
All model integration tests pass
Dec 7, 2020
8d28fc2
Remove print statement
Dec 7, 2020
21584c4
Add test for convert_logits_to_predictions method of TapasTokenizer
Dec 8, 2020
54110a4
Incorporate suggestions by Google authors
Dec 8, 2020
4c3c6c3
Fix remaining tests
LysandreJik Dec 8, 2020
4f97d15
Change position embeddings sizes to 512 instead of 1024
Dec 9, 2020
c5466d9
Comment out positional embedding sizes
Dec 9, 2020
cf885ec
Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDING…
Dec 9, 2020
7280993
Added more model names
Dec 9, 2020
e7f0771
Fix truncation when no max length is specified
LysandreJik Dec 9, 2020
6a91097
Disable torchscript test
LysandreJik Dec 9, 2020
29d2a32
Make style & make quality
Dec 9, 2020
5c74867
Quality
LysandreJik Dec 9, 2020
f190fa3
Address CI needs
LysandreJik Dec 10, 2020
5260c5a
Test the Masked LM model
LysandreJik Dec 10, 2020
e041dec
Fix the masked LM model
LysandreJik Dec 10, 2020
7ee545d
Truncate when overflowing
LysandreJik Dec 10, 2020
b252797
More much needed docs improvements
Dec 11, 2020
f1a10df
Fix some URLs
Dec 11, 2020
89f5229
Some more docs improvements
Dec 11, 2020
9545e80
Add add_pooling_layer argument to TapasModel
Dec 14, 2020
830cb08
Fix merge conflicts
Dec 14, 2020
ee09c36
Fix issue in docs + fix style and quality
Dec 14, 2020
b4f6b6c
Clean up conversion script and add task parameter to TapasConfig
Dec 14, 2020
c4c94d1
Revert the task parameter of TapasConfig
Dec 14, 2020
e7d0ce9
Improve conversion script and add test for absolute position embeddings
NielsRogge Dec 15, 2020
fce7b98
Improve conversion script and add test for absolute position embeddings
NielsRogge Dec 15, 2020
8ba021c
Fix bug with reset_position_index_per_cell arg of the conversion cli
NielsRogge Dec 15, 2020
c9d5f5b
Add notebooks to the examples directory and fix style and quality
NielsRogge Dec 15, 2020
1acb830
Apply suggestions from code review
LysandreJik Dec 15, 2020
74ad7e9
Move from `nielsr/` to `google/` namespace
LysandreJik Dec 15, 2020
d63e8ab
Apply Sylvain's comments
LysandreJik Dec 15, 2020
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,7 @@ jobs:
- v0.4-{{ checksum "setup.py" }}
- run: pip install --upgrade pip
- run: pip install .[sklearn,tf-cpu,torch,testing,sentencepiece]
- run: pip install tapas torch-scatter -f https://pytorch-geometric.com/whl/torch-1.7.0+cpu.html
- save_cache:
key: v0.4-{{ checksum "setup.py" }}
paths:
Expand All @@ -105,6 +106,7 @@ jobs:
- v0.4-{{ checksum "setup.py" }}
- run: pip install --upgrade pip
- run: pip install .[sklearn,torch,testing,sentencepiece]
- run: pip install tapas torch-scatter -f https://pytorch-geometric.com/whl/torch-1.7.0+cpu.html
- save_cache:
key: v0.4-torch-{{ checksum "setup.py" }}
paths:
Expand Down Expand Up @@ -183,6 +185,7 @@ jobs:
- v0.4-{{ checksum "setup.py" }}
- run: pip install --upgrade pip
- run: pip install .[sklearn,torch,testing,sentencepiece]
- run: pip install tapas torch-scatter -f https://pytorch-geometric.com/whl/torch-1.7.0+cpu.html
- save_cache:
key: v0.4-torch-{{ checksum "setup.py" }}
paths:
Expand Down
2 changes: 2 additions & 0 deletions .github/workflows/self-push.yml
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,7 @@ jobs:
pip install --upgrade pip
pip install .[torch,sklearn,testing,onnxruntime,sentencepiece]
pip install git+https://github.com/huggingface/datasets
pip install pandas torch-scatter -f https://pytorch-geometric.com/whl/torch-$(python -c "import torch; print(''.join(torch.__version__)")+$(python -c "import torch; print(''.join(torch.version.cuda.split('.')))").html
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I will update this after the merge to something cleaner.


- name: Are GPUs recognized by our DL frameworks
run: |
Expand Down Expand Up @@ -187,6 +188,7 @@ jobs:
pip install --upgrade pip
pip install .[torch,sklearn,testing,onnxruntime,sentencepiece]
pip install git+https://github.com/huggingface/datasets
pip install pandas torch-scatter -f https://pytorch-geometric.com/whl/torch-$(python -c "import torch; print(''.join(torch.__version__)")+$(python -c "import torch; print(''.join(torch.version.cuda.split('.')))").html
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here.


- name: Are GPUs recognized by our DL frameworks
run: |
Expand Down
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -222,6 +222,7 @@ Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, and Wen-tau Yih.
ultilingual BERT into [DistilmBERT](https://github.com/huggingface/transformers/tree/master/examples/distillation) and a German version of DistilBERT.
1. **[SqueezeBert](https://huggingface.co/transformers/model_doc/squeezebert.html)** released with the paper [SqueezeBERT: What can computer vision teach NLP about efficient neural networks?](https://arxiv.org/abs/2006.11316) by Forrest N. Iandola, Albert E. Shaw, Ravi Krishna, and Kurt W. Keutzer.
1. **[T5](https://huggingface.co/transformers/model_doc/t5.html)** (from Google AI) released with the paper [Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer](https://arxiv.org/abs/1910.10683) by Colin Raffel and Noam Shazeer and Adam Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu.
1. **[TAPAS](https://huggingface.co/transformers/master/model_doc/tapas.html)** released with the paper [TAPAS: Weakly Supervised Table Parsing via Pre-training](https://arxiv.org/abs/2004.02349) by Jonathan Herzig, Paweł Krzysztof Nowak, Thomas Müller, Francesco Piccinno and Julian Martin Eisenschlos.
1. **[Transformer-XL](https://huggingface.co/transformers/model_doc/transformerxl.html)** (from Google/CMU) released with the paper [Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context](https://arxiv.org/abs/1901.02860) by Zihang Dai*, Zhilin Yang*, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov.
1. **[XLM](https://huggingface.co/transformers/model_doc/xlm.html)** (from Facebook) released together with the paper [Cross-lingual Language Model Pretraining](https://arxiv.org/abs/1901.07291) by Guillaume Lample and Alexis Conneau.
1. **[XLM-ProphetNet](https://huggingface.co/transformers/model_doc/xlmprophetnet.html)** (from Microsoft Research) released with the paper [ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training](https://arxiv.org/abs/2001.04063) by Yu Yan, Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
Expand Down
16 changes: 11 additions & 5 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -176,19 +176,22 @@ and conversion utilities for the following models:
30. :doc:`T5 <model_doc/t5>` (from Google AI) released with the paper `Exploring the Limits of Transfer Learning with a
Unified Text-to-Text Transformer <https://arxiv.org/abs/1910.10683>`__ by Colin Raffel and Noam Shazeer and Adam
Roberts and Katherine Lee and Sharan Narang and Michael Matena and Yanqi Zhou and Wei Li and Peter J. Liu.
31. :doc:`Transformer-XL <model_doc/transformerxl>` (from Google/CMU) released with the paper `Transformer-XL:
31. `TAPAS <https://huggingface.co/transformers/master/model_doc/tapas.html>`__ released with the paper `TAPAS: Weakly
Supervised Table Parsing via Pre-training <https://arxiv.org/abs/2004.02349>`__ by Jonathan Herzig, Paweł Krzysztof
Nowak, Thomas Müller, Francesco Piccinno and Julian Martin Eisenschlos.
32. :doc:`Transformer-XL <model_doc/transformerxl>` (from Google/CMU) released with the paper `Transformer-XL:
Attentive Language Models Beyond a Fixed-Length Context <https://arxiv.org/abs/1901.02860>`__ by Zihang Dai*,
Zhilin Yang*, Yiming Yang, Jaime Carbonell, Quoc V. Le, Ruslan Salakhutdinov.
32. :doc:`XLM <model_doc/xlm>` (from Facebook) released together with the paper `Cross-lingual Language Model
33. :doc:`XLM <model_doc/xlm>` (from Facebook) released together with the paper `Cross-lingual Language Model
Pretraining <https://arxiv.org/abs/1901.07291>`__ by Guillaume Lample and Alexis Conneau.
33. :doc:`XLM-ProphetNet <model_doc/xlmprophetnet>` (from Microsoft Research) released with the paper `ProphetNet:
34. :doc:`XLM-ProphetNet <model_doc/xlmprophetnet>` (from Microsoft Research) released with the paper `ProphetNet:
Predicting Future N-gram for Sequence-to-Sequence Pre-training <https://arxiv.org/abs/2001.04063>`__ by Yu Yan,
Weizhen Qi, Yeyun Gong, Dayiheng Liu, Nan Duan, Jiusheng Chen, Ruofei Zhang and Ming Zhou.
34. :doc:`XLM-RoBERTa <model_doc/xlmroberta>` (from Facebook AI), released together with the paper `Unsupervised
35. :doc:`XLM-RoBERTa <model_doc/xlmroberta>` (from Facebook AI), released together with the paper `Unsupervised
Cross-lingual Representation Learning at Scale <https://arxiv.org/abs/1911.02116>`__ by Alexis Conneau*, Kartikay
Khandelwal*, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke
Zettlemoyer and Veselin Stoyanov.
35. :doc:`XLNet <model_doc/xlnet>` (from Google/CMU) released with the paper `​XLNet: Generalized Autoregressive
36. :doc:`XLNet <model_doc/xlnet>` (from Google/CMU) released with the paper `​XLNet: Generalized Autoregressive
Pretraining for Language Understanding <https://arxiv.org/abs/1906.08237>`__ by Zhilin Yang*, Zihang Dai*, Yiming
Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le.

Expand Down Expand Up @@ -269,6 +272,8 @@ TensorFlow and/or Flax.
+-----------------------------+----------------+----------------+-----------------+--------------------+--------------+
| T5 | ✅ | ✅ | ✅ | ✅ | ❌ |
+-----------------------------+----------------+----------------+-----------------+--------------------+--------------+
| TAPAS | ✅ | ❌ | ✅ | ❌ | ❌ |
+-----------------------------+----------------+----------------+-----------------+--------------------+--------------+
| Transformer-XL | ✅ | ❌ | ✅ | ✅ | ❌ |
+-----------------------------+----------------+----------------+-----------------+--------------------+--------------+
| XLM | ✅ | ❌ | ✅ | ✅ | ❌ |
Expand Down Expand Up @@ -382,6 +387,7 @@ TensorFlow and/or Flax.
model_doc/roberta
model_doc/squeezebert
model_doc/t5
model_doc/tapas
model_doc/transformerxl
model_doc/xlm
model_doc/xlmprophetnet
Expand Down
Loading