Added mask_time_prob and mask_time_length arguments to wav2vec2 pretraining script and readme #19997
Closed
mpierrau wants to merge 419 commits into huggingface:main from mpierrau:adding-mask_time_prob-args-to-wav2vec2-pretraining-script
Commits
29a59f9
Generate: general TF XLA constrastive search are now slow tests (#20277)
gante 4c732a3
Fixing the doctests failures. (#20294)
Narsil 5aff383
set the default cache_enable to True, aligned with the default value …
sywangyi f26e427
Add docstrings for canine model (#19457)
raghavanone 1a6d6e4
Add AutoBackbone + ResNetBackbone (#20229)
NielsRogge b132b80
Add missing report button for Example test (#20293)
ydshieh ba23a07
refactor test (#20300)
younesbelkada 1c2b2b3
[Tiny model creation] deal with `ImageProcessor` (#20298)
ydshieh 7a63702
Fix blender bot missleading doc (#20301)
ArthurZucker 4bb1eac
remove two tokens that should not be suppressed (#20302)
ArthurZucker 7bb727a
[ASR Examples] Update README for Whisper (#20230)
sanchit-gandhi baca2c1
Add padding image transformation (#19838)
amyeroberts 90e95e1
Pin TensorFlow (#20313)
sgugger e35c976
Pin to the right version...
sgugger e2bd841
Also pin TensorFlow CPU
sgugger 70a1123
Add AnyPrecisionAdamW optimizer (#18961)
atturaioe 5d2c1db
[Proposal] Breaking change `zero-shot-object-detection` for impro…
Narsil 78e64cf
Fix flakey test with seed (#20318)
muellerzr 0e7bea9
Pin TF 2.10.1 for Push CI (#20319)
ydshieh 56f5ce0
Remove double brackets (#20307)
stevhliu 6716752
TF: future proof our keras imports (#20317)
gante 5bf1f59
Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) …
alihassanijr d966c6f
organize pipelines by modality (#20306)
stevhliu aa6beaa
Fix torch device issues (#20304)
ydshieh 713f136
Generate: add generation config class (#20218)
gante 82ceaa9
translate zh quicktour(#20095) (#20181)
bfss 89e57ea
Add Spanish translation of serialization.mdx (#20245)
donelianc 0266ccc
Add LayerScale to NAT/DiNAT (#20325)
alihassanijr 86e73c1
[Switch Transformers] Fix failing slow test (#20346)
younesbelkada fe6367d
fix: "BigSicence" typo in docs (#20331)
rajrajhans b9d8426
add MobileNetV1 model (#17799)
hollance 785a276
Generate: `model_kwargs` can also be an input to `prepare_inputs_for_…
gante 34e2123
Update Special Language Tokens for PLBART (#19980)
jordiclive 76a9e02
Add resources (#20296)
NielsRogge d7d9abe
Enhance HfArgumentParser functionality and ease of use (#20323)
konstantinjdobler b84b1a4
Add Audio Spectogram Transformer (#19981)
NielsRogge 25f9f54
Add inference section to task guides (#18781)
stevhliu 4f64be7
Fix toctree for Section 3 in Spanish Documentation (#20360)
donelianc 82216f8
Generate: shorter XLA contrastive search tests (#20354)
gante 66c8000
revert `keys_to_ignore` for M2M100 (#20381)
younesbelkada 822ed4a
add `accelerate` support for `ESM` (#20379)
younesbelkada 7dcaa83
Bump pillow in /examples/research_projects/decision_transformer (#20378)
dependabot[bot] b36de61
Skip failing test
sgugger 053998a
Fix nightly runs (#20352)
sgugger c7926e0
Optimizes DonutProcessor token2json method for speed (#20283)
michaelnation26 5c595e8
Indicate better minimal version of PyTorch in big model inference (#2…
sgugger c687e75
Improve backbone (#20380)
NielsRogge 4242739
Fix longformer onnx broken export (#20292)
fxmarty dc27bf0
Use tiny models for ONNX tests - text modality (#20333)
lewtun 08cae53
[ESM] fix `accelerate` tests for esmfold (#20387)
younesbelkada 2ef4552
Generate: fix plbart generation tests (#20391)
gante 35073be
[bloom] convert script tweaks (#18593)
stas00 3cdf2e4
Fix doctest file path (#20400)
ydshieh 6300e21
[Image Transformers] to_pil fix float edge cases (#20406)
patrickvonplaten 5f1e0c2
make daily CI happy (#20410)
younesbelkada ddb08ff
fix nasty `bnb` bug (#20408)
younesbelkada ae63540
change the way sentinel tokens can retrived (#20373)
raghavanone fc8447b
[BNB] Throw `ValueError` when trying to cast or assign (#20409)
younesbelkada 79fb3ec
Use updated `model_max_length` when saving tokenizers (#20401)
ydshieh 4f27fe5
Add Spanish translation of pr_checks.mdx (#20339)
donelianc 8383c67
fix device in longformer onnx path (#20419)
fxmarty 4036c7b
Fix ModelOutput instantiation when there is only one tuple (#20416)
sgugger 56bfaf6
`accelerate` support for `OwlViT` (#20411)
younesbelkada 587edf3
[AnyPrecisionAdamW] test fix (#20454)
stas00 4aaaeee
fix `word_to_tokens` docstring format (#20450)
SaulLu e1ed7ba
Fix typo in FSMT Tokenizer (#20456)
kamalkraj 54c03a4
Fix device issues in `CLIPSegModelIntegrationTest` (#20467)
ydshieh f369695
Fix links for `contrastive_loss` (#20455)
ydshieh c238536
Fix doctests for audio models (#20468)
ydshieh 87b6d3a
Fix ESM checkpoints for tests (#20436)
Rocketknight1 364d09c
More TF int dtype fixes (#20384)
Rocketknight1 06fe148
make tensors in function build_relative_position created on proper de…
qq775294390 9243274
update cpu related doc (#20444)
sywangyi 62160c5
with pytorch cpu only version. without --no_cuda, using --bf16 will t…
sywangyi 3328867
[CLIPTokenizer] Improve warning (#20458)
patrickvonplaten f322573
Replace assertions with value errors on distilbert model (#20463)
JuheonChu 43fa78b
[Doctest] Add configuration_fsmt.py (#19936)
sha016 f98492b
Replace assertion with ValueError exceptions in run_image_captioning_…
katiele47 6aefbc2
[FLAX] Add dtype to embedding for bert/bart/opt/t5 (#20340)
merrymercy a7ad384
Safetensors offload (#20321)
sgugger b880842
fix both failing RoCBert tests (#20469)
ArthurZucker ce4502a
[AutoBackbone] Improve API (#20407)
NielsRogge c5f3c32
Include image processor in add-new-model-like (#20439)
amyeroberts 53a19cd
chore: add link to the video cls notebook. (#20386)
sayakpaul 88c5dda
add timeout option for deepspeed engine (#20443)
henghuiz 426411a
[Maskformer] Add MaskFormerSwin backbone (#20344)
NielsRogge 6b67f5b
Extract warnings from CI artifacts (#20474)
ydshieh 6e6c78e
Add Donut image processor (#20425)
amyeroberts acd32ce
Fix torch meshgrid warnings (#20475)
fxmarty a770c67
Fix init import_structure sorting (#20477)
sgugger f513330
extract warnings in GH workflows (#20487)
ydshieh afc47ad
add in layer gpt2 tokenizer (#20421)
piEsposito d4fcef7
Replace assert statements with raise exceptions (#20478)
miyu386 c445dbf
fixed small typo (#20490)
sandeepgadhwal 64fc338
Fix documentation code to import facebook/detr-resnet-50 model (#20491)
JuanFKurucz 6aa8f88
Fix disk offload for full safetensors checkpoints (#20497)
sgugger 9961d33
[modelcard] Check for IterableDataset (#20495)
sanchit-gandhi fc5b925
[modelcard] Set model name if empty (#20496)
sanchit-gandhi 929ea58
Add segmentation + object detection image processors (#20160)
amyeroberts 1cd75d6
remove `attention_mask` truncation in whisper (#20488)
ydshieh 7de8718
Make `add_special_tokens` more clear (#20424)
ydshieh bffd68c
[OPT/Galactica] Load large `galactica` models (#20390)
younesbelkada 3fd9af8
Support extraction of both train and eval XLA graphs (#20492)
jeffhataws 42c7df0
fix ipex+fp32 jit trace error in ipex 1.13 (#20504)
sywangyi 579e73d
Expected output for the test changed (#20493)
ArthurZucker 96bc441
Fix TF nightly tests (#20507)
Rocketknight1 8de532f
Update doc examples feature extractor -> image processor (#20501)
amyeroberts 8586ebe
Fix Typo in Docs for GPU (#20509)
julianpollmann 0140201
Repurpose torchdynamo training args towards torch._dynamo (#20498)
sgugger 436bd20
Fix minimum version for device_map (#20489)
sgugger cbddaa9
Add Chinese-CLIP implementation (#20368)
yangapku b0e44f3
Fix style
sgugger 1eb5f80
Update `AutomaticSpeechRecognitionPipeline` doc example (#20512)
ydshieh 473cab0
Add `natten` for CI (#20511)
ydshieh d485617
Fix Data2VecTextForCasualLM example code documentation (#20510)
JuanFKurucz 984369c
Add some warning for Dynamo and enable TF32 when it's set (#20515)
sgugger e335df6
[modelcard] Update dataset tags (#20506)
sanchit-gandhi 15c15a1
Change Doctests CI launch time (#20523)
ydshieh 6aed6e5
Fix `PLBart` doctest (#20527)
ydshieh 4dd3237
Fix `ConditionalDetrForSegmentation` doc example (#20531)
ydshieh 4fa8ce9
add doc for (#20525)
younesbelkada 7155d42
Update `ZeroShotObjectDetectionPipeline` doc example (#20528)
ydshieh 6cbbcce
update post_process_image_guided_detection (#20521)
fcakyon bc60b35
QnA example: add speed metric (#20522)
sywangyi acb8029
Fix doctest (#20534)
NielsRogge 4f856bb
Fix Hubert models in TFHubertModel and TFHubertForCTC documentation c…
JuanFKurucz 1255f7b
Fix link in pipeline device map (#20517)
stevhliu c4b4c75
v4.26.0.dev0
sgugger 2cfac98
fix cuda OOM by using single Prior (#20486)
ArthurZucker 62ff7e5
[New Model] Add TimeSformer model (#18908)
fcakyon 0101ae1
Add ESM contact prediction (#20535)
Rocketknight1 3378539
flan-t5.mdx: fix link to large model (#20555)
szhublox da72950
Fix torch device issues (#20584)
ydshieh 5b1f623
Fix flax GPT-J-6B linking model in tests (#20556)
JuanFKurucz f6ab014
[Vision] fix small nit on `BeitDropPath` layers (#20587)
younesbelkada 4edc583
Fix repo consistency
sgugger 9b050fd
Install `natten` with CUDA version (#20546)
ydshieh ed85ef5
Add entries to `FEATURE_EXTRACTOR_MAPPING_NAMES` (#20551)
ydshieh ea4fc67
Cleanup some config attributes (#20554)
ydshieh 911d21f
[Whisper] Move decoder id method to tokenizer (#20589)
sanchit-gandhi e106bcb
Add `require_torch` to 2 pipeline tests (#20585)
ydshieh bc7e172
Install `tensorflow_probability` for TF pipeline CI (#20586)
ydshieh b549696
Add BioGPT (#20420)
kamalkraj 36bf4a6
Ci-whisper-asr (#20588)
ArthurZucker 63d622d
cross platform from_pretrained (#20538)
ArthurZucker ea5ce1d
Make convert_to_onnx runable as script again (#20009)
mcernusca 1010508
ESM openfold_utils type hints (#20544)
e3d99ba
Add RemBERT ONNX config (#20520)
hchings 6c4c1bb
Fix link to Swin Model contributor novice03 (#20557)
JuanFKurucz 737c697
Fix link to swin transformers v2 microsoft model (#20558)
JuanFKurucz 37b7f12
Fix link to table transformer detection microsoft model (#20560)
JuanFKurucz d4011a4
clean up unused `classifier_dropout` in config (#20596)
ydshieh 4db864d
Fix whisper and speech to text doc (#20595)
ArthurZucker 8215fb4
Replace `set-output` by `$GITHUB_OUTPUT` (#20547)
ydshieh 53d93ae
[Vision] `.to` function for ImageProcessors (#20536)
younesbelkada 11c904d
[Whisper] Fix decoder ids methods (#20599)
sanchit-gandhi e9f100c
Add-whisper-conversion (#20600)
ArthurZucker a37d710
README in Hindi 🇮🇳 (#20097)
pacman100 2dc7784
Fix code sample in preprocess (#20561)
stevhliu 87a9039
Split autoclasses on modality (#20559)
stevhliu 2a1c77a
Fix test for file not found (#20604)
sgugger 42180af
Rework the pipeline tutorial (#20437)
Narsil ff40a03
Documentation fixes (#20607)
samuelzxu 2643822
Adding anchor links to Hindi README (#20606)
pacman100 340c3b9
exclude jit time from the speed metric calculation of evaluation and …
sywangyi 248ea71
Check if docstring is None before formating it (#20592)
xxyzz fa694b4
updating T5 and BART models to support Prefix Tuning (#20601)
pacman100 6750938
Repo consistency
sgugger d90d85b
Fix `AutomaticSpeechRecognitionPipelineTests.run_pipeline_test` (#20597)
ydshieh 655a194
Ci-jukebox (#20613)
ArthurZucker c6ec20e
Update some GH action versions (#20537)
ydshieh 7972297
Fix dtype of weights in from_pretrained when device_map is set (#20602)
sgugger e925cd5
add missing is_decoder param (#20631)
stevhliu fadca52
Fix link to speech encoder decoder model in speech recognition readme…
JuanFKurucz ac0ee24
Fix `natten` installation in docker file (#20632)
ydshieh 9b472d3
Clip floating point constants to bf16 range to avoid inf conversion (…
sangeethabal 910fc7f
Pin TensorFlow to the next release (#20635)
sgugger 138cca0
[MaskFormer] Add support for ResNet backbone (#20483)
NielsRogge 83c31ec
Add BiT + ViT hybrid (#20550)
NielsRogge b31287d
[Trainer] add error when passing `8bit`models (#20651)
younesbelkada 6d749c4
[`ViTHybrid`] + [`BiT`] cleaner `__init__` (#20649)
younesbelkada 9ed699e
Update summarization `run_pipeline_test` (#20623)
ydshieh d9e2542
pin TF 2.11 in docker files (#20642)
ydshieh 1770d9a
Speed up git-lfs detection on error (#20641)
xloem 4e86af8
Updated Trainer args typing (#20655)
julianmack 7e6d7a6
Add `dpt-hybrid` support (#20645)
younesbelkada 8ffc9c1
[Whisper] Fix forced decoder ids (#20652)
sanchit-gandhi c61e830
Add TFBartForSequenceClassification (#20570)
uglyboxer d4aa766
run_speech_recognition_seq2seq.py: add cache_dir param to dataset (#2…
eschmidbauer 67e1953
[`BiT`] Small patch fix (#20657)
younesbelkada 44a868b
Fix gpt2 fp16 training when tracing is enabled (#20656)
JingyaHuang 486a165
Fix load from PT-formatted checkpoint in composite TF models (#20661)
sgugger 254c612
Update the list of contributors to reflect current organization (#20603)
sgugger b66c3d7
Fix expected values for TF-ESM tests (#20680)
Rocketknight1 f3bd7fe
Add `BackboneMixin` (#20660)
ydshieh c60d369
Bump certifi in /examples/research_projects/lxmert (#20672)
dependabot[bot] 672c74b
Bump certifi in /examples/research_projects/decision_transformer (#20…
dependabot[bot] af927c7
Bump certifi in /examples/research_projects/visual_bert (#20673)
dependabot[bot] b27e801
Migrate torchdynamo to torch.compile (#20634)
sgugger 5ff084f
Whilelist Transformers private method in DummyObject (#20681)
sgugger 074306d
[`ViTHybrid`] Fix `accelerate` slow tests (#20679)
younesbelkada 87b2076
Enable bf16 option for XLA devices (#20684)
jeffhataws cad0c65
Fix CIs for PyTorch 1.13 (#20686)
ydshieh 7cc4588
Fix donut image processor (#20625)
amyeroberts e878528
Added missing `test_tokenization_led` (#20568)
IMvision12 bbc4928
Add deprecation warning when image FE instantiated (#20427)
amyeroberts df63458
Add video classification pipeline (#20151)
nateraw feed8ec
[Backbones] Improve out features (#20675)
NielsRogge bd395f7
Change transformers.onnx to use optimum.exporters.onnx (#20529)
michaelbenayoun 60fae5e
skip `test_multi_gpu_data_parallel_forward` for `MaskFormerSwinModelT…
ydshieh af23b08
Vision processors - replace FE with IPs (#20590)
amyeroberts ef579a8
Replace FE references (#20702)
amyeroberts b6979f7
[`ViTHybrid`] fix last `accelerate` slow test (#20705)
younesbelkada 2151285
Fix rendering issue in quicktour (#20708)
sgugger 98df7e5
Made LUKE Tokenizer independent from RoBERTa (#20720)
salvo96 fad625d
Spanish translation of asr.mdx and add_new_pipeline.mdx (#20569)
alceballosa b98370a
Add `accelerate` support for LongT5 models (#20341)
pszemraj c08b449
Fix `AutoModelTest.test_model_from_pretrained` (#20730)
ydshieh ec19a0e
Adding ValueError when imcompatible parameters are used. (#20729)
Narsil 7b023fc
Add type hints for Whisper models (#20396)
donelianc 9b60888
Very small edit to change name to OpenAI GPT (#20722)
stanleycai95 9a8df40
fsdp fix (#20719)
pacman100 1a0e651
Spanish translation of the file debugging.mdx (#20566)
SimplyJuanjo 3f78943
Convert tokenizer outputs for Keras in doc example (#20732)
Rocketknight1 fe239d7
Clarify return_tensor and return_text parameters (#20662)
stevhliu 4d682a6
Add vision requirement to image transforms (#20712)
amyeroberts 0e3bb1e
Add gpt-sw3 model to transformers (#20209)
ekgren ed58694
Add a progress bar for large model loading (#20713)
sgugger 71c6e75
Disambiguate test for required_input in tokenization base file. (#20731)
sgugger fc1a8e8
Add decorator for flaky Donut tests (#20739)
amyeroberts 7896580
rename `layoutlm_job` to `exotic_models_job` (#20736)
ydshieh 214bb6f
Update CI to torch 1.13.0 (#20687)
ydshieh ff0be4d
Add `keep_in_fp32_modules` support (#20683)
younesbelkada f4d4d5f
Change a logic in pipeline test regarding TF (#20710)
ydshieh edecaeb
Fix AdamWeightDecay for TF 2.11 (#20735)
Rocketknight1 6b27a46
in the resize() function in image_transforms.py, the line 267: (#20728)
dhansmair bb528c5
[NAT, DiNAT] Add backbone class (#20654)
NielsRogge ccf278c
Add docs xlm roberta (#20742)
hazrulakmal 19c41e9
Fixing the pipeline tutorial test (#20746)
Narsil edaa863
Uninstall `torch_tensorrt` in `DeepSpeed` CI image for now (#20758)
ydshieh 3274267
Remove image_transforms functions from init (#20704)
amyeroberts 4c87485
Fix missing `()` in some usage of `is_flaky` (#20749)
ydshieh 513d90a
[Tests] Improve test_attention_outputs (#20701)
NielsRogge d3e8444
Fix attribute error problem (#20765)
fjklqq ce40ff2
[CI-Test] Fixes but also skips the mT5 tests (#20755)
ArthurZucker 07e7deb
Replaces xxx_required with requires_backends (#20715)
amyeroberts 9e57892
Install `torch-tensorrt 1.3.0` for DeepSpeed CI (#20764)
ydshieh 032c6c0
Add Swin backbone (#20769)
NielsRogge 17e997c
Even more validation. (#20762)
Narsil c583e1d
Install vision for TF pipeline tests (#20771)
ydshieh