merge w/official #6

Merged
merged 268 commits into from
Nov 27, 2023
268 commits
df6f36a
deprecate function `get_default_device` in `tools/base.py` (#26774)
ji-huazhong Oct 31, 2023
b5c8e23
Remove broken links to s-JoL/Open-Llama (#27164)
CSRessel Oct 31, 2023
9234cae
[docstring] Fix docstring for AltCLIPTextConfig, AltCLIPVisionConfig …
AksharGoyal Oct 31, 2023
14bb196
[doctring] Fix docstring for BlipTextConfig, BlipVisionConfig (#27173)
Hangsiin Oct 31, 2023
9dc4ce9
Disable CI runner check (#27170)
ydshieh Oct 31, 2023
b5db8ca
Add flash attention for `gpt_bigcode` (#26479)
susnato Oct 31, 2023
3cd3eaf
fix: Fix typical_p behaviour broken in recent change (#27165)
njhill Oct 31, 2023
2963e19
Add support for loading GPTQ models on CPU (#26719)
vivekkhandelwal1 Oct 31, 2023
a8e74eb
Trigger CI if `tiny_model_summary.json` is modified (#27175)
ydshieh Oct 31, 2023
08fadc8
Shorten the conversation tests for speed + fixing position overflows …
Rocketknight1 Oct 31, 2023
f53041a
device agnostic pipelines testing (#27129)
ji-huazhong Oct 31, 2023
309a906
[FEAT] Add Neftune into transformers Trainer (#27141)
younesbelkada Oct 31, 2023
05f2290
Backward compatibility fix for the Conversation class (#27176)
Rocketknight1 Oct 31, 2023
4bb50aa
[`Quantization` / `tests` ] Fix bnb MPT test (#27178)
younesbelkada Oct 31, 2023
e22b7ce
Fix dropout in `StarCoder` (#27182)
susnato Oct 31, 2023
6b7f8ff
translate traning.md to chinese (#27122)
jiaqiw09 Oct 31, 2023
77930f8
[docs] Update CPU/GPU inference docs (#26881)
stevhliu Oct 31, 2023
50378cb
device agnostic models testing (#27146)
ji-huazhong Oct 31, 2023
25e6e94
Unify warning styles for better readability (#27184)
oneonlee Oct 31, 2023
113ebf8
Safetensors serialization by default (#27064)
LysandreJik Oct 31, 2023
7d8ff36
🌐 [i18n-ZH] Translate tflite.md into Chinese (#27134)
yyLeaves Oct 31, 2023
82c7e87
device agnostic fsdp testing (#27120)
ji-huazhong Nov 1, 2023
ae093ee
[`core` / `Quantization` ] AWQ integration (#27045)
younesbelkada Nov 1, 2023
7102552
Fix docstring get maskformer resize output image size (#27196)
wesleylp Nov 1, 2023
636f704
Fix the typos and grammar mistakes in CONTRIBUTING.md. (#27193)
THEFZNKHAN Nov 1, 2023
f3c1a17
Fixing docstring in get_resize_output_image_size function (#27191)
wesleylp Nov 1, 2023
037fb7d
added unsqueeze_dim to apply_rotary_pos_emb (#27117)
ShashankMosaicML Nov 1, 2023
f9b4bea
Added cache_block_outputs option to enable GPTQ for non-regular model…
AlexKoff88 Nov 1, 2023
391d14e
[WhisperForCausalLM] Add WhisperForCausalLM for speculative decoding …
patrickvonplaten Nov 1, 2023
f8afb2b
Add TensorFlow implementation of ConvNeXTv2 (#25558)
neggles Nov 1, 2023
21a2fba
Fix docstring in get_oneformer_resize_output_image_size func (#27207)
wesleylp Nov 1, 2023
1e32b05
improving TimmBackbone to support FrozenBatchNorm2d (#27160)
rafaelpadilla Nov 1, 2023
239cd0e
Translate task summary to chinese (#27180)
jiaqiw09 Nov 1, 2023
c9e72f5
Add exllamav2 better (#27111)
SunMarc Nov 1, 2023
95020f2
Fix CPU offload + disk offload tests (#27204)
LysandreJik Nov 1, 2023
3520e37
Enable split_batches through TrainingArguments (#26798)
muellerzr Nov 1, 2023
af3de8d
[Whisper, Bart, MBart] Add Flash Attention 2 (#27203)
patrickvonplaten Nov 1, 2023
7adaefe
support bf16 (#25879)
etemadiamd Nov 2, 2023
c52e429
Reproducible checkpoint for npu (#27208)
ji-huazhong Nov 2, 2023
9b25c16
[`core` / `Quantization`] Fix for 8bit serialization tests (#27234)
younesbelkada Nov 2, 2023
8a31295
Fuyu: improve image processing (#27007)
molbap Nov 2, 2023
4557a0d
Wrap `_prepare_4d_causal_attention_mask` as a leaf function (#27236)
michaelbenayoun Nov 2, 2023
443bf5e
Fix safetensors failing tests (#27231)
LysandreJik Nov 2, 2023
8801861
Fixing m4t. (#27240)
Narsil Nov 2, 2023
441c3e0
fix-deprecated-exllama-arg (#27243)
SunMarc Nov 2, 2023
a6c82d4
Generate: return `past_key_values` (#25086)
gante Nov 2, 2023
147e8ce
Remove redundant code from T5 encoder mask creation (#27216)
pietrolesci Nov 2, 2023
0ed6729
Enrich TTS pipeline parameters naming (#26473)
ylacombe Nov 2, 2023
bc78fd1
Dev version
LysandreJik Nov 2, 2023
00d8502
translate peft.md to chinese (#27215)
jiaqiw09 Nov 2, 2023
4991216
Removed the redundant SiLUActivation class. (#27136)
hisushanta Nov 2, 2023
552ff24
Fixed base model class name extraction from PeftModels (#27162)
kkteru Nov 2, 2023
e9a6c72
Fuyu protection (#27248)
LysandreJik Nov 3, 2023
05ea7b7
Refactor: Use Llama RoPE implementation for Falcon (#26933)
tomaarsen Nov 3, 2023
8f1a43c
[`PEFT` / `Tests` ] Fix peft integration failing tests (#27258)
younesbelkada Nov 3, 2023
af8d1dc
Avoid many failing tests in doctesting (#27262)
ydshieh Nov 3, 2023
011b15c
[docs] Custom model doc update (#27213)
MKhalusova Nov 3, 2023
db69bd8
Update the ConversationalPipeline docstring for chat templates (#27250)
Rocketknight1 Nov 3, 2023
f13f544
Fix switch transformer mixed precision issue (#27220)
timlee0212 Nov 3, 2023
ad8ff96
[`Docs` / `SAM` ] Reflect correct changes to run inference without OO…
younesbelkada Nov 3, 2023
5964f82
[Docs] Model_doc structure/clarity improvements (#26876)
MKhalusova Nov 3, 2023
1ac2463
[`FA2`] Add flash attention for for `DistilBert` (#26489)
susnato Nov 3, 2023
bf7cfac
translate autoclass_tutorial to chinese (#27269)
jiaqiw09 Nov 3, 2023
cc3e478
translate run_scripts.md to chinese (#27246)
jiaqiw09 Nov 3, 2023
b026b5c
Fix tokenizer export for LLamaTokenizerFast (#27222)
mayank31398 Nov 6, 2023
d788d37
Fix daily CI image build (#27307)
ydshieh Nov 6, 2023
eef7ea9
Update doctest workflow file (#27306)
ydshieh Nov 6, 2023
d7dcfa8
Remove an unexpected argument for FlaxResNetBasicLayerCollection (#27…
pingzhili Nov 6, 2023
1ffc4de
enable memory tracker metrics for npu (#27280)
ji-huazhong Nov 6, 2023
147f774
[`PretrainedTokenizer`] add some of the most important functions to t…
ArthurZucker Nov 6, 2023
e9dbd39
Update sequence_classification.md (#27281)
akshayvkt Nov 6, 2023
a6e0d5a
Fix VideoMAEforPretrained dtype error (#27296)
ikergarcia1996 Nov 6, 2023
1b20e2b
Fix `Kosmos2Processor` batch mode (#27323)
ydshieh Nov 6, 2023
9beb273
[docs] fixed links with 404 (#27327)
MKhalusova Nov 6, 2023
da7ea9a
[Whisper] Block language/task args for English-only (#27322)
sanchit-gandhi Nov 7, 2023
26d8d5f
Fix autoawq docker image (#27339)
younesbelkada Nov 7, 2023
90b4adc
Generate: skip tests on unsupported models instead of passing (#27265)
gante Nov 7, 2023
606d908
Fix Whisper Conversion Script: Correct decoder_attention_heads and _d…
zuazo Nov 7, 2023
0ded281
[`FA2`] Add flash attention for `GPT-Neo` (#26486)
susnato Nov 7, 2023
88832c0
[`Whisper`] Add conversion script for the tokenizer (#27338)
ArthurZucker Nov 7, 2023
9459d82
Remove a redundant variable. (#27288)
hisushanta Nov 7, 2023
8c91f15
Resolve AttributeError by utilizing device calculation at the start o…
folbaeni Nov 7, 2023
cc9f27b
Remove padding_masks from `gpt_bigcode`. (#27348)
susnato Nov 7, 2023
8f840ed
[`Whisper`] Nit converting the tokenizer (#27349)
ArthurZucker Nov 7, 2023
ac5d4cf
FIx Bark batching feature (#27271)
ylacombe Nov 7, 2023
7e1eff7
Allow scheduler parameters (#26480)
Plemeur Nov 7, 2023
f213d5d
translate the en tokenizer_summary.md to Chinese (#27291)
ZouJiu1 Nov 7, 2023
e264745
translate model_sharing.md and llm_tutorial.md to chinese (#27283)
jiaqiw09 Nov 7, 2023
be74b2e
Add numpy alternative to FE using torchaudio (#26339)
ylacombe Nov 8, 2023
b6dbfee
moving example of benchmarking to legacy dir (#27337)
ji-huazhong Nov 8, 2023
efa57cb
Fix example tests from failing (#27353)
muellerzr Nov 8, 2023
5ef650b
Fix `Kosmos-2` device issue (#27346)
ydshieh Nov 8, 2023
f16ff0f
MusicGen Update (#27084)
sanchit-gandhi Nov 8, 2023
eb30a49
Translate index.md to Turkish (#27093)
mertyyanik Nov 8, 2023
845aa83
Remove unused param from example script tests (#27354)
muellerzr Nov 8, 2023
7b175cf
[Flax Whisper] large-v3 compatibility (#27360)
sanchit-gandhi Nov 8, 2023
bd8f45b
Fix tiny model script: not using `from_pt=True` (#27372)
ydshieh Nov 8, 2023
ef71673
translate big_models.md and performance.md to chinese (#27334)
jiaqiw09 Nov 8, 2023
a5bee89
Add Flash Attention 2 support to Bark (#27364)
ylacombe Nov 8, 2023
0e402e1
Update deprecated `torch.range` in `test_modeling_ibert.py` (#27355)
kit1980 Nov 8, 2023
ced9fd8
translate debugging.md to chinese (#27374)
jiaqiw09 Nov 8, 2023
7ecd229
Smangrul/fix failing ds ci tests (#27358)
pacman100 Nov 9, 2023
085ea7e
[`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTok…
ArthurZucker Nov 9, 2023
e9adb0c
Change thresh in test (#27378)
muellerzr Nov 9, 2023
bc086a2
Put doctest options back to `pyproject.toml` (#27366)
ydshieh Nov 9, 2023
9999b73
Skip failing cache call tests (#27393)
amyeroberts Nov 9, 2023
c5d7754
device-agnostic deepspeed testing (#27342)
ji-huazhong Nov 9, 2023
791ec37
Adds dvclive callback (#27352)
dberenbaum Nov 9, 2023
3258ff9
use `pytest.mark` directly (#27390)
ydshieh Nov 9, 2023
cf2a3f3
Fix fuyu checkpoint repo in `FuyuConfig` (#27399)
ydshieh Nov 9, 2023
c5037b4
Use editable install for git deps (#27404)
muellerzr Nov 9, 2023
c8b6052
Final fix of the accelerate installation issue (#27408)
ydshieh Nov 9, 2023
e38348a
Fix RequestCounter to make it more future-proof (#27406)
Wauplin Nov 9, 2023
51a98c4
remove failing tests and clean FE files (#27414)
ylacombe Nov 9, 2023
740cd93
Fix `Owlv2` checkpoint name and a default value in `Owlv2VisionConfig…
ydshieh Nov 9, 2023
cf32c94
Run all tests if `circleci/create_circleci_config.py` is modified (#2…
ydshieh Nov 9, 2023
184f60d
add attention_mask and position_ids in assisted model (#26892)
jiqing-feng Nov 10, 2023
fd685cf
[`Quantization`] Add str to enum conversion for AWQ (#27320)
younesbelkada Nov 10, 2023
9dd58c5
update Bark FA2 docs (#27400)
ylacombe Nov 10, 2023
7e9f10a
Add CLVP (#24745)
susnato Nov 10, 2023
68afca3
[`AttentionMaskConverter`] ]Fix-mask-inf (#27114)
ArthurZucker Nov 10, 2023
00dc856
At most 2 GPUs for CI (#27435)
ydshieh Nov 10, 2023
e1c3ac2
Add Phi-1 and Phi-1_5 (#26170)
susnato Nov 10, 2023
ed115b3
Normalize floating point cast (#27249)
amyeroberts Nov 10, 2023
7ee995f
Make `examples_torch_job` faster (#27437)
ydshieh Nov 10, 2023
9d87cd2
Fix line ending in `utils/not_doctested.txt` (#27459)
ydshieh Nov 13, 2023
7b998ca
Fix some Wav2Vec2 related models' doctest (#27462)
ydshieh Nov 13, 2023
8f577dc
Fixed typo in error message (#27461)
cmcmaster1 Nov 13, 2023
b97cab7
Remove-auth-token (#27060)
ArthurZucker Nov 13, 2023
210e38d
[`Llama + Mistral`] Add attention dropout (#27315)
ArthurZucker Nov 13, 2023
9dc8fe1
Default to msgpack for safetensors (#27460)
LysandreJik Nov 13, 2023
68ae3be
Fix `from_pt` flag when loading with `safetensors` (#27394)
LysandreJik Nov 13, 2023
20abdac
OWLv2: bug fix in post_process_object_detection() when using cuda dev…
assafbot Nov 13, 2023
2dc29cf
Fix docstring for `gradient_checkpointing_kwargs` (#27470)
tomaszcichy98 Nov 13, 2023
3b59621
Install `python-Levenshtein` for `nougat` in CI image (#27465)
ydshieh Nov 13, 2023
2422c38
Add DINOv2 depth estimation (#26092)
NielsRogge Nov 13, 2023
1af766e
Add version check for Jinja (#27403)
Rocketknight1 Nov 13, 2023
04af4b9
Fix Falcon tokenizer loading in pipeline (#27316)
Rocketknight1 Nov 13, 2023
7b13902
[`AWQ` ] Addresses TODO for awq tests (#27467)
younesbelkada Nov 13, 2023
eb79b55
Perf torch compile (#27422)
jiaqiw09 Nov 13, 2023
8017a59
Fixed typo in pipelines.md documentation (#27455)
adismort14 Nov 13, 2023
2ac5b93
[time series] Add PatchTST (#25927)
psinthong Nov 13, 2023
721d1c8
Fix FA2 import + deprecation cycle (#27330)
SunMarc Nov 14, 2023
d71fa9f
[`Peft`] `modules_to_save` support for peft integration (#27466)
younesbelkada Nov 14, 2023
e107ae3
[`CI-test_torch`] skip `test_tf_from_pt_safetensors` for 4 models (#2…
ArthurZucker Nov 14, 2023
ee4fb32
Fix M4T weights tying (#27395)
ylacombe Nov 14, 2023
4309abe
Add speecht5 batch generation and fix wrong attention mask when paddi…
Spycsh Nov 14, 2023
b86c54d
Clap processor: remove wasteful np.stack operations (#27454)
m-bain Nov 14, 2023
a4616c6
[Whisper] Fix pipeline test (#27442)
sanchit-gandhi Nov 14, 2023
78f6ed6
Revert "[time series] Add PatchTST (#25927)" (#27486)
amyeroberts Nov 14, 2023
73bc0c9
translate hpo_train.md and perf_hardware.md to chinese (#27431)
jiaqiw09 Nov 14, 2023
fe472b1
Generate: fix `ExponentialDecayLengthPenalty` doctest (#27485)
gante Nov 14, 2023
5468ab3
Update and reorder docs for chat templates (#27443)
Rocketknight1 Nov 14, 2023
a53a0c5
Generate: `GenerationConfig.from_pretrained` can return unused kwargs…
gante Nov 14, 2023
250032e
Minor type annotation fix (#27276)
vwxyzjn Nov 14, 2023
067c4a3
Have seq2seq just use gather (#27025)
muellerzr Nov 14, 2023
303c1d6
Update processor mapping for hub snippets (#27477)
amyeroberts Nov 14, 2023
2fc33eb
Track the number of tokens seen to metrics (#27274)
muellerzr Nov 14, 2023
186c077
[`CI-test_torch`] skip test_tf_from_pt_safetensors and `test_assisted…
ArthurZucker Nov 15, 2023
cc0dc24
[Fuyu] Add tests (#27001)
NielsRogge Nov 15, 2023
72f531a
[Table Transformer] Add Transformers-native checkpoints (#26928)
NielsRogge Nov 15, 2023
7ddb21b
Update spelling mistake (#27506)
LimJing7 Nov 15, 2023
1e0e2dd
[`CircleCI`] skip test_assisted_decoding_sample for everyone (#27511)
ArthurZucker Nov 15, 2023
64e21ca
Make some jobs run on the GitHub Actions runners (#27512)
ydshieh Nov 15, 2023
3d1a7bf
[`tokenizers`] update `tokenizers` version pin (#27494)
ArthurZucker Nov 15, 2023
453079c
🚨🚨 Fix beam score calculation issue for decoder-only models (#27351)
VsonicV Nov 15, 2023
48ba1e0
[ `PretrainedConfig`] Improve messaging (#27438)
ArthurZucker Nov 15, 2023
a85ea4b
Fix wav2vec2 params (#27515)
muellerzr Nov 15, 2023
a0633c4
Translating `en/model_doc` docs to Japanese. (#27401)
Yuki-Imajuku Nov 15, 2023
e6522e4
Fixing the failure of models without max_position_embeddings attribut…
AdamLouly Nov 15, 2023
2e72bba
Incorrect setting for num_beams in translation and summarization exam…
Rocketknight1 Nov 15, 2023
b71c38a
Fix bug for T5x to PyTorch convert script with varying encoder and de…
JamesJiang97 Nov 15, 2023
1ac599d
Fix offload disk for loading derivated model checkpoint into base mod…
SunMarc Nov 15, 2023
06343b0
translate model.md to chinese (#27518)
ji-huazhong Nov 16, 2023
1394e08
Support ONNX export for causal LM sequence classifiers (#27450)
dwyatte Nov 16, 2023
85fde09
[`pytest`] Avoid flash attn test marker warning (#27509)
ArthurZucker Nov 16, 2023
69c9b89
docs: add docs for map, and add num procs to load_dataset (#27520)
pphuc25 Nov 16, 2023
4989e73
Update the TF pin for 2.15 (#27375)
Rocketknight1 Nov 16, 2023
5603fad
Revert "add attention_mask and position_ids in assisted model" (#27523)
patrickvonplaten Nov 16, 2023
fd65aa9
Set `usedforsecurity=False` in hashlib methods (FIPS compliance) (#27…
Wauplin Nov 16, 2023
6b39470
Raise error when quantizing a quantized model (#27500)
SunMarc Nov 16, 2023
acb5b4a
Disable docker image build job `latest-pytorch-amd` for now (#27541)
ydshieh Nov 16, 2023
651408a
[`Styling`] stylify using ruff (#27144)
ArthurZucker Nov 16, 2023
12b50c6
Generate: improve assisted generation tests (#27540)
gante Nov 16, 2023
93f31e0
Updated albert.md doc for ALBERT model (#27223)
ENate Nov 16, 2023
b074461
translate Trainer.md to chinese (#27527)
jiaqiw09 Nov 16, 2023
fe3ce06
Skip some fuyu tests (#27553)
ydshieh Nov 17, 2023
d903abf
Fix AMD CI not showing GPU (#27555)
ydshieh Nov 17, 2023
913d03d
Generate: fix flaky tests (#27543)
gante Nov 17, 2023
5330b83
Generate: update compute transition scores doctest (#27558)
gante Nov 17, 2023
638d499
fixed broken link (#27560)
VpkPrasanna Nov 17, 2023
ffbcfc0
Broken links fixed related to datasets docs (#27569)
VpkPrasanna Nov 17, 2023
d1a00f9
translate deepspeed.md to chinese (#27495)
jiaqiw09 Nov 17, 2023
25b0f20
Fix broken distilbert url (#27579)
osanseviero Nov 18, 2023
dc68a39
Adding leaky relu in dict ACT2CLS (#27574)
rafaelpadilla Nov 19, 2023
dbf7bfa
Fix idx2sym not loaded from pretrained vocab file in Transformer XL (…
jtang98 Nov 20, 2023
ee29261
Add `convert_hf_to_openai.py` script to Whisper documentation resourc…
zuazo Nov 20, 2023
e4280d6
docs: fix 404 link (#27529)
panpan0000 Nov 20, 2023
f31af39
[ examples] fix loading jsonl with load dataset in run translation ex…
mathiasesn Nov 20, 2023
e66984f
[`FA-2`] Add fa2 support for `from_config` (#26914)
younesbelkada Nov 20, 2023
93f2de8
timm to pytorch conversion for vit model fix (#26908)
staghado Nov 20, 2023
87e217d
[Whisper] Add `large-v3` version support (#27336)
flyingleafe Nov 20, 2023
f18c95b
Update Korean tutorial for using LLMs, and refactor the nested condit…
YeonwooSung Nov 20, 2023
38e2633
Fix torch.fx import issue for torch 1.12 (#27570)
amyeroberts Nov 20, 2023
8eb9e29
dvclive callback: warn instead of fail when logging non-scalars (#27608)
Nov 21, 2023
0e6794f
[`core` / `gradient_checkpointing`] add support for old GC method (#2…
younesbelkada Nov 21, 2023
ade7af9
[ConvNext] Improve backbone (#27621)
NielsRogge Nov 21, 2023
81b7981
Generate: Update docs regarding reusing `past_key_values` in `generat…
gante Nov 21, 2023
851a4f7
Idefics: Fix information leak with cross attention gate in modeling (…
leot13 Nov 21, 2023
f93c1e9
Add RoCm scheduled CI & upgrade RoCm CI to PyTorch 2.1 (#26940)
fxmarty Nov 21, 2023
82cc0a7
Fix flash attention bugs with Mistral and Falcon (#27625)
fxmarty Nov 21, 2023
0145c68
Fix tracing dinov2 (#27561)
amyeroberts Nov 21, 2023
f5c9738
remove the deprecated method `init_git_repo` (#27617)
ji-huazhong Nov 21, 2023
c770600
TVP model (#25856)
jiqing-feng Nov 21, 2023
7f04373
Explicitely specify `use_cache=True` in Flash Attention tests (#27635)
fxmarty Nov 21, 2023
d2a980e
Harmonize HF environment variables + other cleaning (#27564)
Wauplin Nov 21, 2023
c5be38c
Fix `resize_token_embeddings` (#26861) (#26865)
czy-orange Nov 21, 2023
b54993a
[`dependency`] update pillow pins (#27409)
ArthurZucker Nov 22, 2023
c651eb2
Simplify the implementation of jitter noise in moe models (#27643)
jiangwangyi Nov 22, 2023
b2c63c7
Fix `max_steps` documentation regarding the end-of-training condition…
qgallouedec Nov 22, 2023
4151fbb
[Whisper] Add sequential longform decoding (#27492)
patrickvonplaten Nov 22, 2023
7f6a804
Add UnivNet Vocoder Model for Tortoise TTS Diffusers Integration (#24…
dg845 Nov 22, 2023
8aca43b
update Openai API call method (#27628)
Strive-for-excellence Nov 22, 2023
1ddc4fa
update d_kv'annotation in mt5'configuration (#27585)
callanwu Nov 23, 2023
3bc50d8
[`FA2`] Add flash attention for opt (#26414)
susnato Nov 23, 2023
baabd38
Extended semantic segmentation to image segmentation (#27039)
merveenoyan Nov 23, 2023
b406c4d
Update TVP arxiv link (#27672)
amyeroberts Nov 23, 2023
fe1c16e
[DPT, Dinov2] Add resources (#27655)
NielsRogge Nov 23, 2023
b8db265
Update tiny model summary file (#27388)
ydshieh Nov 23, 2023
a761d6e
Refactoring Trainer, adds `save_only_model` arg and simplifying FSDP …
pacman100 Nov 24, 2023
623432d
Skip pipeline tests for 2 models for now (#27687)
ydshieh Nov 24, 2023
7293fdc
Deprecate `TransfoXL` (#27607)
ydshieh Nov 24, 2023
80e9f76
Fix typo in warning message (#27055)
liuxueyang Nov 24, 2023
181f85d
Docs/Add conversion code to the musicgen docs (#27665)
yoinked-h Nov 24, 2023
2098d34
Fix semantic error in evaluation section (#27675)
anihm136 Nov 24, 2023
a6d178e
[`DocString`] Support a revision in the docstring `add_code_sample_do…
ArthurZucker Nov 24, 2023
c13a43a
Reflect RoCm support in the documentation (#27636)
fxmarty Nov 24, 2023
29c9480
Successfully Resolved The ZeroDivisionError Exception. (#27524)
hisushanta Nov 24, 2023
35551f9
Fix `TVPModelTest` (#27695)
ydshieh Nov 24, 2023
f70db28
Fix sliding_window hasattr in Mistral (#27041)
IlyaGusev Nov 26, 2023
e11e26d
Fix Past CI (#27696)
ydshieh Nov 27, 2023
5c30dd4
fix warning (#27689)
ArthurZucker Nov 27, 2023
27b752b
Reorder the code on the Hub to explicit that sharing on the Hub isn't…
LysandreJik Nov 27, 2023
b09912c
Fix mistral generate for long prompt / response (#27548)
lorabit110 Nov 27, 2023
307cf3a
Fix oneformer instance segmentation RuntimeError (#27725)
yhshin11 Nov 27, 2023
1d7f406
fix assisted decoding assistant model inputs (#27503)
jiqing-feng Nov 27, 2023
59499bb
Update forward signature test for vision models (#27681)
NielsRogge Nov 27, 2023
334a6d1
Modify group_sub_entities in TokenClassification Pipeline to support …
eshoyuan Nov 27, 2023
c832bcb
Fix owlv2 code snippet (#27698)
NielsRogge Nov 27, 2023
ce31508
docs: replace torch.distributed.run by torchrun (#27528)
panpan0000 Nov 27, 2023
5 changes: 2 additions & 3 deletions .circleci/config.yml
@@ -157,11 +157,10 @@ jobs:
command: pip freeze | tee installed.txt
- store_artifacts:
path: ~/transformers/installed.txt
- run: black --check examples tests src utils
- run: ruff examples tests src utils
- run: ruff check examples tests src utils
- run: ruff format tests src utils --check
- run: python utils/custom_init_isort.py --check_only
- run: python utils/sort_auto_mappings.py --check_only
- run: doc-builder style src/transformers docs/source --max_len 119 --check_only --path_to_docs docs/source
- run: python utils/check_doc_toc.py

check_repository_consistency:
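The hunk above appears to replace the `black --check` step with `ruff check` and `ruff format --check` (the scraped diff has lost its +/- markers, so this reading is inferred from the "2 additions & 3 deletions" count). A minimal sketch for running the same two commands locally, assuming `ruff` is installed and on `PATH`. The helper names `lint_commands` and `run_lint` are hypothetical and not part of the repository.

```python
import shutil
import subprocess


def lint_commands():
    """Return the two ruff invocations as they appear in the CI config hunk."""
    return [
        ["ruff", "check", "examples", "tests", "src", "utils"],
        ["ruff", "format", "tests", "src", "utils", "--check"],
    ]


def run_lint(dry_run=True):
    """Run each command and collect its exit code.

    In dry-run mode, or when ruff is not installed, nothing is executed
    and None is recorded for that command instead.
    """
    results = []
    for cmd in lint_commands():
        if dry_run or shutil.which(cmd[0]) is None:
            results.append(None)
        else:
            results.append(subprocess.run(cmd, check=False).returncode)
    return results


if __name__ == "__main__":
    print(run_lint(dry_run=True))
```

Calling `run_lint(dry_run=False)` from the repository root would execute both checks; a non-zero exit code means the check failed.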
16 changes: 9 additions & 7 deletions .circleci/create_circleci_config.py
@@ -15,7 +15,6 @@

import argparse
import copy
import glob
import os
import random
from dataclasses import dataclass
@@ -239,7 +238,7 @@ def to_dict(self):

py_command = f'import os; fp = open("reports/{self.job_name}/summary_short.txt"); failed = os.linesep.join([x for x in fp.read().split(os.linesep) if x.startswith("ERROR ")]); fp.close(); fp = open("summary_short.txt", "w"); fp.write(failed); fp.close()'
check_test_command += f"$(python3 -c '{py_command}'); "
check_test_command += f'cat summary_short.txt; echo ""; exit -1; '
check_test_command += 'cat summary_short.txt; echo ""; exit -1; '

# Deeal with failed tests
check_test_command += f'elif [ -s reports/{self.job_name}/failures_short.txt ]; '
@@ -249,7 +248,7 @@ def to_dict(self):

py_command = f'import os; fp = open("reports/{self.job_name}/summary_short.txt"); failed = os.linesep.join([x for x in fp.read().split(os.linesep) if x.startswith("FAILED ")]); fp.close(); fp = open("summary_short.txt", "w"); fp.write(failed); fp.close()'
check_test_command += f"$(python3 -c '{py_command}'); "
check_test_command += f'cat summary_short.txt; echo ""; exit -1; '
check_test_command += 'cat summary_short.txt; echo ""; exit -1; '

check_test_command += f'elif [ -s reports/{self.job_name}/stats.txt ]; then echo "All tests pass!"; '
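The one-line `py_command` strings in the two hunks above are hard to read. As a sketch of what they do: each reads a pytest short summary, keeps only the lines starting with `ERROR ` or `FAILED `, and writes them back out. The function name `filter_summary` and the sample summary text are illustrative assumptions, not code from the repository.

```python
import os


def filter_summary(text, prefix):
    """Keep only the lines of a short test summary that start with `prefix`."""
    return os.linesep.join(
        line for line in text.split(os.linesep) if line.startswith(prefix)
    )


# Hypothetical summary content, for illustration only.
sample = os.linesep.join(
    [
        "FAILED tests/test_a.py::test_one - AssertionError",
        "ERROR tests/test_b.py - ImportError",
        "PASSED tests/test_c.py::test_two",
    ]
)

failed_only = filter_summary(sample, "FAILED ")
errors_only = filter_summary(sample, "ERROR ")

# The other change in these hunks drops an `f` prefix from a string that has
# no placeholders; the resulting string is identical either way.
same_string = f'cat summary_short.txt; echo ""; exit -1; ' == 'cat summary_short.txt; echo ""; exit -1; '
```

Because the `f` prefix was redundant, removing it is a lint cleanup and does not change the generated shell command.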

@@ -283,7 +282,7 @@ def job_name(self):
"pip install --upgrade --upgrade-strategy eager pip",
"pip install -U --upgrade-strategy eager .[sklearn,tf-cpu,torch,testing,sentencepiece,torch-speech,vision]",
"pip install -U --upgrade-strategy eager tensorflow_probability",
"pip install -U --upgrade-strategy eager git+https://github.com/huggingface/accelerate",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
],
marker="is_pt_tf_cross_test",
pytest_options={"rA": None, "durations": 0},
@@ -297,7 +296,7 @@ def job_name(self):
"sudo apt-get -y update && sudo apt-get install -y libsndfile1-dev espeak-ng",
"pip install -U --upgrade-strategy eager --upgrade pip",
"pip install -U --upgrade-strategy eager .[sklearn,flax,torch,testing,sentencepiece,torch-speech,vision]",
"pip install -U --upgrade-strategy eager git+https://github.com/huggingface/accelerate",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
],
marker="is_pt_flax_cross_test",
pytest_options={"rA": None, "durations": 0},
@@ -310,7 +309,7 @@ def job_name(self):
"sudo apt-get -y update && sudo apt-get install -y libsndfile1-dev espeak-ng time",
"pip install --upgrade --upgrade-strategy eager pip",
"pip install -U --upgrade-strategy eager .[sklearn,torch,testing,sentencepiece,torch-speech,vision,timm]",
"pip install -U --upgrade-strategy eager git+https://github.com/huggingface/accelerate",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
],
parallelism=1,
pytest_num_workers=6,
@@ -397,13 +396,16 @@ def job_name(self):

examples_torch_job = CircleCIJob(
"examples_torch",
additional_env={"OMP_NUM_THREADS": 8},
cache_name="torch_examples",
install_steps=[
"sudo apt-get -y update && sudo apt-get install -y libsndfile1-dev espeak-ng",
"pip install --upgrade --upgrade-strategy eager pip",
"pip install -U --upgrade-strategy eager .[sklearn,torch,sentencepiece,testing,torch-speech]",
"pip install -U --upgrade-strategy eager -r examples/pytorch/_tests_requirements.txt",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
],
pytest_num_workers=1,
)
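Several install steps in this file move from a plain `git+https://...` requirement to an editable one of the form `-e git+https://github.com/huggingface/accelerate@main#egg=accelerate`. In pip's VCS requirement syntax, `@main` selects the ref to check out and `#egg=` names the project. The sketch below splits such a string into its parts using only the standard library; `parse_vcs_requirement` is a hypothetical helper, not something pip or the repository provides.

```python
from urllib.parse import parse_qs, urlsplit


def parse_vcs_requirement(req):
    """Split a pip VCS requirement into (vcs, repo_url, ref, egg)."""
    parts = urlsplit(req)
    # Scheme looks like "git+https": VCS name, then the transport.
    vcs, _, transport = parts.scheme.partition("+")
    path, sep, ref = parts.path.rpartition("@")
    if not sep:
        # No "@ref" suffix: the whole path is the repository path.
        path, ref = parts.path, None
    egg = parse_qs(parts.fragment).get("egg", [None])[0]
    return vcs, f"{transport}://{parts.netloc}{path}", ref, egg


old_req = "git+https://github.com/huggingface/accelerate"
new_req = "git+https://github.com/huggingface/accelerate@main#egg=accelerate"

old_parts = parse_vcs_requirement(old_req)
new_parts = parse_vcs_requirement(new_req)
```

Both strings point at the same repository; the new form only adds the explicit ref and project name.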


@@ -510,7 +512,7 @@ def job_name(self):
"sudo apt-get -y update && sudo apt-get install -y libsndfile1-dev espeak-ng time ffmpeg",
"pip install --upgrade --upgrade-strategy eager pip",
"pip install -U --upgrade-strategy eager -e .[dev]",
"pip install -U --upgrade-strategy eager git+https://github.com/huggingface/accelerate",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
"pip install --upgrade --upgrade-strategy eager pytest pytest-sugar",
"pip install -U --upgrade-strategy eager natten",
"find -name __pycache__ -delete",
4 changes: 4 additions & 0 deletions .github/conda/meta.yaml
@@ -26,6 +26,8 @@ requirements:
- protobuf
- tokenizers >=0.11.1,!=0.11.3,<0.13
- pyyaml >=5.1
- safetensors
- fsspec
run:
- python
- numpy >=1.17
@@ -40,6 +42,8 @@ requirements:
- protobuf
- tokenizers >=0.11.1,!=0.11.3,<0.13
- pyyaml >=5.1
- safetensors
- fsspec

test:
imports:
2 changes: 1 addition & 1 deletion .github/workflows/add-model-like.yml
@@ -14,7 +14,7 @@ on:
jobs:
run_tests_templates_like:
name: "Add new model like template tests"
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- uses: actions/checkout@v3
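This workflow, like most of the others in the diff, swaps the floating `ubuntu-latest` runner label for the pinned `ubuntu-22.04`. A rough sketch of checking workflow text for any remaining floating labels; the regex-based helper `unpinned_runners` is an assumption for illustration, and a real check would more likely parse the YAML properly.

```python
import re

# Matches a `runs-on:` key and captures its value on the same line.
RUNS_ON = re.compile(r"^\s*runs-on:\s*(\S.*?)\s*$", re.MULTILINE)


def unpinned_runners(workflow_text):
    """Return the runs-on values that still use a floating `-latest` label."""
    return [value for value in RUNS_ON.findall(workflow_text) if value.endswith("-latest")]


before = "jobs:\n  run_tests_templates_like:\n    runs-on: ubuntu-latest\n"
after = before.replace("ubuntu-latest", "ubuntu-22.04")

remaining_before = unpinned_runners(before)
remaining_after = unpinned_runners(after)
```

Self-hosted label lists such as `[single-gpu, nvidia-gpu, t4, ci]` are not reported, since they do not end in `-latest`.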

81 changes: 41 additions & 40 deletions .github/workflows/build-docker-images.yml
@@ -20,7 +20,7 @@ concurrency:
jobs:
latest-docker:
name: "Latest PyTorch + TensorFlow [dev]"
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Cleanup disk
run: |
@@ -69,7 +69,7 @@ jobs:

latest-torch-deepspeed-docker:
name: "Latest PyTorch + DeepSpeed"
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Cleanup disk
run: |
@@ -106,7 +106,7 @@ jobs:
# Can't build 2 images in a single job `latest-torch-deepspeed-docker` (for `nvcr.io/nvidia`)
latest-torch-deepspeed-docker-for-push-ci-daily-build:
name: "Latest PyTorch + DeepSpeed (Push CI - Daily Build)"
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Cleanup disk
run: |
@@ -148,7 +148,7 @@ jobs:
name: "Doc builder"
# Push CI doesn't need this image
if: inputs.image_postfix != '-push-ci'
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
-
name: Set up Docker Buildx
@@ -174,7 +174,7 @@ jobs:
name: "Latest PyTorch [dev]"
# Push CI doesn't need this image
if: inputs.image_postfix != '-push-ci'
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Cleanup disk
run: |
@@ -208,46 +208,47 @@ jobs:
push: true
tags: huggingface/transformers-pytorch-gpu

latest-pytorch-amd:
name: "Latest PyTorch (AMD) [dev]"
runs-on: [self-hosted, docker-gpu, amd-gpu, single-gpu, mi210]
steps:
- name: Set up Docker Buildx
uses: docker/setup-buildx-action@v3
- name: Check out code
uses: actions/checkout@v3
- name: Login to DockerHub
uses: docker/login-action@v3
with:
username: ${{ secrets.DOCKERHUB_USERNAME }}
password: ${{ secrets.DOCKERHUB_PASSWORD }}
- name: Build and push
uses: docker/build-push-action@v5
with:
context: ./docker/transformers-pytorch-amd-gpu
build-args: |
REF=main
push: true
tags: huggingface/transformers-pytorch-amd-gpu${{ inputs.image_postfix }}
# Push CI images still need to be re-built daily
-
name: Build and push (for Push CI) in a daily basis
# This condition allows `schedule` events, or `push` events that trigger this workflow NOT via `workflow_call`.
# The later case is useful for manual image building for debugging purpose. Use another tag in this case!
if: inputs.image_postfix != '-push-ci'
uses: docker/build-push-action@v5
with:
context: ./docker/transformers-pytorch-amd-gpu
build-args: |
REF=main
push: true
tags: huggingface/transformers-pytorch-amd-gpu-push-ci
# Need to be fixed with the help from Guillaume.
# latest-pytorch-amd:
# name: "Latest PyTorch (AMD) [dev]"
# runs-on: [self-hosted, docker-gpu, amd-gpu, single-gpu, mi210]
# steps:
# - name: Set up Docker Buildx
# uses: docker/setup-buildx-action@v3
# - name: Check out code
# uses: actions/checkout@v3
# - name: Login to DockerHub
# uses: docker/login-action@v3
# with:
# username: ${{ secrets.DOCKERHUB_USERNAME }}
# password: ${{ secrets.DOCKERHUB_PASSWORD }}
# - name: Build and push
# uses: docker/build-push-action@v5
# with:
# context: ./docker/transformers-pytorch-amd-gpu
# build-args: |
# REF=main
# push: true
# tags: huggingface/transformers-pytorch-amd-gpu${{ inputs.image_postfix }}
# # Push CI images still need to be re-built daily
# -
# name: Build and push (for Push CI) in a daily basis
# # This condition allows `schedule` events, or `push` events that trigger this workflow NOT via `workflow_call`.
# # The later case is useful for manual image building for debugging purpose. Use another tag in this case!
# if: inputs.image_postfix != '-push-ci'
# uses: docker/build-push-action@v5
# with:
# context: ./docker/transformers-pytorch-amd-gpu
# build-args: |
# REF=main
# push: true
# tags: huggingface/transformers-pytorch-amd-gpu-push-ci

latest-tensorflow:
name: "Latest TensorFlow [dev]"
# Push CI doesn't need this image
if: inputs.image_postfix != '-push-ci'
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
-
name: Set up Docker Buildx
4 changes: 2 additions & 2 deletions .github/workflows/build-nightly-ci-docker-images.yml
@@ -13,7 +13,7 @@ concurrency:
jobs:
latest-with-torch-nightly-docker:
name: "Nightly PyTorch + Stable TensorFlow"
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Cleanup disk
run: |
@@ -50,7 +50,7 @@ jobs:

nightly-torch-deepspeed-docker:
name: "Nightly PyTorch + DeepSpeed"
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Cleanup disk
run: |
4 changes: 2 additions & 2 deletions .github/workflows/build-past-ci-docker-images.yml
@@ -16,7 +16,7 @@ jobs:
fail-fast: false
matrix:
version: ["1.13", "1.12", "1.11", "1.10"]
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
-
name: Set up Docker Buildx
@@ -60,7 +60,7 @@ jobs:
fail-fast: false
matrix:
version: ["2.11", "2.10", "2.9", "2.8", "2.7", "2.6", "2.5"]
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
-
name: Set up Docker Buildx
2 changes: 1 addition & 1 deletion .github/workflows/build_documentation.yml
@@ -15,7 +15,7 @@ jobs:
commit_sha: ${{ github.sha }}
package: transformers
notebook_folder: transformers_doc
languages: de en es fr hi it ko pt zh ja te
languages: de en es fr hi it ko pt tr zh ja te
secrets:
token: ${{ secrets.HUGGINGFACE_PUSH }}
hf_token: ${{ secrets.HF_DOC_BUILD_PUSH }}
2 changes: 1 addition & 1 deletion .github/workflows/build_pr_documentation.yml
@@ -14,4 +14,4 @@ jobs:
commit_sha: ${{ github.event.pull_request.head.sha }}
pr_number: ${{ github.event.number }}
package: transformers
languages: de en es fr hi it ko pt zh ja te
languages: de en es fr hi it ko pt tr zh ja te
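The only change in the two documentation workflows is the `languages:` value. A small sketch that compares the old and new lists from the hunks above to confirm what was added; the variable names are illustrative.

```python
# Values copied from the `languages:` lines in the two hunks above.
old_languages = "de en es fr hi it ko pt zh ja te".split()
new_languages = "de en es fr hi it ko pt tr zh ja te".split()

# Order-preserving difference in both directions.
added = [lang for lang in new_languages if lang not in old_languages]
removed = [lang for lang in old_languages if lang not in new_languages]
```

The single added code, `tr`, lines up with the "Translate index.md to Turkish (#27093)" commit in the list above.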
4 changes: 2 additions & 2 deletions .github/workflows/check_runner_status.yml
@@ -18,7 +18,7 @@ env:
jobs:
check_runner_status:
name: Check Runner Status
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
outputs:
offline_runners: ${{ steps.set-offline_runners.outputs.offline_runners }}
steps:
@@ -39,7 +39,7 @@ jobs:

send_results:
name: Send results to webhook
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
needs: check_runner_status
if: ${{ failure() }}
steps:
2 changes: 1 addition & 1 deletion .github/workflows/check_tiny_models.yml
@@ -14,7 +14,7 @@ env:
jobs:
check_tiny_models:
name: Check tiny models
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Checkout transformers
uses: actions/checkout@v3
4 changes: 2 additions & 2 deletions .github/workflows/doctests.yml
@@ -20,7 +20,7 @@ env:

jobs:
run_doctests:
runs-on: [single-gpu, nvidia-gpu, t4, doctest-ci]
runs-on: [single-gpu, nvidia-gpu, t4, ci]
container:
image: huggingface/transformers-all-latest-gpu
options: --gpus 0 --shm-size "16gb" --ipc host -v /mnt/cache/.cache/huggingface:/mnt/cache/
@@ -66,7 +66,7 @@ jobs:

send_results:
name: Send results to webhook
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
if: always()
needs: [run_doctests]
steps:
2 changes: 1 addition & 1 deletion .github/workflows/model-templates.yml
@@ -7,7 +7,7 @@ on:

jobs:
run_tests_templates:
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
steps:
- name: Checkout repository
uses: actions/checkout@v3
2 changes: 1 addition & 1 deletion .github/workflows/release-conda.yml
@@ -12,7 +12,7 @@ env:

jobs:
build_and_package:
runs-on: ubuntu-latest
runs-on: ubuntu-22.04
defaults:
run:
shell: bash -l {0}