Prism #1

DaryaTereshchenko · 2024-12-17T09:37:12Z

What does this PR do?

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

…ingface#13283)

Co-authored-by: Boumadane Abdelmoumene <moumene.boumadane@gmail.com>

* Add note on amy's contribution. Co-authored-by: Amy Roberts <aeroberts4444@gmail.com> * remove non-tech comment. Co-authored by: Amy Roberts <aeroberts4444@gmail.com> Co-authored-by: Amy Roberts <aeroberts4444@gmail.com>

) * Adding `zero-shot-object-detection` pipeline doctest. * Remove nested_simplify.

…ace#20952) * Add generate kwargs to AutomaticSpeechRecognitionPipeline * Add test for generation kwargs

* LLaMA * sharding and docs * tweak * black * inits * ruff * LLAMA_PRETRAINED_CONFIG_ARCHIVE_MAP * init * no checkpoint * docs * ruff * type_vocab_size * tokenizer fixes * tokenizer fixes * Update tokenization_llama.py * Update tokenization_llama.py * Update configuration_llama.py * Update modeling_llama.py * tokenizer add_bos by default * licenses * remove decoder * norms and mlp * rope overhaul * tweaks * black * mention OPT implementation * off-by-one naming * typo * fix * tokenization fix and slicing bug * padding config * cleanup * black * update tests * undo typo * fix vocab caching logic * ruff * docbuilder * attn fix from BlackSamorez * initial feedback * typo * docs * llama case * llama case * load checkpoint docs * comment about tokenizer * tokenizer defaults * clear past_key_values if use_cache=False * last tweaks * last tweaks * last tweaks * last tweaks --------- Co-authored-by: Stella Biderman <stellabiderman@gmail.com>

fixed a typo

@itazap

Hello! ## Pull Request overview * Fix typo ## Details This should speak for itself. cc @itazap @ArthurZucker - Tom Aarsen

Add new model

add fixes and documentation

Changes pr2

…izer fix additional tokens list and run tests

…arity test

add a fix to special tokens handling and add the test_batch_fairseq_p…

hopefully last commit

* gptqmodel Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update readme Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * gptqmodel need use checkpoint_format (#1) * gptqmodel need use checkpoint_format * fix quantize * Update quantization_config.py * Update quantization_config.py * Update quantization_config.py --------- Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai> * Revert quantizer_gptq.py (huggingface#2) * revert quantizer_gptq.py change * pass **kwargs * limit gptqmodel and optimum version Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix warning Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix version check Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert unrelated changes Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * enable gptqmodel tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix requires gptq Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * Fix Transformer compat (huggingface#3) * revert quantizer_gptq.py change * pass **kwargs * add meta info * cleanup * cleanup * Update quantization_config.py * hf_select_quant_linear pass checkpoint_format and meta * fix GPTQTestCUDA * Update test_gptq.py * gptqmodel.hf_select_quant_linear() now does not select ExllamaV2 * cleanup * add backend * cleanup * cleanup * no need check exllama version * Update quantization_config.py * lower checkpoint_format and backend * check none * cleanup * Update quantization_config.py * fix self.use_exllama == False * spell * fix unittest * fix unittest --------- Co-authored-by: LRL <lrl@lbx.dev> Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format again Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update gptqmodel version (huggingface#6) * update gptqmodel version * update gptqmodel version * fix unit test (huggingface#5) * update gptqmodel version * update gptqmodel version * "not self.use_exllama" is not equivalent to "self.use_exllama==False" * fix unittest * update gptqmodel version * backend is loading_attibutes (huggingface#7) * fix format and tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix memory check Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix device mismatch Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix result check Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * Update src/transformers/quantizers/quantizer_gptq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_gptq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_gptq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * update tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * review: update docs (huggingface#10) * review: update docs (huggingface#12) * review: update docs * fix typo * update tests for gptqmodel Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update document (huggingface#9) * update overview.md * cleanup * Update overview.md * Update overview.md * Update overview.md * update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md --------- Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai> * typo * doc note for asymmetric quant * typo with apple silicon(e) * typo for marlin * column name revert: review * doc rocm support * Update docs/source/en/quantization/gptq.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/gptq.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/gptq.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/gptq.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/overview.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/overview.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com> Co-authored-by: LRL-ModelCloud <165116337+LRL-ModelCloud@users.noreply.github.com> Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai> Co-authored-by: ZX-ModelCloud <165115237+ZX-ModelCloud@users.noreply.github.com> Co-authored-by: LRL <lrl@lbx.dev> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

sgugger and others added 30 commits October 27, 2024 16:45

Trigger CI

24a67db

Selected typo fix (huggingface#6687)

929720b

Add model cards for DynaBERT (huggingface#7999)

7acf7b8

Create README.md

eb26b82

Moving text2text-generation to new pipeline testing mecanism. (hugg…

e4446f1

…ingface#13283)

Empty commit to retrigger build doc

c71061a

Trigger doc building

8e1de9e

Trigger doc build

dc4297b

Trigger doc build

8a400fc

fixed pipeline code (huggingface#15607)

c43e6ca

Co-authored-by: Boumadane Abdelmoumene <moumene.boumadane@gmail.com>

Trigger doc build

8060ae3

Trigger doc build

c2607b8

Trigger doc build

10e705c

Fixed wrong error message for missing weight file (huggingface#17216)

9a2ad62

Fix gather for metrics (huggingface#19389)

c2986cb

Adding zero-shot-object-detection pipeline doctest. (huggingface#20274

8c79502

) * Adding `zero-shot-object-detection` pipeline doctest. * Remove nested_simplify.

Add generate kwargs to AutomaticSpeechRecognitionPipeline (huggingf…

f4c3c38

…ace#20952) * Add generate kwargs to AutomaticSpeechRecognitionPipeline * Add test for generation kwargs

Trigger CI

f9a9396

Update README.md (huggingface#25922)

444ad9e

fixed a typo

Fix typo: depracted -> deprecated (huggingface#32489)

0ecee74

Hello! ## Pull Request overview * Fix typo ## Details This should speak for itself. cc @itazap @ArthurZucker - Tom Aarsen

add working tokenizer test

bd2856a

add working modeling test

2d196b3

fix @slow for some tests

ddea73f

Add PRISM model

0f2c57c

Add final version with checks

a291030

add prism

c6f5df3

delete cache

42810c0

add test changes

11dea6e

DaryaTereshchenko and others added 13 commits October 27, 2024 17:22

Merge pull request #1 from DaryaTereshchenko/new_branch_name

eafd847

Add new model

add fixes and documentation

c70f864

Merge pull request huggingface#2 from DaryaTereshchenko/changes_to_pr1

8350215

add fixes and documentation

change prism.md

56dae91

make the repo consistent

77ccf60

Merge pull request huggingface#3 from DaryaTereshchenko/changes_pr2

2603cf8

Changes pr2

fix additional tokens and run tests

17c1198

Merge pull request huggingface#4 from DaryaTereshchenko/changes_token…

8bedcb3

…izer fix additional tokens list and run tests

add a fix to special tokens handling and add the test_batch_fairseq_p…

30e4169

…arity test

Merge pull request huggingface#5 from DaryaTereshchenko/special_tokens

64b14f6

add a fix to special tokens handling and add the test_batch_fairseq_p…

hopefully last commit

555b80c

Merge pull request huggingface#6 from DaryaTereshchenko/last_commit

2b8fd84

hopefully last commit

merge with the main repo

d90d48a

DaryaTereshchenko merged commit 9571723 into main Dec 17, 2024
3 checks passed

DaryaTereshchenko deleted the prism branch December 17, 2024 09:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prism #1

Prism #1

DaryaTereshchenko commented Dec 17, 2024

Prism #1

Prism #1

Conversation

DaryaTereshchenko commented Dec 17, 2024

What does this PR do?

Before submitting

Who can review?