Update-llama-code #25826

ArthurZucker · 2023-08-29T13:44:42Z

What does this PR do?

Update based on reviews from Llama team and nits here and there!

Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>

Co-authored-by: pcuenca <pedro@latenitesoft.com>

HuggingFaceDocBuilderDev · 2023-08-29T14:08:17Z

The documentation is not available anymore as the PR was closed or merged.

ArthurZucker · 2023-08-30T12:22:28Z

src/transformers/models/code_llama/tokenization_code_llama_fast.py

@@ -316,36 +322,8 @@ def save_vocabulary(self, save_directory: str, filename_prefix: Optional[str] =

        return (out_vocab_file,)

-    def build_inputs_with_special_tokens(


~removed as it is not used ~ is rarely used, nut still let's keep it.

ArthurZucker · 2023-08-30T12:31:12Z

tests/models/code_llama/test_tokenization_code_llama.py

@@ -587,8 +592,8 @@ def main():
 end
 """,
        ]
-        tokenizer = CodeLlamaTokenizer.from_pretrained("codellama/CodeLlama-7b-hf")
-        tokenizer_fast = CodeLlamaTokenizerFast.from_pretrained("codellama/CodeLlama-7b-hf")
+        tokenizer = CodeLlamaTokenizer.from_pretrained("codellama/CodeLlama-7b-Instruct-hf")


the other model does not support infiling

osanseviero

Out of curiosity on how things are handled in transformers, isn't removing pad_token a backwards-compatibility breaking change?

docs/source/en/model_doc/code_llama.md

amyeroberts

Thanks for updating!

A few comments about backwards compatibility and making sure params are properly documented

src/transformers/models/code_llama/tokenization_code_llama.py

amyeroberts · 2023-08-31T18:26:28Z

src/transformers/models/code_llama/tokenization_code_llama.py

+                if not conversation.new_user_input.startswith(B_SYS) or E_SYS not in conversation.new_user_input:
+                    conversation.new_user_input = B_SYS + DEFAULT_SYSTEM_PROMPT + E_SYS + conversation.new_user_input
+            else:
+                raise ValueError("Last message must be from user")


Do we not want to check that the conversation ids start with B_SYS and contain E_SYS even if we're not using the default prompt?

No this was just to add the system prompt if there are no system prompt. Now we just let the user define the system prompt!

amyeroberts · 2023-08-31T18:27:44Z

src/transformers/models/code_llama/tokenization_code_llama_fast.py

        add_bos_token=True,
        add_eos_token=False,
+        use_default_system_prompt=False,


The additional args should be documented in the doc string

Indeed thanks

(not for this PR) The add_bos_token and add_eos_token are not documented, and the args are in a very different order than the docstring

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

…ransformers into update-llama-code

LysandreJik

Thanks for your changes

docs/source/en/model_doc/code_llama.md

src/transformers/models/code_llama/tokenization_code_llama.py

LysandreJik · 2023-09-01T17:08:26Z

src/transformers/models/code_llama/tokenization_code_llama_fast.py

        add_bos_token=True,
        add_eos_token=False,
+        use_default_system_prompt=False,


(not for this PR) The add_bos_token and add_eos_token are not documented, and the args are in a very different order than the docstring

…ransformers into update-llama-code

…usage

* some bug fixes * updates * Update code_llama.md Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> * Add co author Co-authored-by: pcuenca <pedro@latenitesoft.com> * add a test * fixup * nits * some updates * fix-coies * adress comments * nits * nits * fix docsting * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * update * add int for https://huggingface.co/spaces/hf-accelerate/model-memory-usage --------- Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by: pcuenca <pedro@latenitesoft.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

ArthurZucker and others added 4 commits August 29, 2023 13:34

some bug fixes

450c029

updates

6030191

Update code_llama.md

9cde7c8

Co-authored-by: Omar Sanseviero <osanseviero@users.noreply.github.com>

Add co author

d7d1817

Co-authored-by: pcuenca <pedro@latenitesoft.com>

ArthurZucker added 3 commits August 30, 2023 11:53

add a test

d18ec81

fixup

c89dedb

nits

e7936b7

ArthurZucker commented Aug 30, 2023

View reviewed changes

ArthurZucker added 2 commits August 31, 2023 12:06

some updates

1b7220e

fix-coies

be699b9

ArthurZucker marked this pull request as ready for review August 31, 2023 13:37

ArthurZucker requested review from amyeroberts and osanseviero August 31, 2023 13:37

osanseviero reviewed Aug 31, 2023

View reviewed changes

docs/source/en/model_doc/code_llama.md Outdated Show resolved Hide resolved

amyeroberts reviewed Aug 31, 2023

View reviewed changes

ArthurZucker and others added 7 commits September 1, 2023 14:15

adress comments

ce0dc62

nits

85cc916

nits

f94c725

fix docsting

75d17da

Apply suggestions from code review

9dbd4ce

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

Merge branch 'update-llama-code' of https://github.com/arthurzucker/t…

bd437de

…ransformers into update-llama-code

Merge branch 'main' into update-llama-code

911c55e

ArthurZucker requested a review from LysandreJik September 1, 2023 16:55

LysandreJik approved these changes Sep 1, 2023

View reviewed changes

ArthurZucker added 3 commits September 1, 2023 17:30

update

6077218

Merge branch 'update-llama-code' of https://github.com/arthurzucker/t…

40eb18c

…ransformers into update-llama-code

add int for https://huggingface.co/spaces/hf-accelerate/model-memory-…

2dda12f

…usage

ArthurZucker merged commit a4dd53d into huggingface:main Sep 1, 2023
19 checks passed

ArthurZucker deleted the update-llama-code branch September 1, 2023 18:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update-llama-code #25826

Update-llama-code #25826

ArthurZucker commented Aug 29, 2023

HuggingFaceDocBuilderDev commented Aug 29, 2023 •

edited

Loading

ArthurZucker Aug 30, 2023 •

edited

Loading

ArthurZucker Aug 30, 2023

osanseviero left a comment

amyeroberts left a comment •

edited

Loading

amyeroberts Aug 31, 2023

ArthurZucker Sep 1, 2023

amyeroberts Aug 31, 2023

ArthurZucker Aug 31, 2023

LysandreJik Sep 1, 2023

LysandreJik left a comment

LysandreJik Sep 1, 2023

		@@ -316,36 +322,8 @@ def save_vocabulary(self, save_directory: str, filename_prefix: Optional[str] =

		return (out_vocab_file,)

		def build_inputs_with_special_tokens(

Update-llama-code #25826

Update-llama-code #25826

Conversation

ArthurZucker commented Aug 29, 2023

What does this PR do?

HuggingFaceDocBuilderDev commented Aug 29, 2023 • edited Loading

ArthurZucker Aug 30, 2023 • edited Loading

Choose a reason for hiding this comment

ArthurZucker Aug 30, 2023

Choose a reason for hiding this comment

osanseviero left a comment

Choose a reason for hiding this comment

amyeroberts left a comment • edited Loading

Choose a reason for hiding this comment

amyeroberts Aug 31, 2023

Choose a reason for hiding this comment

ArthurZucker Sep 1, 2023

Choose a reason for hiding this comment

amyeroberts Aug 31, 2023

Choose a reason for hiding this comment

ArthurZucker Aug 31, 2023

Choose a reason for hiding this comment

LysandreJik Sep 1, 2023

Choose a reason for hiding this comment

LysandreJik left a comment

Choose a reason for hiding this comment

LysandreJik Sep 1, 2023

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Aug 29, 2023 •

edited

Loading

ArthurZucker Aug 30, 2023 •

edited

Loading

amyeroberts left a comment •

edited

Loading