
Fix typo in Llama docstrings #24020

Merged: 3 commits into huggingface:main on Jun 8, 2023

Conversation

@Kh4L (Contributor) commented Jun 5, 2023

What does this PR do?

Fix typos in docs.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).

@amyeroberts (Collaborator) left a comment

Thanks for fixing!

Have you run the doc examples locally to confirm they still work and the generated responses are still the same (with the typo fixed)?

I don't think we can pass `inputs` in directly like this, as `generate` expects a tensor as its first argument and `inputs` is a dictionary.
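
For illustration, a minimal sketch of the point being made, using the OPT checkpoint that comes up later in this thread (the max_length value is just illustrative):

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-350m")
model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")

inputs = tokenizer("Hey, are you conscious? Can you talk to me?", return_tensors="pt")

# inputs is a BatchEncoding (a dict subclass holding input_ids and attention_mask),
# not a tensor, so it cannot be passed positionally:
# model.generate(inputs)  # fails: generate expects a tensor of input ids first
out = model.generate(inputs.input_ids, max_length=30)  # pass the tensor explicitly
out = model.generate(**inputs, max_length=30)          # or unpack the dict as keyword arguments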

@Kh4L (Contributor, Author) commented Jun 6, 2023

@amyeroberts yes, I ran it locally, got the error, and fixed it.

Here are the types in my code:

type(tokenizer)
<class 'transformers.models.llama.tokenization_llama.LlamaTokenizer'>

type(inputs)
<class 'torch.Tensor'>
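
One way `inputs` can come back as a bare torch.Tensor rather than a BatchEncoding is calling tokenizer.encode(...) instead of tokenizer(...); a sketch of that difference follows (the local checkpoint path is hypothetical, and this is not confirmed to be what was run here):

from transformers import LlamaTokenizer

# hypothetical path to a 7B Llama checkpoint converted to the HF format
tokenizer = LlamaTokenizer.from_pretrained("path/to/llama-7b-hf")

# encode returns only the input_ids, as a tensor when return_tensors="pt"
ids = tokenizer.encode("Hey, are you conscious?", return_tensors="pt")
print(type(ids))  # <class 'torch.Tensor'>

# calling the tokenizer itself returns a BatchEncoding (input_ids + attention_mask)
enc = tokenizer("Hey, are you conscious?", return_tensors="pt")
print(type(enc))  # <class 'transformers.tokenization_utils_base.BatchEncoding'>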

@amyeroberts (Collaborator) commented

@Kh4L Out of interest - could you share the checkpoint being used? Could you also run this snippet with the checkpoint and share the output?

from transformers import AutoTokenizer
tokenizer = AutoTokenizer.from_pretrained(MODEL_CHECKPOINT)
prompt = "Hey, are you conscious? Can you talk to me?"
inputs = tokenizer(prompt, return_tensors="pt")
print(inputs)
print(type(inputs))
print(type(tokenizer))

The current changes need to be checked with a standard checkpoint for all of the models affected here. For instance, if I run the snippet with the OPT checkpoint from the example,

MODEL_CHECKPOINT = "facebook/opt-350m"

I get the following output:

{'input_ids': tensor([[    2, 13368,     6,    32,    47, 13316,   116,  2615,    47,  1067,
             7,   162,   116]]), 'attention_mask': tensor([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]])}
<class 'transformers.tokenization_utils_base.BatchEncoding'>
<class 'transformers.models.gpt2.tokenization_gpt2_fast.GPT2TokenizerFast'>

@Kh4L (Contributor, Author) commented Jun 7, 2023

@amyeroberts The checkpoint is the 7B Llama converted to HF, and I get the same output!
Sorry for the confusion: I was using LlamaTokenizer and not AutoTokenizer in my code.

@amyeroberts (Collaborator) left a comment

When changing doc examples or tests, it's important to run them to make sure they still pass / the outputs are as expected.

Llama and Open Llama don't have default checkpoints, so they can't be checked, but the output for the OPT model does change with the spelling fix, so it needs to be updated.

Review comments on src/transformers/models/opt/modeling_opt.py and src/transformers/models/opt/modeling_tf_opt.py (outdated, resolved)
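
For reference, a minimal way to re-run the OPT doc example locally and compare the regenerated text against the docstring (a sketch using the facebook/opt-350m checkpoint from above; max_length is illustrative):

from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "facebook/opt-350m"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

prompt = "Hey, are you conscious? Can you talk to me?"
inputs = tokenizer(prompt, return_tensors="pt")

# Greedy decoding is deterministic, so the printed text can be checked
# against the expected output hard-coded in the docstring.
generate_ids = model.generate(inputs.input_ids, max_length=30)
print(tokenizer.batch_decode(generate_ids, skip_special_tokens=True)[0])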
@Kh4L requested a review from amyeroberts on June 7, 2023 at 14:30
@amyeroberts (Collaborator) left a comment

Thanks for fixing these!

The style checks are currently failing - you'll need to run `make style` and push any changes made by the linter. Once the CI is green, we can merge :)

@Kh4L (Contributor, Author) commented Jun 7, 2023

Thanks for the detailed review!
I am a bit confused, as I can't see the latest commit Kh4L@62ea9f2 in this PR even though I pushed it to my branch https://github.com/Kh4L/pytorch-transformers/tree/fix_conscious_typo 🤔

@HuggingFaceDocBuilderDev commented Jun 8, 2023

The documentation is not available anymore as the PR was closed or merged.

@amyeroberts (Collaborator) commented

@Kh4L GitHub PRs were down for part of yesterday - I think it was just that. I can see the commit now and all tests are passing :)

@amyeroberts amyeroberts merged commit 9322c24 into huggingface:main Jun 8, 2023
novice03 pushed a commit to novice03/transformers that referenced this pull request Jun 23, 2023
* Fix typo in Llama docstrings

Signed-off-by: Serge Panev <spanev@nvidia.com>

* Update

Signed-off-by: Serge Panev <spanev@nvidia.com>

* make style

Signed-off-by: Serge Panev <spanev@nvidia.com>

---------

Signed-off-by: Serge Panev <spanev@nvidia.com>