TF: use the correct config with (...)EncoderDecoder models #18097

Conversation
Force-pushed from 07c1575 to 0dc94b8
Thanks for fixing! As for your question, it depends on whether they are all exact copies that can easily be refactored into one mixin or not.
I believe they are -- going to give it a go afterwards if @ydshieh also agrees :)
I have limited connection at the moment in the mountains, so feel free to merge if you prefer. Regarding the common mixin, good for me. I see there are a few little things to address, like the input names (input_ids, pixel_values, etc.). Thank you for the fix, @gante
@ydshieh can I have a review plz 🙏
Thank you for the fix, @gante. Would you mind rebasing on main and running test_pt_tf_model_equivalence for TF (Vision)EncoderDecoderModel? Thank you.
unpacked_inputs = input_processing(func, self.config, main_input, **fn_args_and_kwargs)

# Encoder Decoder models delegate the application of the configuration options to their inner models.
if "encoder_decoder" in str(self).lower():
Just a nit, no need to feel strongly about it: I would personally prefer to use self.__name__ instead of str, and test against EncoderDecoder. If I understand correctly, str gives the full path, which includes the module name like encoder_decoder or vision_encoder_decoder, right?
I was trying self.__name__ on this line, and it seems it isn't defined in some cases -- e.g. in tests/models/vision_encoder_decoder/test_modeling_tf_vision_encoder_decoder.py::TFViT2GPT2EncoderDecoderModelTest::test_encoder_decoder_model, it throws *** AttributeError: 'TFVisionEncoderDecoderModel' object has no attribute '__name__'

str(self) here includes the full import path for the object, like you wrote, which contains the class name -- '<transformers.models.vision_encoder_decoder.modeling_tf_vision_encoder_decoder.TFVisionEncoderDecoderModel object at 0x7fc7783b8a60>'
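The behavior described above is standard Python, not transformers-specific: an instance has no __name__ attribute (only the class object does), while str() on a plain object falls back to the default repr, which embeds the class's module path. A minimal sketch, using an invented class name:

```python
class FakeVisionEncoderDecoderModel:
    pass


model = FakeVisionEncoderDecoderModel()

# Instances do not carry __name__; the class object does.
assert not hasattr(model, "__name__")
assert type(model).__name__ == "FakeVisionEncoderDecoderModel"

# str(instance) uses the default repr: "<module.ClassName object at 0x...>",
# so it contains both the module path and the class name.
assert "FakeVisionEncoderDecoderModel" in str(model)
```

This is why type(self).__name__ would also have worked as an alternative to str(self), had the refactor been pursued.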
Maybe we can update in the future, but since it is working for now I'm going with it :D
OK, thanks for the info.
if use_cache:
    past_key_values = decoder_outputs[1]
# The starting index of the remaining elements in `decoder_outputs`
start_index = sum([1 if x is not None else 0 for x in (loss, logits, past_key_values)])
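To illustrate the start_index computation in the snippet above: it counts how many of the leading optional outputs are present, which gives the position where the remaining decoder_outputs elements begin. A small standalone sketch with invented values:

```python
# Hypothetical output values: loss is unset, logits and past_key_values are present.
loss = None
logits = [[0.1, 0.9]]
past_key_values = ("cached_layer_0",)

# Count the non-None leading outputs to find where the remainder starts.
start_index = sum([1 if x is not None else 0 for x in (loss, logits, past_key_values)])
assert start_index == 2  # loss contributes 0, the other two contribute 1 each
```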
Thanks for the improvement / cleanup!
decoder_attention_mask = decoder_attention_mask[:, :-1]
encoder_model, decoder_model = self.get_encoder_decoder_model(config, decoder_config)
enc_dec_model = EncoderDecoderModel(encoder=encoder_model, decoder=decoder_model)
enc_dec_model.config.output_attentions = True  # model config -> won't work
nice addition!
Force-pushed from 0dc94b8 to 250d5fd
@ydshieh rebased with main and reran tests -- all working 👍
Force-pushed from 250d5fd to f59bbb7
What does this PR do?

Fixes #18071

Modifies unpack_inputs to ignore the config file for (...)EncoderDecoder models, mimicking the behavior in PT. If we don't ignore it, then unset options will get set with the config's default (False for most of them), causing the inner models to ignore their own config files.

A config-related test was also added to the EncoderDecoder models. I then noticed that other (...)EncoderDecoder tests have copy/pasted their own EncoderDecoderMixin, so I've left the other classes for a follow-up PR with the following question: should a common EncoderDecoderMixin be defined and shared across (...)EncoderDecoder tests, or should I add a similar test to all other classes individually?
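The delegation logic the PR describes can be sketched in isolation. This is a hypothetical, heavily simplified model of the behavior -- the names (resolve_option, Config, FakeEncoderDecoderModel) are invented and this is not the real unpack_inputs from transformers. Unset call-time options normally fall back to the model config's default; for encoder-decoder wrappers, the fix instead leaves them unset so the inner encoder/decoder configs take effect:

```python
class Config:
    def __init__(self, output_attentions=False):
        self.output_attentions = output_attentions


class PlainModel:
    def __init__(self, config):
        self.config = config


class FakeEncoderDecoderModel(PlainModel):
    pass


# Simulate a real module path such as
# transformers.models.encoder_decoder.modeling_tf_encoder_decoder,
# which is what the `"encoder_decoder" in str(self)` check relies on.
FakeEncoderDecoderModel.__module__ = "models.encoder_decoder.modeling_tf"


def resolve_option(model, value, attr):
    """Resolve a boolean option from the call site or, failing that, the config."""
    if value is not None:
        return value  # explicitly passed at call time: always wins
    # Encoder-decoder wrappers delegate config options to their inner models,
    # so leave the option unset instead of applying the outer config default.
    if "encoder_decoder" in str(model).lower():
        return None
    return getattr(model.config, attr)


plain = PlainModel(Config(output_attentions=False))
wrapper = FakeEncoderDecoderModel(Config(output_attentions=False))

assert resolve_option(plain, None, "output_attentions") is False   # outer config default
assert resolve_option(wrapper, None, "output_attentions") is None  # inner configs decide
assert resolve_option(wrapper, True, "output_attentions") is True  # explicit value wins
```

Under this sketch, the test added in the PR (setting enc_dec_model.config.output_attentions = True and observing it has no effect) corresponds to the wrapper branch returning None regardless of the outer config.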