
Fix CI for VLMs #35690

Merged
merged 5 commits into from
Jan 20, 2025

Conversation

zucchini-nlp (Member) commented on Jan 14, 2025:

What does this PR do?

As per title, main changes are:

  • Removed some tests that are no longer needed, as they tested the old merging behavior in VLMs
  • Removed the logit-level checks, as those break a lot because of precision errors when we update torch, for example. Testing token-level generation should be enough (see the sketch at the end of this description)
  • Other tests actually needed their expected_text updated, so I updated them
  • Since it was a tiny change, propagated the Emu3 checkpoint name update to the docs (can move it to a new PR to make it cleaner)
  • Mllama tests are failing because of the gated repo (meta-llama/Llama-3.2-11B-Vision), cc @ydshieh
  • Emu3 also needs a bit more GPU memory to run (around 16GB); that is the minimum possible value even when we get a tiny image as input. So I don't know if we should just remove the batched slow tests, cc @ydshieh

Note: the Emu3/Aria flex-attn tests will keep failing and need a separate PR to fix the way attention is dispatched in some VLMs. Apparently it never worked for Emu3-like models.
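To illustrate the token-level generation check kept in place of the logit-level asserts, here is a minimal sketch; the checkpoint name, prompt format, dummy image, and EXPECTED_TEXT are placeholders rather than values from the actual tests:

```python
import torch
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

# Hypothetical checkpoint; the real slow tests use model-specific checkpoints.
model_id = "org/some-vlm-checkpoint"
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id, torch_dtype=torch.float16, device_map="auto")

image = Image.new("RGB", (224, 224))             # dummy image standing in for the test fixture
prompt = "<image>\nWhat is shown in the image?"  # placeholder prompt format
inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)

# Token-level check: greedy-decode and compare the text, instead of asserting on raw
# logits with a tight atol, which breaks whenever torch/CUDA numerics shift slightly.
generate_ids = model.generate(**inputs, max_new_tokens=20, do_sample=False)
decoded = processor.batch_decode(generate_ids, skip_special_tokens=True)[0]

EXPECTED_TEXT = "..."  # placeholder for the expected_text kept in the real test
assert decoded == EXPECTED_TEXT
```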

HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp requested a review from ydshieh on January 14, 2025 14:16
ydshieh (Collaborator) commented on Jan 14, 2025:

> Mllama tests are failing because of the gated repo (meta-llama/Llama-3.2-11B-Vision)

Asked the infra team. It can't be downloaded in the EU region ...

> Emu3 also needs a bit more GPU memory to run (around 16GB); that is the minimum possible value even when we get a tiny image as input. So I don't know if we should just remove the batched slow tests

Which (full) test name? @Cyrilvallez told me we might need to introduce a @need_a10_gpu decorator if we don't have another way.

Comment on lines +892 to +896:

            [[0.9148, -1.4148, 3.8040], [3.3443, 1.9478, 0.2080], [1.6604, 2.8184, -0.3618]]
        ).to(torch_device)

        self.assertTrue(
-           torch.allclose(outputs.vision_model_output.last_hidden_state[0, :3, :3], expected_slice, atol=1e-4)
+           torch.allclose(outputs.vision_model_output.last_hidden_state[0, :3, :3], expected_slice, atol=1e-1)
Collaborator:
Somehow I feel these changes are quite large, and I don't really know what happened in the history of commits ...
But first question: are you checking this on the T4 runner?

Member Author:

Yep, running everything on T4. I believe the user who added the test didn't trigger the slow tests with our runners, hence the difference.

The test fails even when I go back to the commit where it was added.

Collaborator:

Hmmm, I am the one working on Kosmos2 and I ran the test like hundreds of times 😆.
Maybe torch version changes matter here. Anyway, I can take a look at this specific model and make sure.

Member Author:

Oh, this particular test was added by a community user after the model release. But yeah, double-checking would not hurt.

zucchini-nlp (Member Author) commented:

> Asked the infra team. It can't be downloaded in the EU region ...

😢

> Which (full) test name? @Cyrilvallez told me we might need to introduce a @need_a10_gpu decorator if we don't have another way.

Cool, the decorator would definitely help a lot! The tests are test_model_generation_batched and test_model_generation_multi_image.

ArthurZucker (Collaborator) left a comment:

LGTM apart from the tests that are a bit of an edge case, even if fixed!

    @@ -481,49 +480,6 @@ def test_batched_generation(self):
            outputs = processor.batch_decode(generate_ids, skip_special_tokens=True, clean_up_tokenization_spaces=False)
            self.assertEqual(outputs, EXPECTED_OUTPUT)

    -   @slow
    -   @require_bitsandbytes
    -   def test_llava_index_error_bug(self):
Collaborator:

not sure why we are removing this one?

Member Author:

The original bug was related to extending the attention mask after merging the inputs the old way, and the bug cannot occur anymore since we removed the old-style merging.

Cyrilvallez (Member) commented:

See the decorator @require_torch_large_gpu if needed! Just merged it
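As a minimal sketch of how the new decorator could be applied to the memory-hungry Emu3 tests, assuming it is exposed from transformers.testing_utils like the other require_* helpers (the class name and test body below are placeholders, not the actual test):

```python
from transformers.testing_utils import require_torch_large_gpu, slow


class Emu3IntegrationTest:  # placeholder class standing in for the real integration test class
    @slow
    @require_torch_large_gpu  # skip on runners whose GPU cannot fit the ~16GB these tests need
    def test_model_generation_batched(self):
        # Placeholder body: the real slow test builds batched image+text inputs,
        # calls model.generate(...) and compares the decoded text against expected_text.
        ...
```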

zucchini-nlp merged commit 8571bb1 into huggingface:main on Jan 20, 2025
16 checks passed
bursteratom pushed a commit to bursteratom/transformers that referenced this pull request Jan 31, 2025
* fix some easy test

* more tests

* remove logit check here also

* add require_torch_large_gpu in Emu3
elvircrn pushed a commit to elvircrn/transformers that referenced this pull request Feb 13, 2025
* fix some easy test

* more tests

* remove logit check here also

* add require_torch_large_gpu in Emu3