Add LlavaImageProcessor #33191

NielsRogge · 2024-08-29T08:31:34Z

What does this PR do?

Fixes #33175. It adds the option to pad an image before applying the same preprocessing as CLIPImageProcessor based on the original implementation. This allows people to match the logits with the original implementation.

For now I decided to set do_pad to False by default as otherwise it would be a breaking change.

To do:

update padding method to take numpy as input and produce numpy as output
add equivalence test
add tests

HuggingFaceDocBuilderDev · 2024-08-29T08:51:54Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/transformers/models/llava/image_processing_llava.py

zucchini-nlp

Thanks, looks good to me! Wondering if we need to add a small tip section in the docs, to make users aware of padding (in case they want to enable 100% match with original impl)?

src/transformers/models/llava/image_processing_llava.py

amyeroberts

Thanks for adding!

Main comment is about proper testing

src/transformers/pytorch_utils.py

src/transformers/models/llava/image_processing_llava.py

amyeroberts · 2024-09-02T12:06:19Z

tests/models/llava/test_image_processing_llava.py

+        self.assertEqual(image_processor.crop_size, {"height": 84, "width": 84})
+
+    # Ignore copy
+    def test_padding(self):


This needs to be more thoroughly tested namely - it needs to test:

background colour is properly set when background_color is a non-default int, or a non-default tuple

the method works as expected for both channels first and channels last inputs

Hi @zucchini-nlp could you take this up?

oke, no prob, I will come back on this tomorrow

src/transformers/models/llava/image_processing_llava.py

…cessor

hljjjmssyh · 2025-01-20T03:04:45Z

Is there any update for this feature? It seems very important for aligning implementations between Hugging Face and the official ones.

…cessor

qubvel

Thanks, looks good to me, just a few comments

src/transformers/models/llava/image_processing_llava.py

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

* First draft * Add equivalence test * Update docstrings * Add tests * Use numpy * Fix tests * Improve variable names * Improve docstring * Add link * Remove script * Add copied from * Address comment * Add note in docs * Add docstring, data format * Improve test * Add test * update * Update src/transformers/models/llava/image_processing_llava.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * Update src/transformers/models/llava/image_processing_llava.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * loop once only --------- Co-authored-by: raushan <raushan@huggingface.co> Co-authored-by: Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz> Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

First draft

2afe096

NielsRogge added 3 commits August 29, 2024 11:07

Add equivalence test

29175dc

Update docstrings

183c2e8

Add tests

d4d38f2

NielsRogge commented Aug 30, 2024

View reviewed changes

src/transformers/models/llava/image_processing_llava.py Outdated Show resolved Hide resolved

NielsRogge added 7 commits August 31, 2024 10:22

Use numpy

b4d527e

Fix tests

17a5e13

Improve variable names

e207ad7

Improve docstring

8691b34

Add link

142e1b5

Remove script

0adae27

Add copied from

2b77160

NielsRogge requested review from zucchini-nlp and amyeroberts September 2, 2024 08:36

zucchini-nlp approved these changes Sep 2, 2024

View reviewed changes

src/transformers/models/llava/image_processing_llava.py Outdated Show resolved Hide resolved

Address comment

e6cccc6

amyeroberts reviewed Sep 2, 2024

View reviewed changes

NielsRogge added 5 commits September 2, 2024 14:26

Add note in docs

11ff374

Merge remote-tracking branch 'upstream/main' into add_llava_image_pro…

2709116

…cessor

Add docstring, data format

38a00f8

Improve test

9842cab

Add test

614b295

zucchini-nlp added 2 commits January 21, 2025 10:24

Merge remote-tracking branch 'upstream/main' into add_llava_image_pro…

9bb9c8e

…cessor

update

c59e2fc

zucchini-nlp requested a review from qubvel January 21, 2025 10:00

qubvel reviewed Jan 21, 2025

View reviewed changes

src/transformers/models/llava/image_processing_llava.py Outdated Show resolved Hide resolved

src/transformers/models/llava/image_processing_llava.py Outdated Show resolved Hide resolved

src/transformers/models/llava/image_processing_llava.py Outdated Show resolved Hide resolved

zucchini-nlp and others added 2 commits January 21, 2025 11:38

Update src/transformers/models/llava/image_processing_llava.py

e0d526b

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

Update src/transformers/models/llava/image_processing_llava.py

3b27cb8

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

qubvel added Vision Multimodal Processing labels Jan 21, 2025

loop once only

2fc6494

qubvel approved these changes Jan 21, 2025

View reviewed changes

zucchini-nlp merged commit 78f5ee0 into huggingface:main Jan 21, 2025
25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add LlavaImageProcessor #33191

Add LlavaImageProcessor #33191

NielsRogge commented Aug 29, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 29, 2024

zucchini-nlp left a comment

amyeroberts left a comment

amyeroberts Sep 2, 2024

NielsRogge Jan 20, 2025

zucchini-nlp Jan 20, 2025

hljjjmssyh commented Jan 20, 2025

qubvel left a comment

Add LlavaImageProcessor #33191

Add LlavaImageProcessor #33191

Conversation

NielsRogge commented Aug 29, 2024 • edited Loading

What does this PR do?

HuggingFaceDocBuilderDev commented Aug 29, 2024

zucchini-nlp left a comment

Choose a reason for hiding this comment

amyeroberts left a comment

Choose a reason for hiding this comment

amyeroberts Sep 2, 2024

Choose a reason for hiding this comment

NielsRogge Jan 20, 2025

Choose a reason for hiding this comment

zucchini-nlp Jan 20, 2025

Choose a reason for hiding this comment

hljjjmssyh commented Jan 20, 2025

qubvel left a comment

Choose a reason for hiding this comment

NielsRogge commented Aug 29, 2024 •

edited

Loading