Support configurable input size #3788

eunwoosh · 2024-08-05T05:41:22Z

Summary

This PR includes

adds configurable input size and adaptive input size to OTX.
refactor input size in model code.

Explanation

adds configurable input size feature to OTX

User can configure desired input size after this PR. input_size value in 'data' part of recipe is automatically passed to model unless model is already initialized before it's passed to the engine in API use case.
Also, user can use adaptive input size which finds appropriate input size based on dataset statistics. Also, user can force to use downscale only which doesn't change input size if bigger input size is considered as proper input size.
All interfaces for input size are added to OTXDataModule.
So CLI user can uses it by adding --data.input_size 1000 for configuring input size and --data.adaptive_input_size "auto" for adaptive input size.
API user can use it by setting arguments of OTXDataModule. If model is initialized in engine, input_size will be automatically set but if not, then user should set input_size argument of the model by himself or herself.

refactor input size in model code

Previously, how input_size is manged in model side is different depending on each model.
Now, it's unified. All models get input_size argument and keep it as attribute except zvp model which uses fixed input_size.
Model also initializes own module differently based on input_size if necessary.

How to test

otx train --config ... --data_root ... --data.input_size 1024
otx train --config ... --data_root ... --data.adaptive_input_size "auto | downscale"

Checklist

I have added unit tests to cover my changes.
I have added integration tests to cover my changes.
I have ran e2e tests and there is no issues.
I have added the description of my changes into CHANGELOG in my target branch (e.g., CHANGELOG in develop).
I have updated the documentation in my target branch accordingly (e.g., documentation in develop).
I have linked related issues.

License

I submit my code changes under the same Apache License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below).

# Copyright (C) 2024 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

goodsong81

Nice work. A few points to mention are as follows:

hasatrr() based branch logics might be quickly rusted and tangled. Hope it would be based on formal interfaces in the future.
input_size term might be better to be more consistent in terms of usage

src/otx/algo/classification/huggingface_model.py

src/otx/algo/classification/mobilenet_v3.py

src/otx/algo/detection/rtdetr.py

src/otx/algo/detection/yolox.py

src/otx/cli/cli.py

src/otx/core/data/module.py

src/otx/core/data/utils/utils.py

src/otx/core/model/action_classification.py

src/otx/engine/utils/auto_configurator.py

sooahleex

Looks good to me for h-label head!

goodsong81

LGTM w/ one last comment: we may want to touch the mem cache max image size to make this feature more effective in training
(Maybe in another PR? :p)

eunwoosh · 2024-08-14T01:48:16Z

LGTM w/ one last comment: we may want to touch the mem cache max image size to make this feature more effective in training (Maybe in another PR? :p)

yes, we can consider including that feature in the next release.

goodsong81 · 2024-08-14T02:23:12Z

LGTM w/ one last comment: we may want to touch the mem cache max image size to make this feature more effective in training (Maybe in another PR? :p)

yes, we can consider including that feature in the next release.

FYI, the configured input size could be set to the max memcache image size.

eunwoosh · 2024-08-14T02:36:22Z

@goodsong81 I understand your intention. If it doesn't need huge code change, maybe it can be included in this release. Thanks for comment :)

eunwoosh added 4 commits July 31, 2024 11:10

draft implementation

27df73c

draft implementation2

bb9b66e

check input size constant value

b6a7685

update model part

4c0781f

github-actions bot added TEST Any changes in tests OTX 2.0 labels Aug 5, 2024

draft implementation

bf0c736

eunwoosh force-pushed the configurable_input_size branch from 8113046 to a40db40 Compare August 5, 2024 08:09

eunwoosh added 11 commits August 5, 2024 17:10

update interface

a40db40

implement adaptive input size draft version

ac506ef

handle edge case

b1121f0

add input_size_multiplier and pass it to datamodule in cli

659a9cf

change typehint from sequence to tuple

e329029

align with pre-commit

8474b07

write doc string

28dcd63

implement unit test

01cb629

update unit test

30172dd

implement left unit test

73598ab

Merge branch 'develop' into configurable_input_size

14b7a59

goodsong81 reviewed Aug 9, 2024

View reviewed changes

eunwoosh added 6 commits August 9, 2024 16:04

align with develop branch

4df5674

fix typo

4fcc627

exclude batch and num channel from input size

39b0650

update docstring

aee7600

update unit test

5d6c481

adaptive input size supports not square

ff8ecf9

github-actions bot added the DOC Improvements or additions to documentation label Aug 9, 2024

update changelog

82e41e0

eunwoosh marked this pull request as ready for review August 9, 2024 11:02

eunwoosh requested a review from samet-akcay as a code owner August 9, 2024 11:02

eunwoosh dismissed wonjuleee’s stale review via 255a4e0 August 13, 2024 05:19

update unit test

255a4e0

harimkang previously approved these changes Aug 13, 2024

View reviewed changes

wonjuleee previously approved these changes Aug 13, 2024

View reviewed changes

sungchul2 previously approved these changes Aug 13, 2024

View reviewed changes

eunwoosh dismissed stale reviews from sungchul2, wonjuleee, and harimkang via 900faaf August 13, 2024 09:59

Merge branch 'develop' into configurable_input_size

900faaf

harimkang previously approved these changes Aug 13, 2024

View reviewed changes

sovrasov previously approved these changes Aug 13, 2024

View reviewed changes

eunwoosh dismissed stale reviews from sovrasov and harimkang via 8e6f8f8 August 13, 2024 14:43

eunwoosh added 2 commits August 13, 2024 23:43

update h-label head

8e6f8f8

Merge branch 'develop' into configurable_input_size

4868ba5

sooahleex approved these changes Aug 14, 2024

View reviewed changes

harimkang approved these changes Aug 14, 2024

View reviewed changes

eunwoosh enabled auto-merge August 14, 2024 01:35

goodsong81 approved these changes Aug 14, 2024

View reviewed changes

harimkang added this to the 2.2.0 milestone Aug 14, 2024

eunwoosh added this pull request to the merge queue Aug 14, 2024

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Aug 14, 2024

eunwoosh added this pull request to the merge queue Aug 14, 2024

Merged via the queue into openvinotoolkit:develop with commit 0b5ed3b Aug 14, 2024
18 checks passed

eunwoosh deleted the configurable_input_size branch August 14, 2024 06:24

eunwoosh mentioned this pull request Aug 14, 2024

Feature Request: Support custom and non-square input sizes #3581

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support configurable input size #3788

Support configurable input size #3788

eunwoosh commented Aug 5, 2024 •

edited

Loading

goodsong81 left a comment

sooahleex left a comment

goodsong81 left a comment

eunwoosh commented Aug 14, 2024

goodsong81 commented Aug 14, 2024

eunwoosh commented Aug 14, 2024 •

edited

Loading

Support configurable input size #3788

Support configurable input size #3788

Conversation

eunwoosh commented Aug 5, 2024 • edited Loading

Summary

Explanation

adds configurable input size feature to OTX

refactor input size in model code

How to test

Checklist

License

goodsong81 left a comment

Choose a reason for hiding this comment

sooahleex left a comment

Choose a reason for hiding this comment

goodsong81 left a comment

Choose a reason for hiding this comment

eunwoosh commented Aug 14, 2024

goodsong81 commented Aug 14, 2024

eunwoosh commented Aug 14, 2024 • edited Loading

eunwoosh commented Aug 5, 2024 •

edited

Loading

eunwoosh commented Aug 14, 2024 •

edited

Loading