[Hexagon] Enable depthwise conv2d NHWC with an HWIO kernel layout #13414

farshidsp · 2022-11-17T07:06:50Z

This PR adds support for depthwise_conv2d_NHWC with an HWIO kernel layout.

tvm-bot · 2022-11-17T07:06:53Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @Icemist, @ibsidorenko, @mehrdadh _{See #10317 for details}

_{Generated by tvm-bot}

Lunderberg

I like the feature, but I'm not quite following how the permutation specified results in a HWIO kernel layout.

Lunderberg · 2022-11-28T21:34:33Z

python/tvm/topi/nn/depthwise_conv2d.py

-                * Filter[
-                    di, dj, idxdiv(c, channel_multiplier), idxmod(c, channel_multiplier)
-                ].astype(out_dtype)
+                * Filter.__getitem__(


The explicit __getitem__ isn't required. Filter[x,y,z] is syntactic sugar for Filter[(x,y,z)]. If you already have a tuple, then Filter[tuple( ... )] can be called.

Oh I see, I have changed this as your recommendation.

Lunderberg · 2022-11-28T21:46:44Z

python/tvm/topi/nn/depthwise_conv2d.py

+        kernel_permutation_to = [0, 1] + list(range(2, dim + 2))
+    elif kernel_layout == "HWIO":
+        filter_height, filter_width, channel_multiplier, filter_channel = Filter.shape
+        kernel_permutation_to = [dim + 1, dim] + list(range(dim))


I'm not seeing how this permutation generates HWIO. This defines kernel_permutation_to as [3, 2, 0, 1], so kernel_permutation_from is [2, 3, 1, 0]. With the usage below, that would permute from [di, dj, c//channel_multiplier, c%channel_multiplier] to [c//channel_multiplier, c%channel_multiplier, dj, di], which would be OIWH.

Should this be list(range(dim)) + [dim + 1, dim] instead?

@Lunderberg You are right, my bad. Not sure why I thought this would result in HWIO. Thanks so much for catching this bug.

Lunderberg · 2022-11-28T21:46:46Z

python/tvm/topi/nn/depthwise_conv2d.py

+        filter_height, filter_width, channel_multiplier, filter_channel = Filter.shape
+        kernel_permutation_to = [dim + 1, dim] + list(range(dim))
+
+    kernel_permutation_from = np.argsort(kernel_permutation_to)


Is there a benefit to defining in terms of kernel_permutation_to instead of directly defining kernel_permutation_from?

I tried to follow this as much as possible so in the future we can add more features if needed. If you think this is not the best way I could change it.

Ah, I see. It looks like that implementation is trying to be more clever, and to identify the permutation by inspecting the string. In this case, since we're only supporting two explicitly enabled shapes, I'd lean for defining a kernel_permutation directly for each one.

if kernel_layout == 'HWOI': kernel_permutaiton = [0,1,2,3] elif kernel_layout == 'HWIO': kernel_permutaiton = [0,1,3,2] else: raise ValueError(f'Unsupported kernel layout: {kernel_layout}')

That said, if there are many locations where the same kernel permutation definitions are used, it may be useful to pull it out into a common utility.

Changed the way of permutation from your recommendation. @Lunderberg

TejashShah · 2022-12-05T18:57:13Z

@farshidsp Is kernel layout or filter layout a better word to indicate the change in layout?

farshidsp · 2022-12-05T19:09:44Z

@farshidsp Is kernel layout or filter layout a better word to indicate the change in layout?

In other topi files, we have used kernel_permutation to indicate the change in the kernel layout and that's why I used the same term.

Lunderberg

Thank you for making the changes, and LGTM!

…ache#13414) Enable depthwise conv2d NHWC with HWIO kernel layout. The default kernel layout is HWOI, matched to previous behavior.

enable depthwise conv2d NHWC with HWIO kernel layout

c621db9

farshidsp marked this pull request as ready for review November 17, 2022 07:08

lint fix

fc2b365

Lunderberg reviewed Nov 28, 2022

View reviewed changes

farshidsp added 3 commits November 30, 2022 17:34

bug fix

6cdb2d0

fix

e582efb

fix

ce70659

Merge branch 'main' into hexagon/depthwise_HWIO

dfff52d

Lunderberg approved these changes Dec 5, 2022

View reviewed changes

farshidsp added 5 commits December 5, 2022 15:05

fix

e34374b

have the default kernel layout to HWOI

0bf433f

lint

de68e67

Merge branch 'main' into hexagon/depthwise_HWIO

4239414

arm-cpu fix

5155ebd

Lunderberg merged commit 12311dc into apache:main Dec 13, 2022

ysh329 mentioned this pull request Apr 17, 2023

[Release] v0.12.0 Release Candidate Notes #14645

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Hexagon] Enable depthwise conv2d NHWC with an HWIO kernel layout #13414

[Hexagon] Enable depthwise conv2d NHWC with an HWIO kernel layout #13414

farshidsp commented Nov 17, 2022

tvm-bot commented Nov 17, 2022 •

edited

Loading

Lunderberg left a comment

Lunderberg Nov 28, 2022

farshidsp Nov 30, 2022

Lunderberg Nov 28, 2022

farshidsp Nov 30, 2022

Lunderberg Nov 28, 2022

farshidsp Nov 30, 2022

Lunderberg Dec 2, 2022

Lunderberg Dec 2, 2022

farshidsp Dec 5, 2022

TejashShah commented Dec 5, 2022

farshidsp commented Dec 5, 2022

Lunderberg left a comment

[Hexagon] Enable depthwise conv2d NHWC with an HWIO kernel layout #13414

[Hexagon] Enable depthwise conv2d NHWC with an HWIO kernel layout #13414

Conversation

farshidsp commented Nov 17, 2022

tvm-bot commented Nov 17, 2022 • edited Loading

Lunderberg left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

TejashShah commented Dec 5, 2022

farshidsp commented Dec 5, 2022

Lunderberg left a comment

Choose a reason for hiding this comment

tvm-bot commented Nov 17, 2022 •

edited

Loading