Add qnn batch_matmul operator #8401
Conversation
elvin-n
commented
Jul 4, 2021
- Added support for a different out type for x86 batch_matmul (a usage sketch follows below)
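As a quick illustration of how the new operator would be used from Python, here is a minimal sketch; the exact argument names and defaults are assumptions modeled on the existing qnn.dense API, so check the final docstring:

```python
from tvm import relay

# Hypothetical usage of the operator added in this PR; shapes and
# quantization parameters are made up for illustration.
x = relay.var("x", shape=(1, 16, 32), dtype="uint8")
y = relay.var("y", shape=(1, 16, 32), dtype="int8")

out = relay.qnn.op.batch_matmul(
    x,
    y,
    x_zero_point=relay.const(0, "int32"),
    y_zero_point=relay.const(0, "int32"),
    x_scale=relay.const(0.0625, "float32"),
    y_scale=relay.const(0.0625, "float32"),
    out_dtype="int32",  # the "different out type" the x86 schedule now supports
)
func = relay.Function([x, y], out)
```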
These changes look excellent, I really like how you've minimized code duplication in the shape functions. I'm not quite sure why CI is failing; the error message says that Relay expects the output shape to be [16, 32] instead of [16, 16] like it's supposed to be. I couldn't find any change in the shape functions that would make Relay think that, but there must be something subtle.
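For reference, nn.batch_matmul at this point treats x as (B, M, K) and y as (B, N, K) with the second operand implicitly transposed, so the output should be (B, M, N). A small numpy sketch of that convention (the concrete 16/32 sizes are only my guess at the failing test's shapes):

```python
import numpy as np

# batch_matmul convention: x is (B, M, K), y is (B, N, K); y is transposed
# internally, so the result is (B, M, N).
x = np.random.randint(0, 256, size=(1, 16, 32)).astype("uint8")
y = np.random.randint(0, 256, size=(1, 16, 32)).astype("uint8")

out = np.matmul(x.astype("int32"), y.astype("int32").transpose(0, 2, 1))
# A shape function that mistakenly returned K instead of N would report (1, 16, 32).
assert out.shape == (1, 16, 16)
```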
Parameters
----------
x : tvm.relay.Expr
x, y
fixed
if yzero_point_zero == True:
    y_zero_point = 0
else:
    y_zero_point = -1
Should test on more non-zero zero points.
@masahi Just curious: would changing -1 to another value be enough, or do you propose adding other tests?
Currently there are the following test cases:
- x zp = 0, y zp = 0
- x zp = -1, y zp = 0
- x zp = 0, y zp = -1
- x zp = -1, y zp = -1
which cover all flows in QnnBatchMatmulCanonicalize.
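For context on why the zero-point combinations matter, here is a minimal numpy sketch (not the actual canonicalization code) of the expansion that QnnBatchMatmulCanonicalize effectively performs; each zero point that equals 0 makes its correction terms vanish, which is what the cases above toggle:

```python
import numpy as np

def qnn_batch_matmul_reference(x_q, y_q, x_zp, y_zp):
    """Expanded integer batch_matmul: x_q is (B, M, K), y_q is (B, N, K).

    Equivalent to matmul(x_q - x_zp, (y_q - y_zp)^T), written as the four
    terms the canonicalization lowers to; zero zero-points make the
    corresponding correction terms vanish.
    """
    x32 = x_q.astype("int32")
    y32 = y_q.astype("int32")
    k = x_q.shape[-1]
    term1 = np.matmul(x32, y32.transpose(0, 2, 1))                    # x_q . y_q^T
    term2 = x_zp * y32.sum(axis=2, keepdims=True).transpose(0, 2, 1)  # x_zp * sum_k y_q
    term3 = y_zp * x32.sum(axis=2, keepdims=True)                     # y_zp * sum_k x_q
    term4 = k * x_zp * y_zp                                           # constant correction
    return term1 - term2 - term3 + term4

# Sanity check against the direct subtract-then-multiply formulation.
x_q = np.random.randint(0, 256, size=(1, 4, 8)).astype("uint8")
y_q = np.random.randint(0, 256, size=(1, 5, 8)).astype("uint8")
x_zp, y_zp = 123, 123
direct = np.matmul(x_q.astype("int32") - x_zp,
                   (y_q.astype("int32") - y_zp).transpose(0, 2, 1))
assert np.array_equal(qnn_batch_matmul_reference(x_q, y_q, x_zp, y_zp), direct)
```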
I think we should test on larger non-zero zero points, to verify the accuracy.
modified zp to 123
@elvin-n Thanks. This is going to be useful for quantized transformers. cc @anijain2305
* Add qnn batch_matmul operator - add support for a different out type for x86 batch_matmul
* Fix code style
* Add out_dtype to generic batch_matmul
* Restore fix in batch_matmul for dynamic shapes
* Fix documentation for qnn.batch_matmul
* Remove debug code
* Modify zero point for qnn batch_matmul test