[OneDNN] Fc elementwise add fusion #58276
base: develop
Conversation
Your PR has been submitted successfully. Thank you for your contribution to this open source project!
Force-pushed from 1a16786 to a6016d5.
Force-pushed from 88d8b8b to 436e6df.
Sorry to inform you that 436e6df's CIs passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.
auto residual_data_md = dnnl::memory::desc(
    {MB, OC}, dnnl::memory::data_type::f32, dnnl::memory::format_tag::ab);
Recommend getting the memory descriptor from the residual input tensor instead of assuming its shape here.
For the inner product primitive, the dst is always NC (as mentioned in the doc). That's why we cannot use the residual memory descriptor directly; we have to make sure the residual md shape is NC.
When would the residual not be NC? BTW, the data type can also be bf16, right?
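For context on the exchange above, here is a minimal sketch (illustrative helper name, not the PR's exact code) of building an NC-shaped residual descriptor and attaching it as src1 of the binary-add post-op; the data type argument would be bf16 instead of f32 for bf16 kernels:

#include "dnnl.hpp"

// The inner product primitive always produces a 2-D N x C destination,
// so the residual operand of the binary post-op is described with the
// same {MB, OC} shape rather than reusing the residual tensor's own md.
dnnl::primitive_attr MakeResidualAddAttr(int64_t MB, int64_t OC,
                                         dnnl::memory::data_type dt) {
  dnnl::memory::desc residual_md(
      {MB, OC}, dt, dnnl::memory::format_tag::ab);
  dnnl::post_ops post_operations;
  post_operations.append_binary(dnnl::algorithm::binary_add, residual_md);
  dnnl::primitive_attr attr;
  attr.set_post_ops(post_operations);
  return attr;
}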
@@ -506,6 +623,10 @@ class FCMKLDNNKernel : public framework::OpKernel<T_in> {
    ip_cache->src_mem = *src_memory_p;
    ip_cache->weights_mem = *weights_memory_p;
    ip_cache->dst_mem = *dst_memory_p;
    if (residual_data && residual_data_memory_p) {
      ip_cache->residual_data = *residual_data;
Why do we need to cache residual_data?
# 'Scale_in_eltwise': self.residual_scale,
# 'fuse_residual_connection': True
Is it expected that these attributes are commented out?
@LLee233 Please help with a review, thanks :)
out->dims(),
residual_param->dims(),
phi::errors::InvalidArgument(
    "Output and elementwise parameter need to have the "
After switching to the binary_add post-op, do we still need to force the residual data and dst to have the same dims? I think binary add should support broadcasting.
Force-pushed from 8950d76 to 300015a.
By the way, should we still keep the name "fc_eltwise_add" now that it has become "binary_add" (with an extra input)?
// For Inner Product primitives, the destination is always N * C
auto residual_data = ctx.Input<phi::DenseTensor>("ResidualData");
auto residual_data_md =
    dnnl::memory::desc({MB, OC},
Does residual_data_md need a fixed shape here? Since binary_add supports broadcasting, maybe {1, 1} or {MB, 1} would also work?
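A sketch of the broadcast variant suggested above, assuming the kernel is allowed to rely on the binary post-op's broadcast rules (the helper name is hypothetical):

#include "dnnl.hpp"

// src1 of a binary-add post-op may broadcast over the destination, so a
// per-row residual could be described as {MB, 1} (or even {1, 1}) and
// still be added to the {MB, OC} inner-product output.
dnnl::primitive_attr MakeBroadcastAddAttr(int64_t MB) {
  dnnl::memory::desc residual_md(
      {MB, 1}, dnnl::memory::data_type::f32, dnnl::memory::format_tag::ab);
  dnnl::post_ops post_operations;
  post_operations.append_binary(dnnl::algorithm::binary_add, residual_md);
  dnnl::primitive_attr attr;
  attr.set_post_ops(post_operations);
  return attr;
}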
Force-pushed from c1da50a to f70bfbc.
@XieYunshen, hi, could you please help approve the TIMEOUT property settings?
@XiaoguangHu01, hi, could you help approve the CI check on 'the usage of const_cast'? Thanks.
Sorry to inform you that f70bfbc's CIs passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.
PR types
New features
PR changes
Others
Description
Paddle does not support in-place computation at the moment, so fc_elementwise_add is implemented with fc + a binary_add post-op. It is also a re-implementation of the previous PR #55504, which directly deleted the pass to solve the accuracy problem.
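For illustration, a hedged sketch of the fc + binary_add idea described above, using generic oneDNN v3-style API calls rather than the actual FCMKLDNNKernel code (the function name and argument passing are illustrative):

#include "dnnl.hpp"

// The residual tensor is added through a binary post-op attached to the
// inner product primitive, so no in-place sum on the output is needed.
void FcWithResidual(dnnl::engine& eng, dnnl::stream& strm,
                    dnnl::memory& src, dnnl::memory& weights,
                    dnnl::memory& bias, dnnl::memory& residual,
                    dnnl::memory& dst) {
  dnnl::post_ops post_operations;
  post_operations.append_binary(dnnl::algorithm::binary_add,
                                residual.get_desc());
  dnnl::primitive_attr attr;
  attr.set_post_ops(post_operations);

  auto pd = dnnl::inner_product_forward::primitive_desc(
      eng, dnnl::prop_kind::forward_inference, src.get_desc(),
      weights.get_desc(), bias.get_desc(), dst.get_desc(), attr);

  dnnl::inner_product_forward(pd).execute(
      strm,
      {{DNNL_ARG_SRC, src},
       {DNNL_ARG_WEIGHTS, weights},
       {DNNL_ARG_BIAS, bias},
       {DNNL_ARG_DST, dst},
       // src1 of post-op #0 carries the residual (ResidualData) tensor.
       {DNNL_ARG_ATTR_MULTIPLE_POST_OP(0) | DNNL_ARG_SRC_1, residual}});
}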