Fused RNN operators do not support `add` grad_req with MKL-DNN #16578

xziya · 2019-10-22T02:22:56Z

Currently, we have not integrated the `add` grad_req routine into FusedRNNCell with MKL-DNN fusion. We would highly appreciate it if anyone could describe an application scenario that uses `add` during training.

FYI, see #568 and #725. When weights are updated every batch and the gradient buffers are cleared between batches, `add` and `write` produce the same results.
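To make the `add` vs. `write` point concrete, here is a minimal sketch in plain Python. It is a toy model, not MXNet code: the `backward` and `train` helpers, the list-based buffers, and the learning rate are all assumptions made for illustration. It shows that the two modes give identical weights when an update happens after every batch and the buffer is reset afterwards, and that they only differ when gradients are accumulated over several batches before an update.

```python
# Toy model of the two grad_req modes. Plain Python lists stand in for
# gradient arrays; none of these helpers are part of the MXNet API.

def backward(grad_buf, new_grad, grad_req):
    """Store new_grad into grad_buf according to grad_req."""
    if grad_req == "write":
        grad_buf[:] = new_grad              # overwrite the previous gradient
    elif grad_req == "add":
        for i, g in enumerate(new_grad):
            grad_buf[i] += g                # accumulate onto the previous gradient
    else:
        raise ValueError("unsupported grad_req: %s" % grad_req)


def train(batches, grad_req, lr=0.1):
    """Run one SGD update per batch and return the final weights."""
    weight = [1.0, -2.0]
    grad = [0.0, 0.0]
    for batch_grad in batches:
        backward(grad, batch_grad, grad_req)
        weight = [w - lr * g for w, g in zip(weight, grad)]
        if grad_req == "add":
            # With 'add' the buffer must be cleared by hand after each update.
            grad[:] = [0.0] * len(grad)
    return weight


batches = [[0.5, 1.0], [-0.25, 0.75], [1.5, -1.0]]

# Updating every batch: both modes end with the same weights.
w_write = train(batches, "write")
w_add = train(batches, "add")

# Accumulating two batches before an update: only 'add' keeps the sum.
acc = [0.0, 0.0]
backward(acc, batches[0], "add")
backward(acc, batches[1], "add")
accumulated = list(acc)
```

The second half is the kind of scenario where `add` matters: summing gradients over several small batches before a single weight update.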

ddavydenko · 2019-10-28T18:41:20Z

@mxnet-label-bot add [MKLDNN]

ddavydenko · 2019-10-28T18:42:22Z

@TaoLv, I suggest you get this issue verified once [Upgrade MKL-DNN dependency to v1.0 #16555] is merged.

xziya · 2019-10-31T09:58:35Z

> @TaoLv, I suggest you get this issue verified once [Upgrade MKL-DNN dependency to v1.0 #16555] is merged.

Thanks for your suggestion. The MKL-DNN RNN operators do have this issue: the program terminates when it encounters `add`. I have not come across any cases that use `add` for RNN training, so it would be highly appreciated if you could provide some information.
Currently, we need to deliver the gradients from MKL-DNN space to MXNet native space. Supporting `add` there requires further design to guarantee performance and accuracy. We would prefer to have #16555 merged first and then implement the `add` operation.
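As a rough illustration of what delivering gradients from the backend space to the native space involves under each grad_req, here is a plain-Python sketch. It is only one plausible shape for the logic and is not taken from #16555 or any later fix: `fused_rnn_backward_stub` and `commit_gradient` are hypothetical names, and lists stand in for the real buffers. The idea is that the backend always overwrites its own scratch buffer, and a separate step decides whether the native buffer is overwritten or accumulated into.

```python
# Hypothetical sketch: the backend computes into a scratch buffer, and a
# separate commit step applies the grad_req semantics to the native buffer.

def fused_rnn_backward_stub(out_grad):
    """Stand-in for a fused backend backward pass (not a real kernel).

    It returns a freshly written scratch buffer, the way a backend
    primitive would overwrite its own output memory.
    """
    return [2.0 * g for g in out_grad]


def commit_gradient(native_grad, backend_grad, grad_req):
    """Copy or accumulate backend_grad into native_grad, in place."""
    if grad_req == "null":
        return                               # gradient not requested
    if grad_req == "write":
        native_grad[:] = backend_grad        # plain copy out of backend space
    elif grad_req == "add":
        for i, g in enumerate(backend_grad):
            native_grad[i] += g              # keep what was already there
    else:
        raise ValueError("unsupported grad_req: %s" % grad_req)


# The native buffer already holds a gradient from an earlier backward pass.
native = [1.0, 1.0]
commit_gradient(native, fused_rnn_backward_stub([0.5, 0.25]), "add")
after_add = list(native)

commit_gradient(native, fused_rnn_backward_stub([0.5, 0.25]), "write")
after_write = list(native)
```

The extra scratch buffer and the second pass over the data in the `add` branch are where the performance concern comes from; a real implementation would also have to handle the layout difference between the two spaces.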

pengzhao-intel · 2019-11-01T01:28:21Z

@zixuanweeei is this issue resolved by the MKL-DNN upgrade PR?

xziya · 2019-11-01T01:36:37Z

> @zixuanweeei is this issue resolved by the MKL-DNN upgrade PR?

This issue still exists in the upgrade PR. It gives an error message when `add` is used.

pengzhao-intel · 2019-11-01T01:39:26Z

OK, is there a workaround for users, and what's the plan for the next step?

xziya · 2019-11-01T01:51:19Z

> OK, is there a workaround for the user and what's the plan for the next step?

It would be appreciated if users could report how they use `add`, but we can't require everyone to do so. So I will put it on my list. I plan to fix it within one week.

TaoLv · 2020-01-02T05:30:42Z

Should be fixed via #17075. Feel free to re-open if the problem is still there. @zixuanweeei

xziya mentioned this issue Oct 22, 2019: Upgrade MKL-DNN dependency to v1.0 #16555 (merged)

lanking520 added the MKLDNN label Oct 28, 2019

pengzhao-intel added the Feature request label Nov 20, 2019

xziya mentioned this issue Dec 15, 2019: [MKLDNN] mkldnn RNN operator enhancement #17075 (merged)

TaoLv closed this as completed Jan 2, 2020
