Support logical functors and refine tensor packing function #33089
Conversation
Thanks for your contribution!
```diff
  */
 template <typename OutT>
 int PackTensorsIntoVector(const framework::ExecutionContext &ctx,
                           std::vector<const framework::Tensor *> *ins,
-                          std::vector<framework::Tensor *> *outs) {
+                          std::vector<framework::Tensor *> *outs,
+                          framework::Tensor *x_ptr = nullptr) {
```
- Naming the parameter `x_ptr` is confusing. Something like `x_for_selectedrows` would be more intuitive. Also explain in the function comment that the input X can be either a LoDTensor or a SelectedRows; when X is a SelectedRows, it first has to be converted into a LoDTensor before taking part in the computation, so the op kernel needs to define an extra temporary tensor and pass it in through the `x_for_selectedrows` parameter.
- In elementwise_mul_op.h, please update `ElementwiseMulKernel` accordingly.
OK, will finish the change in the next commit.
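A sketch of how the renamed parameter and the requested function comment could look (illustrative only; the exact wording was settled in a later commit):

```cpp
/*
 * Packs the inputs and output of an elementwise op into vectors.
 * The input X may be a LoDTensor or a SelectedRows. When X is a
 * SelectedRows, it must first be converted into a LoDTensor; the op
 * kernel defines a temporary tensor for that purpose and passes it in
 * through x_for_selectedrows.
 */
template <typename OutT>
int PackTensorsIntoVector(const framework::ExecutionContext &ctx,
                          std::vector<const framework::Tensor *> *ins,
                          std::vector<framework::Tensor *> *outs,
                          framework::Tensor *x_for_selectedrows = nullptr);
```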
```cpp
auto *y = ctx.Input<framework::LoDTensor>("Y");
auto *z = ctx.Output<framework::LoDTensor>("Out");

if (x_ptr == nullptr || x_var->IsType<framework::LoDTensor>()) {
```
The condition here only needs to be `if (x_var->IsType<framework::LoDTensor>())`; even if an `x_ptr` is passed in, it is of no use in this branch.
It does look superfluous indeed; will fix it in the next commit.
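The simplified branching might then read as follows (a sketch; the comments mark what each branch does):

```cpp
if (x_var->IsType<framework::LoDTensor>()) {
  // X is a plain LoDTensor: pack it directly.
} else if (x_var->IsType<framework::SelectedRows>()) {
  // X is a SelectedRows: pack the converted tensor supplied by the caller.
}
```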
```cpp
z = ctx.Output<framework::LoDTensor>("Out");
ins->emplace_back(x);
x_dims_size = x->dims().size();

```
I don't think the blank line added here looks good...
Will remove it in the next commit.
```cpp
platform::errors::InvalidArgument(
    "For elementwise_op, if X is Sparse, Y must be "
    "scalar. But received the size of Y = %d.",
    y->dims().size()));
```
A PADDLE_ENFORCE check is needed here to verify that `x_ptr` is not null.
Changed as suggested.
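A sketch of such a check using Paddle's `PADDLE_ENFORCE_NOT_NULL` macro (the error message here is illustrative, not the committed text):

```cpp
// Guard the SelectedRows branch: the caller must supply the temporary
// tensor that holds the converted value of X.
PADDLE_ENFORCE_NOT_NULL(
    x_ptr, platform::errors::InvalidArgument(
               "For elementwise_op, if X is SelectedRows, the x_ptr "
               "argument must not be null."));
```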
```cpp
z = ctx.Output<framework::SelectedRows>("Out")->mutable_value();
ins->emplace_back(x_ptr);
x_dims_size = x_ptr->dims().size();

```
Same as above, please don't add a blank line here.
Will remove it in the next commit.
```diff
 if (y != nullptr) {
   ins->emplace_back(y);
   axis = ctx.HasAttr("axis") ? ctx.Attr<int>("axis") : -1;
-  axis = axis == -1 ? std::abs(y->dims().size() - x->dims().size()) : axis;
+  axis = axis == -1 ? std::abs(y->dims().size() - x_dims_size) : axis;
```
Actually, my original review suggestion was only to move this `axis == -1` conversion, just this one line of code, into `LaunchElementwiseCudaKernel`. If `axis == -1` only applies to the binary, broadcast case, then put it into `LaunchBroadcastElementwiseCudaKernel` instead; I feel that would simplify the code a bit?
I had misread the suggestion all along... I previously thought it meant moving the whole axis computation into `LaunchBroadcastElementwiseCudaKernel`. Moving it there is indeed more appropriate, and it makes the code more general. Following the idea in the comment, the final code might end up looking like the snippet below, or some similar form.
```cpp
// Dispatcher sketch: the same-dims fast path keeps its own entry point, and
// the axis == -1 fallback is computed only on the broadcast path.
template <ElementwiseType ET, typename InT, typename OutT, typename Functor>
void LaunchElementwiseCudaKernel(
    const platform::CUDADeviceContext &cuda_ctx,
    const std::vector<const framework::Tensor *> &ins,
    std::vector<framework::Tensor *> *outs, int axis, Functor func) {
  std::vector<int> dims_size;
  bool no_broadcast_flag = true;
  for (auto *in : ins) {
    // Broadcast is needed as soon as any input's shape differs from the
    // first input's shape.
    no_broadcast_flag &= ins[0]->dims() == in->dims();
    dims_size.emplace_back(in->dims().size());
  }
  if (no_broadcast_flag) {
    LaunchSameDimsElementwiseCudaKernel<ET, InT, OutT>(cuda_ctx, ins, outs,
                                                       func);
  } else {
    // axis == -1 means "infer the axis from the rank difference".
    axis = axis == -1
               ? *std::max_element(dims_size.begin(), dims_size.end()) -
                     *std::min_element(dims_size.begin(), dims_size.end())
               : axis;
    LaunchBroadcastElementwiseCudaKernel<ET, InT, OutT>(cuda_ctx, ins, outs,
                                                        axis, func);
  }
}
```
LGTM
PR types
Performance optimization

PR changes
OPs

Describe
Support the logical functors not, and, or, and xor, and refine the tensor packing function.
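For illustration, elementwise logical functors of this kind are usually tiny device-callable structs. A minimal sketch follows; the struct names and the HOSTDEVICE macro mirror Paddle's conventions, but this is not the code committed in this PR:

```cpp
// Minimal sketches of the four logical functors. In Paddle, HOSTDEVICE
// expands to __host__ __device__, so the same functor runs on CPU and GPU.
template <typename T>
struct LogicalNotFunctor {
  HOSTDEVICE bool operator()(const T a) const { return !a; }
};

template <typename T>
struct LogicalAndFunctor {
  HOSTDEVICE bool operator()(const T a, const T b) const { return a && b; }
};

template <typename T>
struct LogicalOrFunctor {
  HOSTDEVICE bool operator()(const T a, const T b) const { return a || b; }
};

template <typename T>
struct LogicalXorFunctor {
  HOSTDEVICE bool operator()(const T a, const T b) const {
    // True when exactly one operand is truthy.
    return (a || b) && !(a && b);
  }
};
```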