
[AutoParallel] convert distensor for eager custom op #59137

Conversation

wanghuancoder
Contributor

PR types

Others

PR changes

Others

Description

For custom operators: if any one input Tensor is a DistTensor, convert all input Tensors to DistTensor.
Pcard-73145


paddle-bot bot commented Nov 20, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@@ -44,7 +44,7 @@ void ShareTensor(PyObject* src, PyObject* dst) {
   }
 }

-paddle::Tensor CastPyArg2Tensor(PyObject* obj, Py_ssize_t arg_pos) {
+paddle::Tensor& CastPyArg2Tensor(PyObject* obj, Py_ssize_t arg_pos) {
Contributor

Why was this changed to return a reference?

Contributor Author

Because the Tensor stored inside the PyObject* needs to be modified in place; if the function returned a copy, the in-place modification would be impossible.
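A minimal sketch of the point above (the types and helpers below are hypothetical stand-ins, not Paddle's actual code): returning by value hands the caller a copy, so mutating it never touches the Tensor held by the PyObject*, while returning a reference lets the caller modify that stored object in place.

#include <cassert>

// Hypothetical holder standing in for the Tensor stored inside a PyObject*.
struct Tensor { int placement = 0; };
struct PyTensorObject { Tensor tensor; };

// Returning by value: the caller only ever sees a copy.
Tensor GetByValue(PyTensorObject* obj) { return obj->tensor; }

// Returning by reference: the caller can mutate the stored Tensor in place.
Tensor& GetByReference(PyTensorObject* obj) { return obj->tensor; }

int main() {
  PyTensorObject holder;

  Tensor copy = GetByValue(&holder);
  copy.placement = 1;                    // modifies the copy only
  assert(holder.tensor.placement == 0);  // stored tensor is unchanged

  Tensor& ref = GetByReference(&holder);
  ref.placement = 1;                     // modifies the stored tensor itself
  assert(holder.tensor.placement == 1);
  return 0;
}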

-      paddle::Tensor tensor =
-          std::move(CastPyArg2Tensor(obj, i + 1));  // NOLINT
-      ctx.EmplaceBackInput(std::move(tensor));
+      paddle::Tensor& tensor = CastPyArg2Tensor(obj, i + 1);  // NOLINT
Contributor

Suggested change
-      paddle::Tensor& tensor = CastPyArg2Tensor(obj, i + 1);  // NOLINT
+      const paddle::Tensor& tensor = CastPyArg2Tensor(obj, i + 1);  // NOLINT

Contributor Author

done, thx!

Comment on lines 587 to 613
  const phi::distributed::ProcessMesh* mesh = nullptr;
  if (InputsContainDistTensor(&mesh, *(ctx.AllMutableInput()))) {
    ctx.AllMutableInput()->clear();
    for (size_t i = 0; i < inputs.size(); ++i) {
      const auto& input = inputs.at(i);
      // Parse op_type first, so that use i + 1
      PyObject* obj = PyTuple_GET_ITEM(args, i + 1);
      // Emplace Py_None from python, this means optional inputs passed to C++,
      // use one un-initialized tensor to indicate both Tensor and
      // vector<Tensor> inputs.
      if (obj == Py_None) {
        VLOG(7) << "Custom operator add input " << input
                << " to CustomOpKernelContext. Add un-initialized tensor "
                   "because the optional input is None";
        ctx.EmplaceBackInput(std::move(paddle::Tensor()));
        continue;
      }
      if (paddle::framework::detail::IsDuplicableVar(input)) {
        std::vector<paddle::Tensor> tensors =
            std::move(CastPyArg2VectorOfTensor(obj, i + 1, mesh));  // NOLINT
        ctx.EmplaceBackInputs(std::move(tensors));
        VLOG(7) << "Custom operator add input " << input
                << " to CustomOpKernelContext. Add vector<Tensor> size = "
                << ctx.InputRangeAt(i).second - ctx.InputRangeAt(i).first;
      } else {
        paddle::Tensor& tensor = CastPyArg2Tensor(obj, i + 1);  // NOLINT
        ConvertAllInputsToDistTensor(mesh, tensor);
Contributor

This code duplicates the earlier input handling quite heavily; could the two be merged, or factored into a single function?
This part also uses Tensor& in many places; I suggest changing them to const Tensor&.

Contributor Author

I thought about it; it cannot be merged or factored into a function.
It cannot be merged because any Tensor in the middle of the passed-in arguments may turn out to be a DistTensor, so all inputs have to be rescanned: a Tensor that is not a DistTensor is converted to one, and a Tensor that already is a DistTensor must have a mesh equal to the others'. The whole pass has to be done again.
It cannot be written as a function because, although the two pieces of code share the same structure, the concrete steps they perform differ. They do not fit into one function; forcing them into one would only make the code harder to read.
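To make the rescan argument concrete, here is a minimal, self-contained sketch of the pattern the author describes (simplified, hypothetical types rather than Paddle's ProcessMesh/Tensor, and not the PR's actual code): once any input turns out to be distributed, every input must be walked again, plain tensors are converted to that mesh, and tensors that are already distributed must sit on an equal mesh.

#include <optional>
#include <stdexcept>
#include <vector>

// Hypothetical stand-ins for phi::distributed::ProcessMesh and paddle::Tensor.
struct Mesh {
  int id = 0;
  bool operator==(const Mesh& other) const { return id == other.id; }
};
struct Tensor {
  std::optional<Mesh> mesh;  // present => this tensor is a "DistTensor"
};

// Pass 1: does any input carry a mesh? If so, remember it.
const Mesh* FindDistMesh(const std::vector<Tensor>& inputs) {
  for (const auto& t : inputs) {
    if (t.mesh) return &*t.mesh;
  }
  return nullptr;
}

// Pass 2: rescan every input. Plain tensors are converted to DistTensors on
// `mesh`; tensors that are already distributed must use an equal mesh.
void ConvertOrCheck(std::vector<Tensor>& inputs, const Mesh& mesh) {
  for (auto& t : inputs) {
    if (!t.mesh) {
      t.mesh = mesh;  // convert dense tensor to DistTensor
    } else if (!(*t.mesh == mesh)) {
      throw std::runtime_error("all DistTensor inputs must share one mesh");
    }
  }
}

int main() {
  // The middle tensor is distributed, so the other two must be promoted.
  std::vector<Tensor> inputs{Tensor{}, Tensor{Mesh{7}}, Tensor{}};
  if (const Mesh* mesh = FindDistMesh(inputs)) {
    ConvertOrCheck(inputs, *mesh);  // the full second pass is unavoidable
  }
  return 0;
}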

Contributor
@jiahy0825 left a comment

LGTM

@wanghuancoder merged commit ce9d2b5 into PaddlePaddle:develop Nov 22, 2023
28 checks passed
SecretXV pushed a commit to SecretXV/Paddle that referenced this pull request Nov 28, 2023