[AutoParallel] Support operators have mixed inputs. #57774
Conversation
Your PR has been submitted successfully. Thank you for your contribution to this open-source project!
    local_t_2 = paddle.to_tensor(np_array, dtype='float16')
elif np_array.dtype == np.int32:
    local_t_1 = paddle.to_tensor(np_array, dtype='int32')
    # local_t_2 = paddle.to_tensor(np_array, dtype='float16')
Should a new int32 local value be added here?
I forgot to delete that comment; it has been reverted now.
x = np.random.random(size=[4, 4]).astype("float32")
y = np.random.random(size=[4, 4]).astype("float32")
local_x, dist_x = self.create_local_and_dist_tensor_pair(x)
local_y, dist_y = self.create_two_local_tensor_pair(y)
local_y, dist_y -> local_y1, local_y2?
Done, thx.
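For context, here is a rough sketch of what these two test helpers might do, written against the current public `paddle.distributed` API. The helper bodies below (mesh, placements, dtype handling) are illustrative assumptions; the actual implementations are not part of this diff.

```python
import numpy as np
import paddle
import paddle.distributed as dist

# Assumed mesh for illustration; the real test may use a different one.
mesh = dist.ProcessMesh([0, 1], dim_names=["x"])

def create_local_and_dist_tensor_pair(np_array):
    # One plain DenseTensor and one replicated DistTensor built from the same
    # data, so an operator can be called with "mixed" inputs.
    local_t = paddle.to_tensor(np_array)
    dist_t = dist.shard_tensor(paddle.to_tensor(np_array), mesh, [dist.Replicate()])
    return local_t, dist_t

def create_two_local_tensor_pair(np_array):
    # Two independent plain DenseTensors; used as the purely local operands.
    local_t_1 = paddle.to_tensor(np_array)
    local_t_2 = paddle.to_tensor(np_array)
    return local_t_1, local_t_2
```

With a pair like this, a binary call such as `paddle.add(local_x, dist_y)` exercises the mixed-input path added in this PR.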
@@ -72,6 +72,13 @@ def FindParsingFunctionFromAttributeType(atype):
    " auto {} = {}(\"{}\", \"{}\", args, {}, {});\n"
)

CONVERT_INPUT_TENSORS_TO_DIST_TENSOR_TEMPLATE = """
const phi::distributed::ProcessMesh* mesh = nullptr;
auto mesh_ptr = &mesh;
Couldn't we just pass `&mesh` directly here, instead of adding an extra assignment line?
Done, thx.
@@ -325,7 +332,9 @@ def GeneratePythonCFunction(self):
    inplace_returns_pos_map = {}
    # Generate Python-C Tensors Parsing Logic
    get_eager_tensor_str = ""
    input_names = "mesh_ptr, "
Could this be moved into the template above? That way the template makes it clear that the mesh is passed in, and `input_names` only contains the API's inputs, which is semantically more accurate.
Done, thx.
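For reference, a sketch of the direction the two suggestions above point to: the mesh pointer is declared inside the template and passed straight to the helpers as `&mesh`, so `input_names` carries only the operator's real inputs. The helper names `InputsContainDistTensor` / `ConvertAllInputsToDistTensor` and the small generator function below are assumptions for illustration, not the exact code merged in this PR.

```python
# Hypothetical simplified template: the mesh lives inside the template itself,
# so no separate mesh_ptr variable and no mesh entry in input_names is needed.
CONVERT_INPUT_TENSORS_TO_DIST_TENSOR_TEMPLATE = """
    const phi::distributed::ProcessMesh* mesh = nullptr;
    if (InputsContainDistTensor(&mesh{input_args})) {{
      ConvertAllInputsToDistTensor(mesh{input_args});
    }}
"""

def gen_mixed_input_conversion(input_names):
    # input_names now holds only the API's inputs, e.g. ["x", "y"].
    input_args = "".join(", " + name for name in input_names)
    return CONVERT_INPUT_TENSORS_TO_DIST_TENSOR_TEMPLATE.format(input_args=input_args)

# Example: gen_mixed_input_conversion(["x", "y"]) emits C++ that checks whether
# x or y is a DistTensor and, if so, converts both inputs to DistTensor.
```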
LGTM
LGTM
* [AutoParallel] Support operators have mixed inputs like DenseTensor and DistTensor.
* Polish code with review comments.
PR types
Others
PR changes
Others
Description
Pcard-73145
Support operators that have mixed inputs such as `DenseTensor` and `DistTensor`. A `DenseTensor` input is converted to a replicated `DistTensor` automatically.

The former implementation lived in the `phi` APIs (PR 57684). However, the inputs there are all `const` references, so converting an input `DenseTensor` to a `DistTensor` is an unsafe operation. Meanwhile, inputs are also modified at the `AutoGrad` level, e.g. converted to contiguous Tensors. As a result, modifying the `phi` input (such as `x_tmp`) at the `AutoGrad` level does not change the real input `x`, while the backward node relies on the real input `x` for its Tensor metas. It is therefore more appropriate to implement this sharding strategy at the `Python-C` level.
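A minimal sketch of the user-visible behavior this enables, written against the public `paddle.distributed` API; the mesh, placements, and the `matmul` call are illustrative assumptions rather than code taken from this PR's tests.

```python
import numpy as np
import paddle
import paddle.distributed as dist

mesh = dist.ProcessMesh([0, 1], dim_names=["x"])

np_x = np.random.random(size=[4, 4]).astype("float32")
np_y = np.random.random(size=[4, 4]).astype("float32")

dense_x = paddle.to_tensor(np_x)                      # plain DenseTensor
dist_y = dist.shard_tensor(paddle.to_tensor(np_y),
                           mesh, [dist.Replicate()])  # DistTensor

# Mixed inputs: dense_x is converted to a replicated DistTensor automatically
# in the Python-C layer before the kernel is dispatched.
out = paddle.matmul(dense_x, dist_y)
print(out.is_dist())  # expected: True
```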