[pten] add concat pten kernel #38955

MingMingShangTian · 2022-01-14T09:27:43Z

PR types

Others

PR changes

Others

Describe

add concat pten kernel

paddle-bot-old · 2022-01-14T09:27:47Z

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

chenwhql · 2022-01-18T12:14:06Z

paddle/fluid/operators/math/concat_and_split.cc

@@ -13,6 +13,8 @@ See the License for the specific language governing permissions and
 limitations under the License. */

 #include "paddle/fluid/operators/math/concat_and_split.h"
+
+#include "paddle/pten/kernels/cpu/concat_and_split.h"


I suggest migrating concat_and_split.cc over directly, since we are about to move it anyway. This can be done in the next PR.

chenwhql · 2022-01-18T12:15:45Z

paddle/pten/infermeta/multiary.cc

+#include "paddle/pten/kernels/funcs/concat_funcs.h"
+namespace pten {
+
+DenseTensorMeta ConcatInferMeta(const std::vector<DenseTensorMeta>& x_meta,


Later this will be changed so that the return value is passed as an output pointer parameter instead, consistent with the kernels.
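
To make the two signature styles concrete, here is a minimal sketch of an InferMeta function that returns its result next to one that writes through an output pointer. `SketchMeta`, `ConcatInferMetaByValue`, and `ConcatInferMetaByPointer` are hypothetical names invented for this illustration; they are not the pten types or the signature that was eventually adopted.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a tensor meta; only the shape is modeled.
struct SketchMeta {
  std::vector<std::int64_t> dims;
};

// Style used in this PR: the inferred meta is the return value.
SketchMeta ConcatInferMetaByValue(const std::vector<SketchMeta>& x_meta,
                                  std::size_t axis) {
  assert(!x_meta.empty());
  assert(axis < x_meta[0].dims.size());
  SketchMeta out = x_meta[0];
  // Sum the extents along the concat axis; other dims are taken from input 0.
  for (std::size_t i = 1; i < x_meta.size(); ++i) {
    assert(axis < x_meta[i].dims.size());
    out.dims[axis] += x_meta[i].dims[axis];
  }
  return out;
}

// Style proposed in the review: the output is a pointer parameter,
// mirroring how kernels receive their output tensors.
void ConcatInferMetaByPointer(const std::vector<SketchMeta>& x_meta,
                              std::size_t axis,
                              SketchMeta* out) {
  assert(out != nullptr);
  *out = ConcatInferMetaByValue(x_meta, axis);
}
```

With inputs of shape [2, 3] and [4, 3] and axis 0, both variants produce the shape [6, 3]; only the way the result reaches the caller differs.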

chenwhql · 2022-01-18T12:16:20Z

paddle/pten/infermeta/multiary.cc

+
+DenseTensorMeta ConcatInferMeta(const std::vector<DenseTensorMeta>& x_meta,
+                                const Scalar& axis_scalar,
+                                bool is_runtime) {


is_runtime will later be wrapped in a struct.
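
A rough sketch of what wrapping the flag in a struct could look like. `SketchMetaConfig` and `SketchConcatAxisDim` are hypothetical names, and the handling of unknown dimensions assumes the common convention that a negative extent marks a dimension not yet known when the graph is built; neither is taken from this PR.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical config struct; the name and field are illustrative only.
struct SketchMetaConfig {
  bool is_runtime;
};

// Returns the output extent along the concat axis.
// At build time (is_runtime == false) a negative extent is treated as
// "unknown" and propagates to the result; at runtime all extents are
// expected to be known.
std::int64_t SketchConcatAxisDim(const std::vector<std::int64_t>& axis_dims,
                                 const SketchMetaConfig& config) {
  assert(!axis_dims.empty());
  std::int64_t total = 0;
  for (std::int64_t d : axis_dims) {
    if (!config.is_runtime && d < 0) {
      return -1;  // unknown input extent, so the output extent is unknown
    }
    assert(d >= 0);
    total += d;
  }
  return total;
}
```

Passing a struct rather than a bare `bool` means further options can be added later without changing every InferMeta signature.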

chenwhql · 2022-01-18T12:16:52Z

paddle/pten/infermeta/multiary.h

+#include "paddle/pten/core/tensor_meta.h"
+namespace pten {
+
+// TODO(chentianyu03) use std::vector<DenseTensor> as InferMeta inputs


A MetaTensor concept will be added later to serve as the input to InferMeta.

OK, I'll adjust this in a follow-up.

chenwhql · 2022-01-18T12:19:57Z

paddle/pten/kernels/gpu/concat_and_split.h

+#include "paddle/fluid/framework/mixed_vector.h"
+#include "paddle/fluid/memory/malloc.h"
+#include "paddle/fluid/operators/math/concat_and_split.h"
+#include "paddle/fluid/platform/cuda_graph_with_memory_pool.h"


[TODO] We need to sort out which other components under platform this still depends on and which of them we need to migrate in advance.

chenwhql · 2022-01-18T12:21:31Z

paddle/pten/kernels/gpu/concat_and_split.h

+#include <vector>
+#include "gflags/gflags.h"
+#include "paddle/fluid/framework/mixed_vector.h"
+#include "paddle/fluid/memory/malloc.h"


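Depending on malloc here is a fairly heavy dependency. Could you check whether we have an alternative way to write this?

Will optimize in a follow-up.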

chenwhql · 2022-01-18T12:21:52Z

paddle/pten/kernels/gpu/concat_and_split.h

+    tmp_dev_ins_data = paddle::memory::Alloc(context, in_num * sizeof(T*));
+    auto* restored = paddle::platform::RestoreHostMemIfCapturingCUDAGraph(
+        inputs_data, in_num);
+    paddle::memory::Copy(context.GetPlace(),


Could copy_kernel be used here?

copy_kernel operates on tensors, whereas this code works with raw pointer addresses.

chenwhql · 2022-01-18T12:22:41Z

paddle/pten/kernels/gpu/concat_kernel.cu

+
+#include "paddle/pten/kernels/concat_kernel.h"
+
+#include "paddle/fluid/framework/lod_tensor.h"


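Why is lod_tensor still needed? Please confirm that this header is actually necessary.

lod_tensor.h is included here because helper functions such as AppendLoD are used.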

chenwhql · 2022-01-18T12:22:53Z

paddle/pten/tests/api/test_concat_api.cc

@@ -0,0 +1,86 @@
+/* Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.


chenwhql · 2022-01-18T12:24:24Z

python/paddle/utils/code_gen/api_gen.py

+      std::vector<pten::DenseTensor> pt_tensors;
+
+      for(auto & t : tensors) {
+          pt_tensors.push_back(*std::dynamic_pointer_cast<pten::DenseTensor>(t.impl()));


Avoid dynamic_cast here; use an is_dense_tensor check followed by a static cast.

Discussed offline; keeping it as is for now.
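
For reference, the pattern suggested in the review (an explicit type check followed by a static cast) can be sketched as below. The classes `SketchTensorBase` and `SketchDenseTensor`, the `is_dense()` method, and `CollectDense` are hypothetical stand-ins written for this illustration; they are not the Paddle API.

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Hypothetical base class standing in for a tensor implementation.
class SketchTensorBase {
 public:
  virtual ~SketchTensorBase() = default;
  virtual bool is_dense() const { return false; }
};

// Hypothetical dense tensor type.
class SketchDenseTensor : public SketchTensorBase {
 public:
  explicit SketchDenseTensor(int numel) : numel_(numel) {}
  bool is_dense() const override { return true; }
  int numel() const { return numel_; }

 private:
  int numel_;
};

// Collects the dense tensors from a list of implementations.
// The virtual is_dense() check replaces the RTTI lookup that
// dynamic_pointer_cast performs, so a static cast is safe afterwards.
std::vector<std::shared_ptr<SketchDenseTensor>> CollectDense(
    const std::vector<std::shared_ptr<SketchTensorBase>>& impls) {
  std::vector<std::shared_ptr<SketchDenseTensor>> dense;
  dense.reserve(impls.size());
  for (const auto& impl : impls) {
    if (impl && impl->is_dense()) {
      dense.push_back(std::static_pointer_cast<SketchDenseTensor>(impl));
    }
  }
  return dense;
}
```

Unlike dereferencing the result of a failed `std::dynamic_pointer_cast`, which yields a null pointer, this sketch makes the handling of non-dense inputs explicit; here they are simply skipped, which is a design choice of the example.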

YuanRisheng · 2022-01-18T12:48:27Z

paddle/pten/kernels/cpu/concat_kernel.cc

+        auto in_lod = paddle::framework::ConvertToLengthBasedLoD(x[i].lod());
+        paddle::framework::AppendLoD(out_lod, in_lod);


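This kernel file pulls in quite a lot of code from fluid. I suggest evaluating how hard the migration would be and, where possible, moving the helper functions it depends on under pten.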
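
To illustrate the kind of logic the two helpers in the diff perform, the following is a simplified, standalone sketch: converting an offset-based LoD to lengths, then appending those lengths to an output LoD. `SketchLoD`, `ToLengthBased`, and `AppendLengths` are hypothetical names for this example, and the code is an approximation of the idea rather than the fluid implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical alias: one vector of offsets (or lengths) per LoD level.
using SketchLoD = std::vector<std::vector<std::size_t>>;

// Converts offsets such as {0, 2, 5} into lengths such as {2, 3}.
SketchLoD ToLengthBased(const SketchLoD& offset_lod) {
  SketchLoD length_lod;
  length_lod.reserve(offset_lod.size());
  for (const auto& offsets : offset_lod) {
    std::vector<std::size_t> lengths;
    for (std::size_t i = 1; i < offsets.size(); ++i) {
      lengths.push_back(offsets[i] - offsets[i - 1]);
    }
    length_lod.push_back(lengths);
  }
  return length_lod;
}

// Appends length-based entries to an offset-based LoD, continuing from
// the last offset already stored at each level.
void AppendLengths(SketchLoD* offset_lod, const SketchLoD& length_lod) {
  assert(offset_lod != nullptr);
  if (offset_lod->empty()) {
    // Start every level at offset 0.
    offset_lod->assign(length_lod.size(), std::vector<std::size_t>{0});
  }
  assert(offset_lod->size() == length_lod.size());
  for (std::size_t lvl = 0; lvl < length_lod.size(); ++lvl) {
    auto& offsets = (*offset_lod)[lvl];
    assert(!offsets.empty());
    for (std::size_t len : length_lod[lvl]) {
      offsets.push_back(offsets.back() + len);
    }
  }
}
```

Calling `AppendLengths(&out, ToLengthBased(x_lod))` once per input, in order, builds the offsets for the concatenated output, which mirrors the loop shown in the diff above.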

XiaoguangHu01

LGTM

add concat pten kernel

54cb700

MingMingShangTian added 14 commits January 17, 2022 03:08

add lod for concat_kernel

48a1f2c

Merge branch 'develop' into concat_kernel_latest

8e7592d

fix conflict with develop branch

94c240d

fix conflict with develop branch

7877f2a

add needed header file for concat_kernel

a0343c1

replace with new pten concat kernel

3a5ff3c

merge develop branch and fix conflict

79b2291

add signatrue and api.yaml for concat kernel

f2432bb

fix ROCM build error

d118664

add concat unit test

fe0e504

fix ROCM build error

55259e9

fix concat windows build error

27a7543

MakePtenDenseTensor support uninitialized tensor to fix trt_cascade_r…

2fee4e3

…cnn_test test

Merge branch 'develop' into concat_kernel_latest

f06374c

chenwhql reviewed Jan 18, 2022

YuanRisheng reviewed Jan 18, 2022

MingMingShangTian added 2 commits January 19, 2022 07:04

add lod_utils in pten direction

72a79da

fix concat_op xpu build failed

9e88d11

chenwhql previously approved these changes Jan 20, 2022

YuanRisheng previously approved these changes Jan 20, 2022

MingMingShangTian added 2 commits January 20, 2022 02:52

Merge branch 'develop' into concat_kernel_latest

98e3ef3

fix merge develop conflict

70c7d9b

MingMingShangTian dismissed stale reviews from YuanRisheng and chenwhql via 70c7d9b January 20, 2022 03:35

chenwhql approved these changes Jan 21, 2022

XiaoguangHu01 approved these changes Jan 21, 2022

Shixiaowei02 merged commit 06803c2 into PaddlePaddle:develop Jan 21, 2022
