[Profiling] Update elementwise op #6229

chengduoZH · 2017-12-04T15:04:37Z

"We found the performance of Eigen::Tensor::broadcast is not good if the Tensor is not just a scalar, because the Eigen::Tensor::broadcast try to implement a very general broadcast method. However, the most of broadcast operators in the neural network is rowwise or colwise. The implementation of rowwise or colwise is much simpler and faster than a general implementation.
The Eigen::Tensor::broadcast is mainly implmeneted by this function." —— @reyoung

reyoung

Excellent! There are some tiny enhancements and one typo.

reyoung · 2017-12-05T02:42:07Z

paddle/operators/elementwise_op_function.h

+      : ptr_(ptr), i_(0), j_(0), n_(n), post_(post) {}
+
+  MidWiseTransformIterator<T, platform::CPUPlace>& operator++() {
+    i_ = ++j_ / post_ % n_;


Please add parentheses here.

reyoung · 2017-12-05T02:42:53Z

paddle/operators/elementwise_op_function.h

+
+  bool operator!=(
+      const MidWiseTransformIterator<T, platform::CPUPlace>& rhs) const {
+    return (ptr_ + i_) &= &(*rhs);


&= -> !=

reyoung · 2017-12-05T02:43:29Z

paddle/operators/elementwise_op_function.h

+  typedef thrust::iterator_adaptor<
+      RowwiseTransformIterator<T, platform::GPUPlace>, const T*>
+      super_t;
+  __host__ __device__ RowwiseTransformIterator(const T* x, int n)


Please use HOSTDEVICE macro

reyoung · 2017-12-05T02:44:04Z

paddle/operators/elementwise_op_function.h

+ private:
+  unsigned int n_;
+  const T* begin_;
+  __host__ __device__ typename super_t::reference dereference() const {


Please use HOSTDEVICE macro

reyoung · 2017-12-05T02:45:34Z

paddle/operators/elementwise_add_op.h

 template <typename Place, typename T>
 class ElementwiseAddKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
-    ElementwiseCompute<EigenAddFunctor, Place, T>(ctx);
+    using Tensor = framework::Tensor;


Maybe we should implement all elemwise operators based on this method.

Not in hurry, but need an issue to record the following works.

We can do these in next PR.

QiJune · 2017-12-05T02:54:39Z

paddle/operators/elementwise_op_function.h

+template <typename Functor, typename T, typename Place>
+struct TransformFunctor {
+  TransformFunctor(const framework::Tensor* x, const framework::Tensor* y,
+                   framework::Tensor* z, const framework::ExecutionContext& ctx,


Better to use DeviceContext other than ExecutionContext

qingqing01 · 2017-12-05T03:47:57Z

paddle/operators/elementwise_op_function.h

+  T* z_;
+  int64_t nx_;
+  const platform::DeviceContext& ctx_;
+  Functor func_;


The naming rules of data members for struct is different from class.

https://google.github.io/styleguide/cppguide.html#Variable_Names

Done, I have replaced struct with class.

reyoung

Excellent

code refine

fbbfe8b

chengduoZH changed the title ~~[Profiling] Updata elementwise op~~ [Profiling] Update elementwise op Dec 4, 2017

chengduoZH force-pushed the profiling/updata_elementwise_op branch from 2a5fb74 to ff4c825 Compare December 4, 2017 15:05

refine cuda

488908e

chengduoZH force-pushed the profiling/updata_elementwise_op branch 3 times, most recently from 18a50a2 to cc210de Compare December 5, 2017 02:03

code refine

54f0962

chengduoZH force-pushed the profiling/updata_elementwise_op branch from cc210de to 54f0962 Compare December 5, 2017 02:15

chengduoZH requested review from reyoung, qingqing01 and typhoonzero December 5, 2017 02:19

reyoung reviewed Dec 5, 2017

View reviewed changes

QiJune reviewed Dec 5, 2017

View reviewed changes

chengduoZH force-pushed the profiling/updata_elementwise_op branch 3 times, most recently from 830dbaa to 1517889 Compare December 5, 2017 03:47

follow comments

9e244a8

chengduoZH force-pushed the profiling/updata_elementwise_op branch from 1517889 to 9e244a8 Compare December 5, 2017 03:47

qingqing01 reviewed Dec 5, 2017

View reviewed changes

follow comments

37671ac

reyoung approved these changes Dec 5, 2017

View reviewed changes

chengduoZH merged commit 3644446 into PaddlePaddle:develop Dec 5, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Profiling] Update elementwise op #6229

[Profiling] Update elementwise op #6229

chengduoZH commented Dec 4, 2017 •

edited

Loading

reyoung left a comment

reyoung Dec 5, 2017

chengduoZH Dec 5, 2017

reyoung Dec 5, 2017

chengduoZH Dec 5, 2017

reyoung Dec 5, 2017

chengduoZH Dec 5, 2017

reyoung Dec 5, 2017

chengduoZH Dec 5, 2017

reyoung Dec 5, 2017

chengduoZH Dec 5, 2017

QiJune Dec 5, 2017

chengduoZH Dec 5, 2017

qingqing01 Dec 5, 2017

chengduoZH Dec 5, 2017

reyoung left a comment

[Profiling] Update elementwise op #6229

[Profiling] Update elementwise op #6229

Conversation

chengduoZH commented Dec 4, 2017 • edited Loading

reyoung left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reyoung left a comment

Choose a reason for hiding this comment

chengduoZH commented Dec 4, 2017 •

edited

Loading