[oneDNN] Accesses to oneDNN cache optimized for conv2d #33048
Conversation
- First draft implemented - compilable and UT not crashing (still failing)
- Fix to UT cache.vim
Thanks for your contribution!
UpdatePaddingAndDilation(&paddings, &dilations, padding_algorithm,
                         data_dims, strides, ksize);

auto src_tz = paddle::framework::vectorize(in->dims());
Suggested change:
- auto src_tz = paddle::framework::vectorize(in->dims());
+ auto src_tz = framework::vectorize(in->dims());
ok
data_dims, strides, ksize);

auto src_tz = paddle::framework::vectorize(in->dims());
auto weights_tz = paddle::framework::vectorize(filter->dims());
Suggested change:
- auto weights_tz = paddle::framework::vectorize(filter->dims());
+ auto weights_tz = framework::vectorize(filter->dims());
ok
int g = std::max(groups, 1);
platform::GetGroupConvWeightsTz(weights_tz, g);
auto dst_tz = paddle::framework::vectorize(out_grad->dims());
Suggested change:
- auto dst_tz = paddle::framework::vectorize(out_grad->dims());
+ auto dst_tz = framework::vectorize(out_grad->dims());
 * ('any') which lets a primitive (conv backward in this case) choose
 * the memory format preferred for best performance
 */
auto chosen_memory_format = MKLDNNMemoryFormat::any;
Suggested change:
- auto chosen_memory_format = MKLDNNMemoryFormat::any;
+ constexpr auto chosen_memory_format = MKLDNNMemoryFormat::any;
This is a simple assignment, and since chosen_memory_format is not modified later, we can just declare it as const.
platform::GetGroupConvWeightsTz(weights_tz, g);
auto dst_tz = paddle::framework::vectorize(out_grad->dims());

MKLDNNMemoryFormat weights_format =
Why are you setting this value here, when you are always changing the format to "any" in line 332?
good catch!
      conv_attr, fwd_prop_kind, dnnl::algorithm::convolution_direct,
      src_md, weights_md, dst_md, stride_dims, dilations_dims,
      mkldnn_paddings[0], mkldnn_paddings[1]);
    }
  }
}

ConvMKLDNNHandlerT(const framework::ExecutionContext& ctx,
                   const paddle::platform::MKLDNNDeviceContext& dev_ctx,
Suggested change:
- const paddle::platform::MKLDNNDeviceContext& dev_ctx,
+ const platform::MKLDNNDeviceContext& dev_ctx,
ok
LGTM
ConvMKLDNNHandlerT(const framework::ExecutionContext& ctx,
                   const platform::MKLDNNDeviceContext& dev_ctx,
                   platform::Place cpu_place, const Tensor* in,
                   const Tensor* filter, const Tensor* bias,
                   const Tensor* out_grad, Tensor* filter_grad,
                   Tensor* in_x_grad, const std::string& unique_name)
Just for consideration: couldn't we have separate classes for FWD and BWD? After all, we are going to recreate the FWD primitive in the BWD pass. Choosing the right pass by passing appropriate arguments to the same class constructor seems a bit confusing to me and not immediately clear from the user's perspective.
let's discuss this internally first
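For illustration only, a self-contained sketch of the kind of split being suggested; the class and member names below are hypothetical stand-ins, not the actual Paddle/oneDNN types:

#include <memory>
#include <string>
#include <utility>

// Toy stand-ins for the real oneDNN primitive descriptors (hypothetical).
struct FwdPD {};
struct BwdWeightsPD {};

// Forward-only handler: inference callers never see any backward state.
class ConvFwdHandler {
 public:
  explicit ConvFwdHandler(std::string key) : key_(std::move(key)) {}
  std::shared_ptr<FwdPD> AcquireForwardPD() {
    if (!fwd_pd_) fwd_pd_ = std::make_shared<FwdPD>();
    return fwd_pd_;
  }

 protected:
  std::string key_;
  std::shared_ptr<FwdPD> fwd_pd_;
};

// Backward handler reuses the forward part via inheritance, since oneDNN
// backward primitive descriptors are built from a forward "hint".
class ConvBwdHandler : public ConvFwdHandler {
 public:
  explicit ConvBwdHandler(std::string key) : ConvFwdHandler(std::move(key)) {}
  std::shared_ptr<BwdWeightsPD> AcquireBackwardWeightsPD() {
    AcquireForwardPD();  // recreate the FWD PD needed as the BWD hint
    if (!bwd_w_pd_) bwd_w_pd_ = std::make_shared<BwdWeightsPD>();
    return bwd_w_pd_;
  }

 private:
  std::shared_ptr<BwdWeightsPD> bwd_w_pd_;
};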
auto backward_p =
    std::static_pointer_cast<TBackward_params>(dev_ctx_.GetBlob(key_p));
if (backward_p == nullptr) {
  backward_p = std::make_shared<TBackward_params>(*bwd_w_pd_);
Maybe you should also check here whether bwd_w_pd_ is not empty?
ok
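A minimal sketch of the kind of guard being asked for, placed before the dereference. PADDLE_ENFORCE_NOT_NULL and platform::errors are Paddle's usual error-handling facilities, but the exact message and placement here are illustrative, not taken from the PR:

// Illustrative guard inside AcquireBackwardWeightsPrimitive(): fail loudly if
// the backward-weights primitive descriptor was never created.
PADDLE_ENFORCE_NOT_NULL(
    bwd_w_pd_,
    platform::errors::Unavailable(
        "BWD weights primitive descriptor must be created before the "
        "backward-weights primitive is acquired."));
backward_p = std::make_shared<TBackward_params>(*bwd_w_pd_);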
                             framework::Tensor* diff_weights) {
  T* ptr = diff_weights->mutable_data<T>(
      place_, bwd_w_pd_->diff_weights_desc().get_size());
  return this->AcquireMemoryFromPrimitive(bwd_w_pd_->diff_weights_desc(), ptr,
Similar to the above: shouldn't you check whether bwd_w_pd_ has already been created?
ok
// Buffer is allocated by oneDNN to store computation results
std::shared_ptr<mkldnn::memory> AcquireDiffWeightsMemory(void) {
  return this->AcquireMemoryFromPrimitive(bwd_w_pd_->diff_weights_desc(),
As above.
ok
@@ -35,7 +35,8 @@ using user_function = std::function<std::shared_ptr<float>(const float*)>;
 using memory = mkldnn::memory;

 template <typename T, typename TForward,
-          typename TBackward = mkldnn_dummy_primitive>
+          typename TBackward = mkldnn_dummy_primitive,
+          typename TBackward_params = mkldnn_dummy_primitive>
If this is only convolution-specific, wouldn't it be more readable to have it in its own handler class? Moreover, what if we would like to reuse this class for other operations (like recurrent RNN, GRU, LSTM) which may require even more additional parameter types?
It is not only convolution-specific. It applies to all primitives that have trainable weights, e.g. conv, conv_transpose, fully connected, prelu... As for LSTM (when we get to training), I'm not sure; perhaps it would make sense to create a separate class.
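For context, a sketch of how the extra template slot could be filled at the conv2d grad site. The mkldnn::convolution_* types are the real oneDNN C++ API classes; the exact base-class list of ConvMKLDNNHandlerT in this PR may differ:

// Hypothetical instantiation: the conv2d handler supplies the backward-weights
// primitive type through the new TBackward_params parameter.
template <typename T>
class ConvMKLDNNHandlerT
    : public platform::MKLDNNHandlerT<T, mkldnn::convolution_forward,
                                      mkldnn::convolution_backward_data,
                                      mkldnn::convolution_backward_weights> {
  // ...
};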
@@ -72,6 +73,17 @@ class MKLDNNHandlerT {
     return backward_p;
   }

+  std::shared_ptr<TBackward_params> AcquireBackwardWeightsPrimitive() {
+    const std::string key_p = key_ + "@bwd_w_p";
Shouldn't we have those suffixes like "@bwd_w_p" in some form of enum?
Yes, we should. What we actually need is the suffixes as an enum plus a "key -> string" mapping mechanism so they can be printed as part of VLOG.
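A minimal, self-contained sketch of what such an enum plus string mapping could look like; only "@bwd_w_p" appears in this diff, the other suffix names are made up for illustration:

// Hypothetical cache-key suffixes as an enum instead of raw string literals.
enum class KeySuffix { kFwdPD, kBwdWeightsPD, kBwdWeightsPrimitive };

// Single place that maps each suffix to the string appended to the cache key,
// so the same name can also be printed in VLOG messages.
inline const char* ToString(KeySuffix suffix) {
  switch (suffix) {
    case KeySuffix::kFwdPD:
      return "@fwd_pd";
    case KeySuffix::kBwdWeightsPD:
      return "@bwd_w_pd";
    case KeySuffix::kBwdWeightsPrimitive:
      return "@bwd_w_p";
  }
  return "@unknown";
}

// Usage: const std::string key_p = key_ + ToString(KeySuffix::kBwdWeightsPrimitive);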
LGTM
Good job! :)
@luotao1 could you please start your review?
PR types
Function optimization
PR changes
OPs
Describe
In a similar spirit to PR #32922, conv2d BWD can recreate the FWD PD, so we used the AcquireForwardPrimitiveNonBlocking functions. Apart from that, conv2d grad was refactored so these optimizations could be applied.
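For readers unfamiliar with the caching idiom this relies on, below is a simplified, self-contained sketch of the "acquire from the blob cache or create" pattern the handlers follow; the BlobCache type and AcquireOrCreate helper are illustrative stand-ins, not Paddle's actual MKLDNNDeviceContext API:

#include <memory>
#include <string>
#include <unordered_map>
#include <utility>

// Illustrative stand-in for the per-device blob cache.
class BlobCache {
 public:
  std::shared_ptr<void> GetBlob(const std::string& key) const {
    auto it = blobs_.find(key);
    return it == blobs_.end() ? nullptr : it->second;
  }
  void SetBlob(const std::string& key, std::shared_ptr<void> blob) {
    blobs_[key] = std::move(blob);
  }

 private:
  std::unordered_map<std::string, std::shared_ptr<void>> blobs_;
};

// Acquire-or-create: look the primitive up by key and only build it on a miss.
// This mirrors the pattern AcquireBackwardWeightsPrimitive() follows in the diff.
template <typename TPrimitive, typename... Args>
std::shared_ptr<TPrimitive> AcquireOrCreate(BlobCache* cache,
                                            const std::string& key,
                                            Args&&... args) {
  auto p = std::static_pointer_cast<TPrimitive>(cache->GetBlob(key));
  if (p == nullptr) {
    p = std::make_shared<TPrimitive>(std::forward<Args>(args)...);
    cache->SetBlob(key, p);
  }
  return p;
}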