Fix parallel.do with batch norm #8186
Conversation
paddle/operators/parallel_do_op.cc
Outdated
@@ -248,6 +248,8 @@ class ParallelDoGradOp : public framework::OperatorBase {
       const std::vector<framework::Scope *> &sub_scopes,
       const platform::PlaceList &places) const {
     for (auto &s : Outputs(framework::GradVarName(kParameters))) {
+      VLOG(10) << "Accumulating " << s;
+      if (s == framework::kEmptyVarName) continue;
Do not accumulate @EMPTY@: a gradient name equal to framework::kEmptyVarName means there is no gradient variable to sum, so the accumulation loop should skip it.
    PADDLE_ENFORCE(ctx->HasOutputs(framework::GradVarName(kParameters)));
    ctx->SetOutputsDim(framework::GradVarName(kParameters),
                       ctx->GetInputsDim(kParameters));
    auto p_dims = ctx->GetInputsDim(kParameters);
If parameter gradient is empty, do not infer shape.
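To make the two comments above concrete, here is a small, self-contained C++ sketch of the skip-empty pattern. It deliberately avoids the real Paddle API: `kEmptyVarName`, `DDim`, and the parameter/gradient names below are stand-ins invented for illustration, not the code that was merged.

```cpp
#include <iostream>
#include <string>
#include <vector>

// Stand-ins for the framework pieces referenced in the diff above
// (not the real Paddle definitions).
const std::string kEmptyVarName = "@EMPTY@";
struct DDim { int h, w; };

int main() {
  // Each parameter's dims, paired with its gradient output name. The second
  // entry plays the role of a variable that receives no gradient, so its
  // gradient slot holds the empty marker.
  std::vector<DDim> p_dims = {{128, 64}, {64, 1}, {64, 64}};
  std::vector<std::string> pg_names = {"w@GRAD", kEmptyVarName, "b@GRAD"};

  for (size_t i = 0; i < pg_names.size(); ++i) {
    // Skip @EMPTY@ entries: there is no gradient variable whose shape could
    // be inferred, and there is nothing to accumulate either.
    if (pg_names[i] == kEmptyVarName) continue;
    std::cout << "infer shape of " << pg_names[i] << " as " << p_dims[i].h
              << "x" << p_dims[i].w << "\n";
  }
  return 0;
}
```

Both reviewer comments describe this same idea: skip @EMPTY@ names when accumulating gradients and when inferring parameter-gradient shapes.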
@@ -274,21 +274,20 @@ def get_parameters(self):
         parent_block = self.parent_block()

+        local_inputs = set()
The previous logic could not find parameters that are both used and updated by the same operator (which is how batch norm handles its moving statistics).
For the purpose of this PR, please ignore the NCCL error. Update: it turns out that driver version 199 is not enough; @helinwang is updating it.
… feature/parallel_do_and_batch_norm
Merged by #8249
Related issue #8153