Add feed support for ParallelExecutor #9637
Conversation
@@ -135,7 +144,9 @@ def bottleneck_block(input, num_filters, stride, cardinality, reduction_ratio):
     return fluid.layers.elementwise_add(x=short, y=scale, act='relu')


-def SE_ResNeXt152Small(batch_size=2):
+def SE_ResNeXt152Small(batch_size=2, use_feed=False):
SE_ResNeXt152 can be replaced with SE_ResNeXt50; the example is here. SE_ResNeXt50 consumes less memory than SE_ResNeXt152 and is also faster.
This change is not from this PR; let's leave it for a follow-up PR.
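For context, here is a minimal sketch of what the new `use_feed` flag is meant to enable. The network, layer sizes, and input names below are illustrative and not taken from this PR: when `use_feed` is true, the model reads from data layers that the caller fills through the feed argument; the reader-based branch is omitted.

```python
import paddle.fluid as fluid


def simple_fc_net(use_feed=False):
    # Illustrative model builder: with use_feed=True the inputs are data
    # layers that are filled via the feed passed to ParallelExecutor.run();
    # the reader-based input path is not shown here.
    if not use_feed:
        raise NotImplementedError("reader-based input is not shown in this sketch")
    img = fluid.layers.data(name='image', shape=[784], dtype='float32')
    label = fluid.layers.data(name='label', shape=[1], dtype='int64')
    hidden = fluid.layers.fc(input=img, size=200, act='relu')
    prediction = fluid.layers.fc(input=hidden, size=10, act='softmax')
    loss = fluid.layers.cross_entropy(input=prediction, label=label)
    return fluid.layers.mean(loss)
```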
            or numpy array.
        :return: fetched value list.
        """
        feed_tensor_dict = {}
Please check the type of feed_dict.
Done
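A sketch of the kind of type check requested above. The helper name is hypothetical, and it assumes only numpy.ndarray and LoDTensor are acceptable feed values:

```python
import numpy as np
import paddle.fluid.core as core


def _as_lod_tensor(feed_value, place):
    # Hypothetical helper: pass an existing LoDTensor through, wrap a numpy
    # array into a LoDTensor on `place`, and reject anything else.
    if isinstance(feed_value, core.LoDTensor):
        return feed_value
    if isinstance(feed_value, np.ndarray):
        tensor = core.LoDTensor()
        tensor.set(feed_value, place)
        return tensor
    raise TypeError("feed value must be a numpy.ndarray or LoDTensor, got %s" %
                    type(feed_value))
```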
        feed_tensor = feed_dict[feed_name]
        if not isinstance(feed_tensor, core.LoDTensor):
            feed_tensor = core.LoDTensor()
            feed_tensor.set(feed_dict[feed_name], self._act_places[0])
There are two ways of feeding data:
- Transfer all the tensors to GPU(0) first, then transfer the sub-tensors to the other GPUs by P2P.
- Keep all the tensors on the CPU side, then transfer the sub-tensors to the GPUs by CPU->GPU copies.
Which is better?
We can adopt the first one; parallel_do also uses this approach.
Yes, I believe this is the current solution: set() places the tensor on GPU(0), and SplitLoDTensor transfers the sub-tensors to each GPU.
I think this is OK. How about you, @qingqing01?
Excellent!
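A conceptual sketch of the split step agreed on above, in numpy only. The real work is done by the SplitLoDTensor op in C++; an even split along the batch dimension is an assumption here:

```python
import numpy as np


def split_batch(batch, num_places):
    # Conceptually what happens after the whole batch lands on the first
    # device: it is sliced along dim 0 so each device receives one shard.
    # Assumes the batch size is divisible by the number of devices.
    return np.split(batch, num_places, axis=0)


shards = split_batch(np.zeros((8, 3, 224, 224), dtype='float32'), 4)
assert all(shard.shape[0] == 2 for shard in shards)
```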
TODO: Need to work with the reader when both the reader and
feed are enabled. Normally, feed replaces the reader input
or a variable produced by upstream ops.
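For reference, a hedged sketch of how feeding might look from the user side once this lands. The program construction, `loss`, and the data-layer names are placeholders, and the exact constructor and run() signatures are assumed from this PR's diff, not from released documentation:

```python
import numpy as np
import paddle.fluid as fluid

# Placeholder setup: assume `loss` was built from data layers named
# 'image' and 'label', and the startup program has already been run.
pe = fluid.ParallelExecutor(use_cuda=True, loss_name=loss.name)

img = np.random.random((32, 784)).astype('float32')
label = np.random.randint(0, 10, size=(32, 1)).astype('int64')

# The fed numpy arrays should override any reader-produced input for
# the corresponding variables.
loss_value = pe.run(fetch_list=[loss.name],
                    feed_dict={'image': img, 'label': label})
```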