Feature/is nan #7068

reyoung · 2017-12-27T07:14:14Z

This is part of #7092.

Give the DeviceContext APIs conventional names.

…dle into feature/enhance_dev_ctx_pool

QiJune · 2017-12-28T06:29:24Z

paddle/framework/tensor_util.cc

+  void operator()() const {
+    auto t = EigenVector<T>::Flatten(tensor_);
+    auto o = EigenScalar<bool>::From(*out_);
+    o.device(*ctx_.eigen_device()) = predicate_(t).any();


We could add a reference comment pointing to Eigen's any() to make this easier to understand:
https://eigen.tuxfamily.org/dox/classEigen_1_1DenseBase.html#abfbf4cb72dd577e62fbe035b1c53e695
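
For readers unfamiliar with the reduction: `any()` collapses an element-wise boolean expression into a single flag that is true when at least one element is true. A rough standard-library analogue is sketched below. It assumes a plain `std::vector<float>` in place of the flattened tensor, and the names `AnyOf`, `IsNan`, and `IsInf` are made up for illustration; they are not Paddle or Eigen API.

```cpp
#include <algorithm>
#include <cmath>
#include <limits>
#include <vector>

// Hypothetical stand-in for `predicate_(t).any()`: apply an element-wise
// predicate and OR the results together.
template <typename Predicate>
bool AnyOf(const std::vector<float>& values, Predicate predicate) {
  return std::any_of(values.begin(), values.end(), predicate);
}

// Element-wise predicates, mirroring the NaN and Inf checks in this PR.
inline bool IsNan(float x) { return std::isnan(x); }
inline bool IsInf(float x) { return std::isinf(x); }
```

With these helpers, `AnyOf(values, IsNan)` plays the role of a NaN check over the whole buffer, and an empty buffer yields false.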

QiJune · 2017-12-28T06:30:10Z

paddle/framework/tensor_util.cc

+  template <typename T>
+  auto operator()(const T& eigen_vec) const
+      -> decltype(std::declval<T>().isnan()) {
+    return eigen_vec.isnan();


Please add a reference comment here as well:
https://eigen.tuxfamily.org/dox/classEigen_1_1ArrayBase.html#aab10b156cb69206461728782dd01c37a
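
The trailing `decltype` return type in the hunk above lets the functor return whatever the argument's `isnan()` member yields, so it works for any expression type that provides that member. A minimal sketch of the same pattern follows, using a hypothetical `ToyArray` type instead of a real Eigen expression; `ToyArray` and `IsNanFunctor` are illustrative names only.

```cpp
#include <cmath>
#include <utility>
#include <vector>

// Hypothetical stand-in for an Eigen array expression. It only mimics the
// shape of the API: isnan() returns an element-wise mask.
struct ToyArray {
  std::vector<float> values;

  std::vector<bool> isnan() const {
    std::vector<bool> mask;
    mask.reserve(values.size());
    for (float v : values) mask.push_back(std::isnan(v));
    return mask;
  }
};

// Same pattern as the functor in the diff: the return type is deduced from
// T::isnan(), so no concrete expression type has to be spelled out.
struct IsNanFunctor {
  template <typename T>
  auto operator()(const T& array) const
      -> decltype(std::declval<T>().isnan()) {
    return array.isnan();
  }
};
```

Calling `IsNanFunctor{}(array)` on a `ToyArray` returns its element-wise mask; with a real Eigen expression the result would instead be a lazy expression that `any()` can reduce.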

QiJune · 2017-12-28T06:36:07Z

paddle/framework/tensor_util.cc

+    tmp.Resize({1});
+    tmp.mutable_data<bool>(cpu);
+    platform::DeviceContextPool::Instance().Get(gpu)->Wait();
+    CopyFrom(out, cpu, &tmp);


If we copy between CPU and GPU, we need to pass a DeviceContext.

reyoung · 2017-12-28

However, the device context is a global variable, so we can get the DeviceContext from its place.

QiJune · 2017-12-28

I mean that the CopyFrom interface is:

Paddle/paddle/framework/tensor_util.h, lines 33 to 34 in 1831176:

inline void CopyFrom(const Tensor& src, const platform::Place& dst_place,
                     const platform::DeviceContext& ctx, Tensor* dst) {
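
The two positions in this thread can coexist: an explicit-context overload shaped like the interface quoted above, plus a convenience overload that fetches the context from a global pool by place. The sketch below illustrates that idea with heavily simplified, hypothetical stand-ins (`Place` as an enum, tensors as `std::vector<float>`). It is not the Paddle implementation, and which place's context should be used for a cross-device copy is an assumption made here for brevity.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <vector>

// Hypothetical, heavily simplified stand-ins for the Paddle types.
enum class Place { kCPU, kGPU };

struct DeviceContext {
  explicit DeviceContext(Place p) : place(p) {}
  Place place;
};

// A process-wide pool: one context per place, looked up on demand.
class DeviceContextPool {
 public:
  static DeviceContextPool& Instance() {
    static DeviceContextPool pool;
    return pool;
  }

  DeviceContext* Get(Place place) {
    auto it = contexts_.find(place);
    assert(it != contexts_.end());
    return it->second.get();
  }

 private:
  DeviceContextPool() {
    contexts_[Place::kCPU].reset(new DeviceContext(Place::kCPU));
    contexts_[Place::kGPU].reset(new DeviceContext(Place::kGPU));
  }
  std::map<Place, std::unique_ptr<DeviceContext>> contexts_;
};

// Explicit-context form, shaped like the interface quoted above.
inline void CopyFrom(const std::vector<float>& src, Place dst_place,
                     const DeviceContext& ctx, std::vector<float>* dst) {
  (void)dst_place;  // a real implementation would dispatch on the places
  (void)ctx;        // and would enqueue the copy on this context
  *dst = src;
}

// Context-free form: fetch the context from the global pool by place.
// Using the destination place here is an assumption of this sketch.
inline void CopyFrom(const std::vector<float>& src, Place dst_place,
                     std::vector<float>* dst) {
  const DeviceContext& ctx = *DeviceContextPool::Instance().Get(dst_place);
  CopyFrom(src, dst_place, ctx, dst);
}
```

The explicit form keeps the dependency visible at the call site, while the shorter form relies on the pool being globally reachable.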

QiJune · 2017-12-28T06:36:35Z

paddle/framework/tensor_util_test.cc

@@ -230,5 +231,28 @@ TEST(CopyToVector, Tensor) {
 #endif
 }

+TEST(IsNAN, CPU) {


Please add GPU unit tests at the same time.

QiJune · 2017-12-28T06:40:22Z

paddle/platform/device_context.h

@@ -52,6 +52,14 @@ class CPUDeviceContext : public DeviceContext {
  std::unique_ptr<Eigen::DefaultDevice> eigen_device_;
 };

+template <typename Place>
+struct DefaultDeviceContextType;


What is DefaultDeviceContextType used for?

reyoung · 2017-12-28

It is used by the DeviceCtxPool::GetByPlace() method, which returns the DeviceContext cast to the concrete type that matches the given place type.

In the future, we could add a library type to this method.
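
A type trait of this kind maps each place type to its context type at compile time, so a lookup can return an already-typed pointer instead of the base class. The following is a minimal sketch under stated assumptions: the place and context classes are empty mocks, `ToyPool` stands in for the real pool, and the member alias name `Type` is hypothetical.

```cpp
#include <type_traits>

// Hypothetical, empty stand-ins for the place and context classes.
struct CPUPlace {};
struct CUDAPlace {};

struct DeviceContext {
  virtual ~DeviceContext() = default;
};
struct CPUDeviceContext : DeviceContext {
  bool OnGpu() const { return false; }
};
struct CUDADeviceContext : DeviceContext {
  bool OnGpu() const { return true; }
};

// Primary template is only declared; each place specializes it.
template <typename Place>
struct DefaultDeviceContextType;

template <>
struct DefaultDeviceContextType<CPUPlace> {
  using Type = CPUDeviceContext;  // alias name is an assumption
};
template <>
struct DefaultDeviceContextType<CUDAPlace> {
  using Type = CUDADeviceContext;
};

static_assert(std::is_same<DefaultDeviceContextType<CPUPlace>::Type,
                           CPUDeviceContext>::value,
              "CPUPlace maps to CPUDeviceContext");

// Toy pool: Get() returns the base pointer, GetByPlace() the typed one.
class ToyPool {
 public:
  DeviceContext* Get(const CPUPlace&) { return &cpu_; }
  DeviceContext* Get(const CUDAPlace&) { return &cuda_; }

  template <typename Place>
  typename DefaultDeviceContextType<Place>::Type* GetByPlace(
      const Place& place) {
    using Context = typename DefaultDeviceContextType<Place>::Type;
    return static_cast<Context*>(Get(place));
  }

 private:
  CPUDeviceContext cpu_;
  CUDADeviceContext cuda_;
};
```

Because the primary template has no definition, asking for the context type of an unsupported place fails at compile time rather than at run time.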

QiJune · 2017-12-29T02:26:11Z

paddle/framework/tensor_util.h

@@ -207,6 +209,12 @@ inline void CopyToVector(const Tensor& src, std::vector<T>* dst) {
               src_ptr, size);
 }

+// Returns true if a tensor contains NAN, i.e., Not A Number.
+extern bool HasNAN(const framework::Tensor& tensor);


extern seems superfluous.
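
The reason is that a namespace-scope function declaration has external linkage by default, unless it is declared `static` or placed in an unnamed namespace. A small sketch with a hypothetical `HasNan` over `std::vector<float>` (not the Paddle signature) shows that both spellings declare the same function:

```cpp
#include <cmath>
#include <vector>

// Both lines declare the same function with external linkage; the second
// simply omits the redundant keyword.
extern bool HasNan(const std::vector<float>& values);
bool HasNan(const std::vector<float>& values);

// The definition, which in a real project would live in a .cc file.
bool HasNan(const std::vector<float>& values) {
  for (float v : values) {
    if (std::isnan(v)) return true;
  }
  return false;
}
```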

QiJune · 2017-12-29T02:47:01Z

paddle/framework/tensor_util.cu

@@ -0,0 +1 @@
+./tensor_util.cc


Why not move the code in tensor_util.cc to tensor_util.h? Alternatively, we could have a tensor_util_impl.h. It's a little strange to have a symbolic link.

tonyyang-svail · 2018-02-13

@QiJune Reyoung probably had an offline discussion with you already, but it looks like the .cu file is necessary for nvcc to compile this code.

QiJune · 2017-12-29T02:49:26Z

paddle/framework/tensor_util_test.cc

+  ASSERT_TRUE(HasNAN(src));
+}
+
+TEST(IsInf, CPU) {


IsInf --> HasInf

QiJune

LGTM

reyoung added 14 commits December 27, 2017 10:29

Rename API of DeviceContext

fd2bf55

Make them as usual names.

Rename API of DeviceContext

a5e1cf5

Make them as usual names.

Rename API of DeviceContext

8b877dd

Make them as usual names.

Fix compile

42062c3

Merge branch 'feature/enhance_dev_ctx_pool' of github.com:reyoung/Pad…

516967e

…dle into feature/enhance_dev_ctx_pool

Fix compile

b711870

Add API for HasNAN HasInf

15309fd

Fix compile

4518252

Merge branch 'feature/enhance_dev_ctx_pool' into feature/is_nan

e54bb6c

Add is_nan/is_inf

3d282ec

Fix compile

a5291f9

Merge branch 'feature/enhance_dev_ctx_pool' into feature/is_nan

837da79

Fix compile

16a8432

Merge branch 'feature/enhance_dev_ctx_pool' into feature/is_nan

71157b3

QiJune self-requested a review December 27, 2017 08:15

reyoung added 4 commits December 28, 2017 10:25

Merge branch 'develop' of github.com:baidu/Paddle into feature/is_nan

e2be6dd

Fix compile

003917d

Fix compile

878d2e9

Fix compile

a9a44e0

QiJune reviewed Dec 28, 2017


reyoung added 2 commits December 28, 2017 17:50

Update tensor_util

3158b4b

Merge branch 'develop' of github.com:baidu/Paddle into feature/is_nan

f97205e

QiJune reviewed Dec 29, 2017


QiJune approved these changes Dec 29, 2017


reyoung merged commit 93eaa8e into PaddlePaddle:develop Dec 29, 2017

reyoung deleted the feature/is_nan branch January 22, 2018 04:13


tonyyang-svail commented Feb 13, 2018 (edited)
