feature/mul converter #3

Superjomn · 2018-05-27T04:44:31Z

No description provided.

- Finished draft of pooling reusing of operators - Using gethash in PoolGrad added - Removed diagnostic - Added pool mkldnn grad reusing of primitives - Added diagnostic - Removed diagnostic - added dependency to mkldnn data type for pooling mkldnn - Added mkldnn memory data type determining based on template type of op - Compilation warning fix - codying style fixes

* Add quad transform. * Fix some syntax error. * Fix CUDA kernel launch configure. * Generalize geometry channels. * Rename QuadTransform to PolygonRestore. * Rename op. * Rename op and fix computation. * Modify CMakeLists.txt for box_restore op. * Refine code: 1. rename op 2. uncomment unitest on GPU

PaddlePaddle#39128) * Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Fix to pass NPU CI * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again

* initial commit: simple demo * polish copyright format * add grap op simple demo * adapt uncertain number of argument * change trait marco name * add place & dtype support for add kernel * add dispath and infershape func * poish code & add notes * add dynamic_loader dep for paddle_framework * add new custom op test dir * polish impl details * add unittest for new custom op * fix failed unittest * Costum op (#1) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * make place const * make Tensor copy * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * Remove ShareData from user && Change CustomTensor to Tensor && Support more data type (#2) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * make place const * make Tensor copy * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * hid share data from and to * rename CustomTensor to Tensor * refactor register design & add test * change op_funtion to op_meta_info * split op meta info into .h and .cc * move get methods into friend class * move OpMetaInfoHelper into framework space * move CustomTensorUtils into framework space * change pybind api name * move PD C API into op meta info * add register custom op api * remove inference cmake change * refactor copy to api && change Reshape to lowercase && support more dtype && add more test (#3) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * make place const * make Tensor copy * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * hid share data from and to * rename CustomTensor to Tensor * support multi dtype * remove lod, make reshape lowercase, add copy test and refactor copy api * fix copy to error * add more test * polish detail & error message * polish test details * Add cast api && Change copy related api to copy_to && add more test (#4) * fix compile error * wrap framework tensor with LoDTensor * fix compile error * add CustomTensor default constructor * add size() for CustomTensor * make size const for CustomTensor * refactor place related api to circle the concept * fix compile error * make place const * make Tensor copy * debug CustomTensor core * remove additional head of framework * use back to shared ptr for custom tensor * add gpu test * merge latest cwh code in * adjust ut code of custom op * hid share data from and to * rename CustomTensor to Tensor * support multi dtype * remove lod, make reshape lowercase, add copy test and refactor copy api * fix copy to error * add more test * add type cast * add cast and make copy to api * merge cwh code * add more error log * polish code * used for test * remove test comment * fix uint8 type error * fix lost uint8 type error * add test for coverage * polish details by reviewer comments * add prefix for DISABLE_COPY_AND_ASSIGN Co-authored-by: Jiabin Yang <360788950@qq.com>

jczaja and others added 30 commits May 17, 2018 11:01

Merge remote-tracking branch 'yx/fix_bce_cdn_link' into feature/refin…

ceb150e

…e_parallel_executor

Merge pull request PaddlePaddle#10745 from Yancey1989/fix_bce_cdn_link

062c811

use cdn to speed up thirdparty package download

Merge pull request PaddlePaddle#10736 from luotao1/mkldnn_cmake

651c934

refine mkldnn cmake with official commit id

Merge pull request PaddlePaddle#10737 from chengduoZH/fix_timeline

d0a62bf

Fix timeline for the number of profile is one

should load parameter before create parallel_executor

feed94e

optimized checkpoint serial number and folder

6d53dce

remove boost filesystem

8430c8d

modify variable point

7b6c0ab

fix auto serial_num has no initializer

f9d4b9d

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

4d2a2e7

… develop

fix index

8023bc7

fix DataTransFunc (PaddlePaddle#10752)

93c4700

Build: fix build error when WITH_FLUID_ONLY and WITH_GPU both set as …

868bdc9

…OFF. (PaddlePaddle#10756)

bug fix

a4fd375

add comment

d2d671e

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

f79779f

… develop

fix roi_pool gpu_backward kernel

1d7f91e

listen_and_serv use local scope (PaddlePaddle#10663)

ebc7303

* listen_and_serv use localscope * fix ut

bug fix

f688652

update op to trianer and pserver

821acdb

Merge pull request PaddlePaddle#10700 from baiyfbupt/develop

67b8a30

fix roi_pool op bug

Merge pull request PaddlePaddle#10741 from jacquesqiao/inferencer-sup…

54ae8e4

…port-multi-gpu Inferencer support parallel_executor

add trainer.stop and fix a bug for train_by_parallel_executor (Paddle…

eb7d875

…Paddle#10762)

merge develop

eff92d0

Merge pull request PaddlePaddle#10526 from panyx0718/infer_profile2

8e3e65f

allow inference test to generate timeline

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into feature…

76d20c3

…/mul_converter

update

4218d83

Fix a compile error

871bc43

use latest pip version in dev and production Docker image (PaddlePadd…

17f5037

…le#10760) * use latest pip version * fix ci * update by comment

reyoung and others added 20 commits May 25, 2018 15:10

Change optimizer to old paddle style

c980e4c

Merge pull request PaddlePaddle#10919 from typhoonzero/remove_fluid_o…

7530366

…ld_cluster_benchmark_script remove old fluid cluster benchmark scripts

Merge pull request PaddlePaddle#10872 from JiayiFeng/dev_CustomReader

cd8700f

CustomReader

fix build error with testing and gpu on (PaddlePaddle#10932)

c770d5c

Merge pull request PaddlePaddle#10930 from reyoung/feature/make_under…

2013b1c

…stand_sentiment_serial Disable unstable test

add random reader op export (PaddlePaddle#10914)

36fd705

Merge pull request PaddlePaddle#10921 from reyoung/feature/change_opt…

dd428a0

…imizer_tests Change optimizer to old paddle style

fix rename var

b348e15

feature/inference api demo impl (PaddlePaddle#10825)

fd45c6d

add inference api demo impl

Merge pull request PaddlePaddle#10937 from Yancey1989/fix_renamevar

0930646

fix rename var

enable eigen multi-threads on mobile device (PaddlePaddle#10938)

83f4e9e

disable remove rpath from third party protoc (PaddlePaddle#10939)

391c274

Unified bilinear_interp op Python interface specification (PaddlePadd…

1ba2581

…le#10925) * unify UpsamplingBilinear2d interface specification * unify UpsamplingBilinear2d interface specification * fix name conventions * small fix about computation order

scripts: clean bash scripts. (PaddlePaddle#10721)

72149c1

* scripts: clean bash scripts. * Fix build related documents.

Add create LoDTensor from list option and simplify recommender book e…

c79ec9f

…xample (PaddlePaddle#10946) * add create lodtensor from list * modify book example

Fix attribute name in new API (PaddlePaddle#10947)

fb43c6b

fix float16 demo location issue (PaddlePaddle#10948)

a62bbd1

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into feature…

96eb47b

…/mul_converter

add more tests

8c6150f

Superjomn force-pushed the feature/mul_converter branch 2 times, most recently from 5649827 to fbc9db7 on May 27, 2018 04:57

Superjomn closed this May 27, 2018

Superjomn reopened this May 27, 2018

update

5b30915

Superjomn force-pushed the feature/mul_converter branch from fbc9db7 to 5b30915 on May 27, 2018 05:26

Superjomn closed this May 27, 2018

Superjomn pushed a commit that referenced this pull request Jan 11, 2019

Merge pull request #3 from PaddlePaddle/develop

7324ffa

merge to dev
