2017 12 20
Tao Luo edited this page Dec 9, 2019 · 1 revision
- cloud:
- cluster:
- Deploy PaddleCloud on k8s-k40 cluster
- scheduler:
WIP
- NTM with attention
- Rename XXDescBind --> XXDesc
Gradient Clipping
Design Of switching kernel
Enhancement
- https://github.com/PaddlePaddle/Paddle/pull/6771
- https://github.com/PaddlePaddle/Paddle/pull/6761
- https://github.com/PaddlePaddle/Paddle/pull/6673
- https://github.com/PaddlePaddle/Paddle/pull/6655
- https://github.com/PaddlePaddle/Paddle/pull/6653
- https://github.com/PaddlePaddle/Paddle/pull/6642
- Add a simple threadpool: https://github.com/PaddlePaddle/Paddle/pull/6684
- Multi CPU design: https://github.com/PaddlePaddle/Paddle/pull/6223
- Review:
- [WIP] Parallel_do: https://github.com/PaddlePaddle/Paddle/pull/6730
- Issue:
- Review
Multi-device:
- add design doc on keys of operator kernel type
- Discussion on integrating MKLDNN to fluid
Fix and Enhance
- add GetPlaces operator
- add more place test and rename Cudnn to CUDNN
- Add separable convolution:
- Add Xception model for image classification:
- Add documentation for C-API (has not yet addressed all of the reviewers' comments):
- Refine the documentation for fully connected layer's helper and mul_op:
- Some small fix:
- PR review:
- Set up and build PaddlePaddle for iOS in Xcode.
- Understand and write inference in Fluid: https://github.com/PaddlePaddle/Paddle/pull/6672
- Design doc for executor: https://github.com/PaddlePaddle/Paddle/pull/6721
- Collaborate with Yiqun and QingQing to understand current C-API and inference.
- Design Operator Markdown: https://github.com/PaddlePaddle/Paddle/pull/6805
- Polish and write documentation for operators in layers.py (issue: https://github.com/PaddlePaddle/Paddle/issues/6526)
- Documentation of Layers [WIP]
- Polishing the documentation of the less than layer https://github.com/PaddlePaddle/Paddle/pull/6816
- Polish layer documentation for fill_constant ops https://github.com/PaddlePaddle/Paddle/pull/6808
- Polishing the embedding layer and the fc layer documentation https://github.com/PaddlePaddle/Paddle/pull/6806
- Polish the support new device doc https://github.com/PaddlePaddle/Paddle/pull/6594
- PR Review
- https://github.com/PaddlePaddle/Paddle/pull/6799#pullrequestreview-84972870
- https://github.com/PaddlePaddle/Paddle/pull/6817
- https://github.com/PaddlePaddle/Paddle/pull/6810#pullrequestreview-84962168
- https://github.com/PaddlePaddle/Paddle/pull/6807#pullrequestreview-84950293
- https://github.com/PaddlePaddle/Paddle/pull/6805#pullrequestreview-84914464
- https://github.com/PaddlePaddle/Paddle/pull/6792#pullrequestreview-84862892
- refactoring of backward code [WIP]
- reviews:
- Profiling:
- Add design documentation for profiling tool: https://github.com/PaddlePaddle/Paddle/pull/6700
- Add the event to record the timeline: https://github.com/PaddlePaddle/Paddle/pull/6701
- Refine CUDA profiler and delete the test file: https://github.com/PaddlePaddle/Paddle/pull/6715
- VGG Net and RNN profiling (TODO): https://github.com/PaddlePaddle/Paddle/issues/6736
- Memory optimization for OCR model
- Reduce memory usage in conv layer and RoI layer for mobile inference.
- Memory optimization in convolution layer by the grouped im2col and gemm.
- Code Review:
- Correct the dropout_op's computation in test #6804
- feature/print op #6799
- Doc
- Models regtest
- Review
- Review
- [Memory optimization] https://github.com/PaddlePaddle/Paddle/pull/6659
- https://github.com/PaddlePaddle/Mobile/pull/66
- [Multi-Thread] https://github.com/PaddlePaddle/Paddle/pull/6751
- [C-API] https://github.com/PaddlePaddle/Paddle/pull/6596
- MKL
- [discuss] how to add MKL Packed python interface: https://github.com/PaddlePaddle/Paddle/issues/6612
- refine mkl&openblas script for benchmark: https://github.com/PaddlePaddle/Paddle/pull/6706
- review: MKL Packed design doc and code for RNN optimization
- https://github.com/PaddlePaddle/Paddle/pull/6636, https://github.com/PaddlePaddle/Paddle/pull/6680, https://github.com/PaddlePaddle/Paddle/pull/6719
- use Intel OpenMP to speedup seq2batch when WITH_MKL: https://github.com/PaddlePaddle/Paddle/pull/6622
- docs:
- update the link of doc.paddlepaddle.org in README.md: https://github.com/PaddlePaddle/Paddle/pull/6618, https://github.com/PaddlePaddle/Paddle/pull/6734
- add fluid python api in chinese website: https://github.com/PaddlePaddle/Paddle/pull/6663
- reopen linkchecker: https://github.com/PaddlePaddle/Paddle/pull/6748
- [WIP] add python doc for sequence_pool: https://github.com/PaddlePaddle/Paddle/pull/6787
- review: k8s related, mobile, c-api docs
- [USER] Add refer code to get started: https://github.com/PaddlePaddle/Paddle/pull/6646
- https://github.com/PaddlePaddle/Paddle/pull/6674, https://github.com/PaddlePaddle/Paddle/pull/6742, https://github.com/PaddlePaddle/Paddle/pull/6711, https://github.com/PaddlePaddle/Mobile/pull/63, https://github.com/PaddlePaddle/Mobile/pull/62, https://github.com/PaddlePaddle/Paddle/pull/6596
- distributed training discussion
- add iOS cross-compiling doc in English
- fluid
- model CI with @haoshuang
- visualDL with @longfei @daming
- move repo to PaddlePaddle/
- https://github.com/VisualDL/VisualDL/pull/52
- https://github.com/PaddlePaddle/VisualDL/pull/3
- PR
- https://github.com/PaddlePaddle/Paddle/pull/6744/files/927f5d567c737fc9e754f911a1ced3def77b0651
- https://github.com/PaddlePaddle/models/pull/534#pullrequestreview-82481975
- https://github.com/PaddlePaddle/regtest/pull/8#pullrequestreview-84398256
- https://github.com/PaddlePaddle/regtest/pull/9#pullrequestreview-84693303
- https://github.com/PaddlePaddle/regtest/pull/8#pullrequestreview-84694853
- Update doc of v2 api
- performance validation of understand_sentiment in fluid
- Framework
- Read some documentation of fluid
- Review
- Auto pruning: https://github.com/PaddlePaddle/Paddle/pull/2603
- add doc for how to use C-API: https://github.com/PaddlePaddle/Paddle/pull/6596
- Mobile
- Refine the inference demo
- Refine the content of many README and add some Chinese README:
- Android AICamera Demo
- Port MultiBox Loss (in progress)
- Setup dev servers (Flask and NodeJS) for VisualDL
- Research on how TensorBoard API works on retrieving data for Image and Scalar
- Write Demo / Tutorial for VisualDL
- Tables in Markdown are not rendered; fixing
- PR:
- Add Python wrapper for reduceSum, reduceMean
- Refine and merge the Inception-ResNet-v2 model
- Review:
- Multi Device
- Code optimize
- Review
PR
- Refine accuracy_op.cu
- Refine elementwiseAdd and im2col
- Add get parameters directly
- Add dilation for conv_trans_op
Benchmark
- Summary of performance optimization: 02.Recognize_digits (WIP)
- Fix mnist
- fix nvprof GPU consistency
- 07.Semantic Role Labeling
Review
- Refine CUDA profiler and delete the test file
- Expose sequence_softmax_op: https://github.com/PaddlePaddle/Paddle/pull/6726
- Expose sequence_expand_op: https://github.com/PaddlePaddle/Paddle/pull/6670
- Expose lstm unit: https://github.com/PaddlePaddle/Paddle/pull/6669
- seq2seq [WIP]: https://github.com/dzhwinter/benchmark/pull/25
- Model update: https://github.com/PaddlePaddle/DeepSpeech/pull/88, https://github.com/PaddlePaddle/DeepSpeech/pull/90
- Code Review:
- https://github.com/PaddlePaddle/Paddle/pull/6789#pullrequestreview-84958477
- https://github.com/PaddlePaddle/DeepSpeech/pull/92#pullrequestreview-84956016
- https://github.com/PaddlePaddle/Paddle/pull/6716#pullrequestreview-84671460
- https://github.com/PaddlePaddle/Paddle/pull/6708#pullrequestreview-84062907
- https://github.com/PaddlePaddle/Paddle/pull/6663#pullrequestreview-83794521
- https://github.com/PaddlePaddle/DeepSpeech/pull/68#pullrequestreview-83739165
- https://github.com/PaddlePaddle/Paddle/pull/6538#pullrequestreview-84057565
- SPP (spatial pyramid pooling) op: done
- detection_output op (for SSD): in progress, under code review
- norm op: writing unit tests
- Fluid
- remove GPU Sync Interface
- change OperatorBase Run with Place
- fix Place inheritance
- change cuDNN operators to cuDNN kernels
- add global device pool
- polish executor design doc
- Reviews
- add kernel key
Deep Speech 2:
- Finish the BaiduEN8K model's training with lr decay + clip by global norm
- review:
Fluid:
- VGG16 convergence comparison on GPU between fluid and tensorflow
- VGG16 performance comparison between fluid and tensorflow
- [merged] Correct the dropout_op's computation in test
- [merged] Enable reshape_op to support dimension inference
- Fix crf_layer, crf_decoding, chunk_evaluator when using GPU.
- Refine the DOC of DSSM.
- Book.04 word2vec performance comparison with V2.
- Concurrency discussions
- Multiple Machine Review: https://github.com/PaddlePaddle/Paddle/pull/6297#pullrequestreview-84871025
- Multiple Thread Review: https://github.com/PaddlePaddle/Paddle/pull/6684#pullrequestreview-84945672
- Paddle Cloud Review: https://github.com/PaddlePaddle/cloud/pull/526#pullrequestreview-83673564
- Cluster train transpiler updating:
- Train DeepFM on kubernetes cluster
- PR reviews of Paddlecloud, thread pool and some design doc
- benchmarking 05 recommender
- WIP: Updating documentation of layers: https://github.com/PaddlePaddle/Paddle/issues/6526
- Related PR reviews: