-
Notifications
You must be signed in to change notification settings - Fork 5.6k
2018 03 14
-
Fix some outdated links in doc/fluid/read_source.md
-
Fix some outdated contents in Contribute Documentations
-
Fix bugs about building documentations failed.
Memory optimization
-
[WIP] Recompute policy in memory optimization
-
Runtime Operator and Variable should take OpDesc and VarDesc as private data members
Fix and enhance:
- Refine cast operator
- [WIP] A prototype of ParallelExecutor in C++
- Link: https://github.com/PaddlePaddle/Paddle/pull/9080/files#diff-564dec854cf4f37015001783f71e06cb
- We have converted
ProgramDesc
intoDependencyGraph with SSA form
and analysis which operators can be paralleled.
- Add Readers and decorators
- RecordIO Reader
- Shuffle Decorator
- DoubleBuffer Decorator
- ISSUE:
- EDL
- DeepSpeech2 on Sys Kubernetes Cluster
- Distribute with CPU, done
- Distribute with CPU and EDL, training
- Distribute with GPU, debugging
- DeepSpeech2 on Sys Kubernetes Cluster
- document
- enhancement dist train doc, https://github.com/PaddlePaddle/Paddle/pull/8893
- sparse update doc, https://github.com/PaddlePaddle/Paddle/pull/8997
- bug fix
- fix save model bug, https://github.com/PaddlePaddle/Paddle/pull/8928
- fix dist compile failed, https://github.com/PaddlePaddle/Paddle/pull/8987
- design doc: distributed training with large parameter, https://github.com/PaddlePaddle/Paddle/pull/9068
- review:
- Object Detection
- Fix detection_map_op for multi-device.
- Clipping bbox in the mAP evaluator calculation.
- Fix bug in detection_output and mAP calculation in SSD.
- Refine network config for MobileNet-SSD.
- Train MobileNet-SSD on Pascal VOC dataset and do some contrastive experiments with Caffe.
- Adapt attention seq2seq
https://github.com/PaddlePaddle/models/pull/675 - [WIP] Enhance sequence expand operator
https://github.com/PaddlePaddle/Paddle/pull/9100 - Improve speed of async reader
https://github.com/PaddlePaddle/models/pull/726
- OCR
- Add sum accumulator with window for model average[WIP]
- Review
- [Speed] feature/ParallelExecutor: https://github.com/PaddlePaddle/Paddle/pull/8891
- Review:
- Cpp parallel executor: https://github.com/PaddlePaddle/Paddle/pull/9080
- Refine parallel_do_grad: https://github.com/PaddlePaddle/Paddle/pull/9072
- Repair nccl op test: https://github.com/PaddlePaddle/Paddle/pull/8575
- Participating Parallel Executor discussion and development with YangYang
- clean up EDL repo and fix CI
- review and issue discussions
- https://github.com/PaddlePaddle/Paddle/pull/8891#pullrequestreview-103229732
- https://github.com/PaddlePaddle/Paddle/issues/8903#issuecomment-371909647
- https://github.com/PaddlePaddle/Paddle/issues/8929#issuecomment-371918111
- https://github.com/PaddlePaddle/Paddle/pull/8997#pullrequestreview-103235570
- https://github.com/PaddlePaddle/cloud/pull/634#pullrequestreview-103244068
- https://github.com/PaddlePaddle/Paddle/pull/9033#pullrequestreview-103603905
- https://github.com/PaddlePaddle/Paddle/issues/9034
- https://github.com/PaddlePaddle/Paddle/pull/9068#issuecomment-373181797
- https://github.com/PaddlePaddle/Paddle/pull/9079#pullrequestreview-104011829
- https://github.com/PaddlePaddle/Paddle/pull/9080#pullrequestreview-104028959
-
Fluid API Std (merged)
-
Fluid API doc problems
https://github.com/PaddlePaddle/Paddle/issues/9056
https://github.com/PaddlePaddle/Paddle/issues/9060
https://github.com/PaddlePaddle/Paddle/issues/9062
https://github.com/PaddlePaddle/Paddle/issues/9063
https://github.com/PaddlePaddle/Paddle/issues/9064
https://github.com/PaddlePaddle/Paddle/issues/9070
https://github.com/PaddlePaddle/Paddle/issues/9092
-
PR Review:
-
CI
- get 198 back online
- added WITH_UBUNTU_MIRROR in CI configuration parameter https://github.com/PaddlePaddle/Paddle/issues/9073
-
EDL Guide update
-
Document translation
- Paddle CSP
- Add modification to Channel to support Select OP https://github.com/PaddlePaddle/Paddle/pull/9084
- Implement Select OP https://github.com/PaddlePaddle/Paddle/pull/9088#commits-pushed-b107575
- Implement non blocking CanSend and CanReceive in Channels for Select OP https://github.com/PaddlePaddle/Paddle/issues/8815
- Expose methods to create and add QueueMessage to Send or Receive Queue https://github.com/PaddlePaddle/Paddle/issues/8863
- Add ability to provide callbacks in QueueMEssage Notify https://github.com/PaddlePaddle/Paddle/issues/8864
- Add ability to provide condition variable externally to QueueMessage https://github.com/PaddlePaddle/Paddle/issues/9083
- Fluid Channels should support both move and copy semantics for data transfer https://github.com/PaddlePaddle/Paddle/issues/9085
- Move all Concurrency operators to paddle/fluid/operators/concurrency https://github.com/PaddlePaddle/Paddle/issues/9086
- Channel Destroy should inform Select callback of destruction https://github.com/PaddlePaddle/Paddle/issues/9087
- Fix the CPP Data Feeding design document https://github.com/PaddlePaddle/Paddle/pull/9033
- Correct language inLarge parameter distributed training https://github.com/PaddlePaddle/Paddle/pull/9068/commits/5948fd27170d91e2728e9366e9852f32dc17fd2f
- PR Reviews
- https://github.com/PaddlePaddle/ModelCE/pull/1#pullrequestreview-104119184
- https://github.com/PaddlePaddle/Paddle/pull/9079#pullrequestreview-103954554
- https://github.com/PaddlePaddle/Paddle/pull/9018#pullrequestreview-103976955
- https://github.com/PaddlePaddle/Paddle/pull/8934#pullrequestreview-103640702
- Inference Framework
- [Reviewing] Remove unnecessary clone of program in C++ Executor.Run, a performance gain of 1.3% ~ 20%
- Measure the performance gain of separate Executor.Prepare and Executor.RunPreparedContext, 1% ~ 9%
- [Reviewing] Limit the symbol table of fluid shared library
- Mobile
- Make a schedule with image group
- Train the DeepASR model on whole training dataset
- Add script for inference by using checkpoint
- Add pybind11 wrapper for decoder
- Fix the profiling bug in multi-gpu mode
Code Review:
- Refine data reader and move data_reader to async_data_reader
- Enhance reshape
- Add a test to ensure profiler works on multi-gpu
- Fix models #725
- Distributed GPU Version:
- Debugger: https://github.com/PaddlePaddle/Paddle/pull/9025
- Review:
- refine nccl ops, reduce streamSychronize call
- export scatter op to python
- [Speed] sequence_softmax_op, sequence_softmax_op_grad and some others.
- try to fix memory bug/ parallel.do bug
- fluid dist train:
- https://github.com/PaddlePaddle/Paddle/pull/9027
- update prototype bugs run in Linux: https://github.com/typhoonzero/grpc_zerocopy_async_example
- EDL:
- make TPR version workable: https://github.com/PaddlePaddle/edl/pull/7
- EDL PR reviews
- reviews:
- https://github.com/PaddlePaddle/Paddle/pull/9075
- https://github.com/PaddlePaddle/Paddle/pull/9068 TODO: async SGD fix simple transpiler that don’t split variables by default to boost dist train perf
- C++ readers refining and potential bugs fix:
- C++ readers for multi-threads and multi-GPU
- design doc: https://github.com/PaddlePaddle/Paddle/pull/9079
- readers and RecordIO review:
- NMT:
- Add beam search and inference program for Transformer.
- Add initializers for Transformer and the corresponding cost curve.
- Review:
- Other:
- Add guide for documentation-9-"RNN模型"
-
Fix the Accuracy interface in benchmark.
-
Profiling for Transformer model.
-
Add rnn_search in models.
-
Reviews:
- Remove Accuracy: https://github.com/PaddlePaddle/models/pull/704
-
Continue to improve profiler. Added nested block, nested event support, more tests coverage.
-
Other improvements
-
Work with Chunwei on Model CI and Contrib
-
Work with Input Pipeline Design
-
Work with Performance team and profiling
-
Multi-threading prototype -https://github.com/panyx0718/Paddle/commit/0c581b1df1a7acec331129604a9fcfe6566951a7
- Optimization
- [Speed]Refine parallel_do_grad
- [Speed]fuse optimize op transpiler (with @qiaolongfei)
- SE-ResNeXt-150 Optimization
- Enhancement regularization for sparse parameter
- Enhance regularizer.py
- Enhance look_up_table op
- Add ElementwiseOpInferVarType for Elementwise_op
- Review
- Better timeline
- Repair nccl op test
- Refine network config for MobileNet-SSD.
- A little optimize of optimizer
- compiler: enable WITH_FLUID option: https://github.com/PaddlePaddle/Paddle/pull/9067
- inference:
- add MKL for fluid static and shared library: https://github.com/PaddlePaddle/Paddle/pull/8887
- [WIP] fuse batch norm
- doc:
- fix document deployment: https://github.com/PaddlePaddle/Paddle/pull/8894
- [Discuss] Collect and reclassify fluid documentation: https://github.com/PaddlePaddle/Paddle/issues/9031
- code review:
- [doc guide]:
- API standard: https://github.com/PaddlePaddle/Paddle/pull/8927
- Contribute Documentations: https://github.com/PaddlePaddle/Paddle/pull/9016
- Refine the profile codes for inference: https://github.com/PaddlePaddle/Paddle/pull/8910
-
Profile
- fuse optimize op transpiler https://github.com/PaddlePaddle/Paddle/pull/8940
- [wip]add executor.prepare https://github.com/PaddlePaddle/Paddle/pull/9022
- Discuss the method to support regularization on sparse parameter with @chenduo
- Benchmark
-
Abacus on Fluid
- Add distributed lookup table design https://github.com/PaddlePaddle/Paddle/pull/9075
-
Review
- Extract Prepare from Executor. https://github.com/PaddlePaddle/Paddle/pull/9000
- Enhance regularizer.py https://github.com/PaddlePaddle/Paddle/pull/8934
- Enhance look_up_table op https://github.com/PaddlePaddle/Paddle/pull/8932
- remove unnecessary build graph logic https://github.com/PaddlePaddle/Paddle/pull/8896
-
Model Continuous Evaluation
- https://github.com/PaddlePaddle/ModelCE
- discussion, issue
- init design
-
WIP, initial implementation
- works for one ResNet30 model, tracks the train cost and time consume.
- truning the details
-
Conv sequence to sequence model
-
reviews
- PaddlePaddle.org
- Fix Paddle Doc Generator (https://github.com/PaddlePaddle/PaddlePaddle.org/pull/439)
- Paddle
- Implement Select OP (https://github.com/PaddlePaddle/Paddle/pull/9088)
- Create Select OP unit tests
- Create channel_utils, move shared helper methods for channel here
- Update Channel class to support select op
- Implement Select OP (https://github.com/PaddlePaddle/Paddle/pull/9088)
Performance
EDL
Distributed embedding operator design
Continuous evluation
Customier service
- https://github.com/PaddlePaddle/Paddle/issues/8999#issuecomment-372385999
- https://github.com/PaddlePaddle/Paddle/issues/8998#issuecomment-372388468
- https://github.com/PaddlePaddle/Paddle/issues/8949#issuecomment-372050985
- https://github.com/PaddlePaddle/Paddle/issues/8948#issuecomment-372389473
- https://github.com/PaddlePaddle/Paddle/issues/8942#issuecomment-372051198
- https://github.com/PaddlePaddle/Paddle/issues/8931#issuecomment-372395072
- https://github.com/PaddlePaddle/Paddle/issues/8892#issuecomment-372396478
- https://github.com/PaddlePaddle/Paddle/issues/8888#issuecomment-371678245
- https://github.com/PaddlePaddle/Paddle/issues/8885#issuecomment-371678718
- https://github.com/PaddlePaddle/Paddle/issues/8884#issuecomment-371679065
- https://github.com/PaddlePaddle/Paddle/issues/8883#issuecomment-371679505
- https://github.com/PaddlePaddle/Paddle/issues/8881#issuecomment-371680666
- https://github.com/PaddlePaddle/Paddle/issues/8881#issuecomment-373258541
- https://github.com/PaddlePaddle/Paddle/issues/8880#issuecomment-371674598
- https://github.com/PaddlePaddle/Paddle/issues/8878#issuecomment-371682403
- https://github.com/PaddlePaddle/Paddle/issues/8877#issuecomment-371685194
- https://github.com/PaddlePaddle/Paddle/issues/8876#issuecomment-371684707
- https://github.com/PaddlePaddle/Paddle/issues/8875#issuecomment-371685425
- https://github.com/PaddlePaddle/Paddle/issues/8860#issuecomment-371687425
PRs
- Deprecated san: https://github.com/PaddlePaddle/VisualDL/pull/302
- Only write the modified tablets to file system.: https://github.com/PaddlePaddle/VisualDL/pull/304
- Update the NODE_ENV variable to fix Vue in Production: https://github.com/PaddlePaddle/VisualDL/pull/310
- gcc can't properly parse the raw string literal with #define: https://github.com/PaddlePaddle/VisualDL/pull/311
- Add python 3 build support. : https://github.com/PaddlePaddle/VisualDL/pull/314
- Update Documentation for the release: Fixed the image not shown issue: https://github.com/PaddlePaddle/VisualDL/pull/317/files
- Update protobuf version to 3.4: https://github.com/PaddlePaddle/Paddle/pull/9091
Issue:
- import paddle.v2.fluid has TypeError: init() got an unexpected keyword argument ‘file'https://github.com/PaddlePaddle/Paddle/issues/9090
Support:
- https://github.com/PaddlePaddle/VisualDL/issues/320
- https://github.com/PaddlePaddle/VisualDL/issues/319
- https://github.com/PaddlePaddle/VisualDL/issues/315
- https://github.com/PaddlePaddle/VisualDL/issues/307
- https://github.com/PaddlePaddle/VisualDL/issues/305
- Issues & performance:
- Fix smoothing value and tooltip issues https://github.com/PaddlePaddle/VisualDL/pull/323
- Improve smoothing sliding performance by throttling https://github.com/PaddlePaddle/VisualDL/pull/312
- Design and UI Polish:
- Polish scalar and histogram chart UI, reorganize chart tools https://github.com/PaddlePaddle/VisualDL/pull/303
- Redesign chart toolbar icons and improve UX for download chart JSON data https://github.com/PaddlePaddle/VisualDL/pull/309
- Improve chart color sequence https://github.com/PaddlePaddle/VisualDL/pull/318
- Optimize expanded chart UI https://github.com/PaddlePaddle/VisualDL/pull/324
Inference:
- Bechmarking recognize_digits example using TensorRT:
- Discussion with people from Nvidia TensorRT regarding:
- paddle to UFF converter (UFF is the representation format from TensorRT)
- Issues with the Batch norm layer when using a Tensorflow model in Python with TensorRT
- Float16: Discussion with Kexin on modifying load-store ops for float16
Inference:
- Add float16 support Mul Op: https://github.com/PaddlePaddle/Paddle/pull/9017
- Bind numpy float16 to paddle float16: https://github.com/PaddlePaddle/Paddle/pull/9017
- Add GPU compute compatibility check: https://github.com/PaddlePaddle/Paddle/pull/8946
- [WIP] add float16 support for cudnn conv op: https://github.com/PaddlePaddle/Paddle/pull/9098
-
Implementation and refinement of Python API, and Python tests for Select Op and cases
-
Support miscellaneous issues and PRs in VDL: https://github.com/PaddlePaddle/VisualDL/issues/307, https://github.com/PaddlePaddle/VisualDL/pull/310, https://github.com/PaddlePaddle/VisualDL/pull/310, https://github.com/PaddlePaddle/VisualDL/issues/291 etc.