2017 12 27 - PaddlePaddle/Paddle GitHub Wiki

Yu Yang

Gradient Check of RNN
- [WIP] https://github.com/PaddlePaddle/Paddle/pull/7068
- Tensor::has_nan/has_inf
  - https://github.com/PaddlePaddle/Paddle/pull/7068
- Evaluator raise exception when NAN/Inf
  - https://github.com/PaddlePaddle/Paddle/pull/7093
Rename API of DevCtx
- https://github.com/PaddlePaddle/Paddle/pull/7055
Polish Scope::LocalVarNames
- https://github.com/PaddlePaddle/Paddle/pull/7030
Speed up ColwiseSum in CPU
- https://github.com/PaddlePaddle/Paddle/pull/6834
Rewrite AdamOp
- https://github.com/PaddlePaddle/Paddle/pull/6967
Set RelWithDebInfo
- https://github.com/PaddlePaddle/Paddle/pull/6975

qijun

Multi-device:

add data layout
- https://github.com/PaddlePaddle/Paddle/pull/6832
add library type
- https://github.com/PaddlePaddle/Paddle/pull/6874
refine OpKernelType
- https://github.com/PaddlePaddle/Paddle/pull/6879
add memory switch mechanism in operator kernel switch
- https://github.com/PaddlePaddle/Paddle/pull/6991
cache memory in local scope
- https://github.com/PaddlePaddle/Paddle/pull/7058

Fix and Enhance

update support new device docs
- https://github.com/PaddlePaddle/Paddle/pull/6963
remove unused place
- https://github.com/PaddlePaddle/Paddle/pull/6972

luotao

remove unused usage_stat script: https://github.com/PaddlePaddle/Paddle/pull/6880
unify the indentation of license: https://github.com/PaddlePaddle/Paddle/pull/7022
refine CMakeLists.txt when add op need DEPS: https://github.com/PaddlePaddle/Paddle/pull/7067
MKL
- update alexnet training data: https://github.com/PaddlePaddle/Paddle/pull/6878
- Add "download mklml failed" into FAQ: https://github.com/PaddlePaddle/Paddle/pull/7009
- code review:
  - enable alexnet benchmark: https://github.com/PaddlePaddle/Paddle/pull/6852
  - use small samples to infer openblas: https://github.com/PaddlePaddle/Paddle/pull/6755
  - enable MKL Packed Recurrent Layer: https://github.com/PaddlePaddle/Paddle/pull/6719
doc
- add python doc for sequence_pool: https://github.com/PaddlePaddle/Paddle/pull/6787, https://github.com/PaddlePaddle/Paddle/pull/6981
- code review:

fengjiayi

Complete refactor of backward
- https://github.com/PaddlePaddle/Paddle/pull/6741
Update DataFeeder and inference model io according to users' feedback
- https://github.com/PaddlePaddle/Paddle/pull/7073
- https://github.com/PaddlePaddle/Paddle/pull/7036
Other improving and fixes:
Reviews:
- https://github.com/PaddlePaddle/Paddle/pull/6797

ranqiu

Update doc of V2 api

https://github.com/PaddlePaddle/Paddle/pull/6654

https://github.com/PaddlePaddle/Paddle/pull/6940
performance validation of understand_sentiment in fluid

https://github.com/PaddlePaddle/Paddle/pull/7004

https://github.com/PaddlePaddle/Paddle/issues/7046

https://github.com/PaddlePaddle/Paddle/issues/7054

https://github.com/PaddlePaddle/Paddle/issues/7096

Zhangchao

Add gpu support for NCE_layer.

https://github.com/PaddlePaddle/Paddle/pull/7077
[WIP] Implement adaptive softmax.
Book.04 word2vec speed performance comparison with V2.

https://github.com/PaddlePaddle/Paddle/issues/7087

https://github.com/PaddlePaddle/Paddle/issues/7088

daming-lu

setup onnx environment and learned how it should interact with VisualDL
finished graph data design for graph in VisualDL
add edges to graph proto so that frontend can render more easily (WIP)
- https://github.com/PaddlePaddle/VisualDL/issues/38
updated data format design for VisualDL
- https://github.com/PaddlePaddle/VisualDL/pull/17

Yancey1989(Yan Xu)

Serialize and Deserialize SelectedRows, https://github.com/PaddlePaddle/Paddle/pull/7042
BlockingCounter for ThreadPool, https://github.com/PaddlePaddle/Paddle/pull/7000
Bug fix
- install python-tk, https://github.com/PaddlePaddle/Paddle/pull/7095
PR Review:
- https://github.com/PaddlePaddle/Paddle/pull/6983#pullrequestreview-85482289
- https://github.com/PaddlePaddle/Paddle/pull/6954#pullrequestreview-85452117

Dang Qingqing

Profiling:
- Refine the activation type getting in the LSTM operator to speed.
  - https://github.com/PaddlePaddle/Paddle/pull/6996
- Speed data reader for IMDB dataset.
  - https://github.com/PaddlePaddle/Paddle/pull/7002
- Optimize the rowwise add function.
  - https://github.com/PaddlePaddle/Paddle/pull/7047
- Speed based on three statcked LSTM model:
  - GPU: 166.95994s -> 87.30287s
  - CPU: 385.2211s -> 294.90407s
Benchmark Model:
- Make the ResNet of TensorFlow consistent with Paddle
  - https://github.com/dzhwinter/benchmark/pull/36
Mobile:
- Updating PR: https://github.com/PaddlePaddle/Paddle/pull/6802/commits/f14986674015bdc823f106fe1b4e4920d37655d5
Code Review:

wangmeng

Implement ResNeXt for image classification
- https://github.com/PaddlePaddle/models/pull/559
Working on SENet [WIP]

qiaolongfei

Fluid

Muiti Device
Code optimize
- fix math_function warning
Review

VisualDL

wanghaoshuang

Doc:
- Polish accuracy doc: https://github.com/PaddlePaddle/Paddle/pull/7091
- Fix transpose op doc: https://github.com/PaddlePaddle/Paddle/pull/7020
Models test:
- Use 'time' monitor resources while running train model
  - https://github.com/PaddlePaddle/regtest/pull/14
- Add script to analysis train log
  - https://github.com/PaddlePaddle/regtest/pull/13

Yibing Liu

VGG16 performance comparison with TensorFlow
- Convergence comparison with TF on CPU
  - https://github.com/PaddlePaddle/Paddle/issues/6944
- Speed comparison with TF on CPU
  - https://github.com/PaddlePaddle/Paddle/issues/6911
- Internal convergence comparison on CPU and GPU
  - https://github.com/PaddlePaddle/Paddle/issues/6945
- Memory allocation comparison with TF
  - https://github.com/PaddlePaddle/Paddle/issues/6968
- Update and merge the VGG16 benchmark scripts
  - https://github.com/dzhwinter/benchmark/pull/20
Add the parsing part for the profiling tool
- https://github.com/PaddlePaddle/Paddle/pull/7043
Polish the doc of cross_entropy_op
- https://github.com/PaddlePaddle/Paddle/issues/7018
Fix two docs' problem
- https://github.com/PaddlePaddle/Paddle/issues/7071
Code Review:
- Add design documentation for profiling tool
- unify the indentation of license

zhaochengduo

PR

Refine cos-sim-op
- https://github.com/PaddlePaddle/Paddle/pull/6601
Refine sgd-op
- https://github.com/PaddlePaddle/Paddle/pull/6913
Add conv2d_python doc
- https://github.com/PaddlePaddle/Paddle/pull/6850
Fix embedding example
- https://github.com/PaddlePaddle/Paddle/pull/6956

Performance analysis: ResNet and VGG16

Review

remove GPU Sync Interface
- https://github.com/PaddlePaddle/Paddle/pull/6793
Refine CUDA profiler and delete the test file
- https://github.com/PaddlePaddle/Paddle/pull/6715
Use for_range to rewrite adam
- https://github.com/PaddlePaddle/Paddle/pull/6967
Speed data reader for IMDB dataset.
- https://github.com/PaddlePaddle/Paddle/pull/7002
Optimize the rowwise add function
- https://github.com/PaddlePaddle/Paddle/pull/7047
Add vgg16 benchmark configuration
- https://github.com/dzhwinter/benchmark/pull/20

sweetsky0901(tiantian)

detection_output op（for SSD, doing, code review
- https://github.com/PaddlePaddle/Paddle/pull/6488
norm op doing doing, code review
- https://github.com/PaddlePaddle/Paddle/issues/6561
run caffe ssd demo

Liu Yiqun

Framework
- Add a simple C++ inference example for fluid
  - https://github.com/PaddlePaddle/Paddle/pull/7097
Mobile
- Always link protobuf-lite for mobile inference
  - https://github.com/PaddlePaddle/Paddle/pull/6828

wanghaox

Multi box loss operator: https://github.com/PaddlePaddle/Paddle/pull/6946
code review:
- https://github.com/PaddlePaddle/Paddle/pull/6488
- https://github.com/PaddlePaddle/Paddle/pull/6881
run paddle v2 SSD demo

Yan Chunwei

fluid
- print op
VisualDL with @longfei @daming
models ci with @haoshuang reviews https://github.com/PaddlePaddle/regtest/pull/8#pullrequestreview-85679209 https://github.com/PaddlePaddle/regtest/pull/9#pullrequestreview-85760472

gongweibao

improve send/recv op:
- single thread block => async: https://github.com/PaddlePaddle/Paddle/compare/develop...gongweibao:asyncsendrecv?expand=1
Fix bugs:
- create vars bugs: https://github.com/PaddlePaddle/Paddle/pull/7060
- Fix demo code bug in usage doc: https://github.com/PaddlePaddle/cloud/pull/535
ISSUE:
- https://github.com/PaddlePaddle/Paddle/issues/7037
code review:

typhoonzero(wuyi)

refine distributed transpiler
add scatter functors

dongzhihong

Fluid
- add DataType Transform
  - https://github.com/PaddlePaddle/Paddle/pull/7079
- Fix ThreadPool
  - https://github.com/PaddlePaddle/Paddle/pull/7017
- add multi kernel register
  - https://github.com/PaddlePaddle/Paddle/pull/6998
- add data layout in Tensor
  - https://github.com/PaddlePaddle/Paddle/pull/6955
- switch GPUPlace with CUDAPlace
  - https://github.com/PaddlePaddle/Paddle/pull/6960
- fix/copyfrom context
  - https://github.com/PaddlePaddle/Paddle/pull/6954
- refine op_kernel key
  - https://github.com/PaddlePaddle/Paddle/pull/6932
- remove GPU Sync interface
  - https://github.com/PaddlePaddle/Paddle/pull/6793
- switch Operaterbase Run with place/ Global DeviceContext
  - https://github.com/PaddlePaddle/Paddle/pull/6783
- fix/Place
  - https://github.com/PaddlePaddle/Paddle/pull/6766
Benchmark - Reviews - https://github.com/dzhwinter/benchmark/pull/35 - https://github.com/dzhwinter/benchmark/pull/34

hedaoyuan

PR & Review
- [optimized] https://github.com/PaddlePaddle/Paddle/pull/7034
- [multi-thread] https://github.com/PaddlePaddle/Paddle/pull/6751
- [CAPI-doc] https://github.com/PaddlePaddle/Paddle/pull/6596

yangyaming

Add stacked dynamic lstm model for fluid
https://github.com/dzhwinter/benchmark/pull/34
Add seq2seq model for tf
https://github.com/dzhwinter/benchmark/pull/31
Add stacked dynamic lstm model for tf
https://github.com/dzhwinter/benchmark/pull/35
Code Review
https://github.com/PaddlePaddle/Paddle/pull/6986#pullrequestreview-85519491
https://github.com/PaddlePaddle/Paddle/pull/6779#pullrequestreview-85241336

⚠️ GitHub.com Fallback ⚠️