1702.02447 - hassony2/inria-research-wiki GitHub Wiki

Arxiv 2017

Hengkai Guo, Guijin Wang, Xinghao Chen, Cairong Zhang, Fei Qiao, Huazhong Yang

read 29/05/2017

Directly regresses the 3D coordinates of the hand joints from a single depth image using a tree-structured Region Ensemble Network (REN)

Output: a 3*J vector holding the 3D world coordinates of the J hand joints
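
As a small illustration of this output format, the flat 3*J vector can be viewed as one (x, y, z) row per joint. The joint count `J = 14` and the interleaved x, y, z ordering are assumptions made for the example, not details taken from these notes.

```python
import numpy as np

J = 14  # hypothetical joint count, chosen only for illustration

# Stand-in for a network output: a flat vector of length 3*J.
flat = np.arange(3 * J, dtype=np.float32)

# Assumed layout: (x1, y1, z1, x2, y2, z2, ...), one xyz triple per joint.
joints = flat.reshape(J, 3)

print(joints.shape)  # (J, 3): one row of world coordinates per joint
```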

extract depth cube around hand
Depth normalized to [-1, 1]
uniformly divide the convnet feature maps into an n x n grid (n=2 in practice); feed each grid cell into its own FC layers (branches)
features from the last layer of each branch are concatenated
regression layer to predict outputs
end-to-end training
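
The steps above can be sketched in plain NumPy as a forward pass. This is a simplified sketch, not the authors' Caffe implementation: it assumes non-overlapping equal grid cells, a single FC layer with ReLU per branch, and random weights. The names `normalize_depth`, `split_grid`, `region_ensemble`, `center_depth` and `half_size` are hypothetical, as are all the sizes used.

```python
import numpy as np

def normalize_depth(cube, center_depth, half_size):
    """Map depth values of a cube around the hand to [-1, 1] (assumed scheme)."""
    return np.clip((cube - center_depth) / half_size, -1.0, 1.0)

def split_grid(fmap, n=2):
    """Split a (C, H, W) feature map into n*n equal regions (H, W divisible by n)."""
    _, h, w = fmap.shape
    rh, rw = h // n, w // n
    return [fmap[:, i * rh:(i + 1) * rh, j * rw:(j + 1) * rw]
            for i in range(n) for j in range(n)]

def region_ensemble(fmap, branch_weights, out_weight, n=2):
    """Run one FC branch per grid region, concatenate, then regress 3*J values."""
    feats = []
    for region, (w, b) in zip(split_grid(fmap, n), branch_weights):
        x = region.reshape(-1)                    # flatten the region
        feats.append(np.maximum(w @ x + b, 0.0))  # FC layer + ReLU (one branch)
    fused = np.concatenate(feats)                 # ensemble of branch features
    w_out, b_out = out_weight
    return w_out @ fused + b_out                  # final regression layer

# Toy usage with hypothetical sizes.
rng = np.random.default_rng(0)
C, H, W, hidden, J, n = 8, 12, 12, 16, 14, 2
fmap = rng.standard_normal((C, H, W))             # stand-in for convnet features
region_dim = C * (H // n) * (W // n)
branch_weights = [(rng.standard_normal((hidden, region_dim)) * 0.01,
                   np.zeros(hidden)) for _ in range(n * n)]
out_weight = (rng.standard_normal((3 * J, n * n * hidden)) * 0.01,
              np.zeros(3 * J))
pred = region_ensemble(fmap, branch_weights, out_weight, n)
print(pred.shape)
```

In end-to-end training, the branch weights, the regression layer and the convnet producing `fmap` would all be learned jointly rather than sampled at random as here.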

Claims state of the art

Implemented in Caffe