TY - JOUR
T1 - Fixing the root node
T2 - Efficient tracking and detection of 3D human pose through local solutions
AU - Daubney, Ben
AU - Xie, Xianghua
AU - Deng, Jingjing
AU - MacParthalain, Neil
AU - Zwiggelaar, Reyer
N1 - This is the author accepted manuscript. The final version is available from Elsevier via http://dx.doi.org/10.1016/j.imavis.2016.05.010
PY - 2016/8/1
Y1 - 2016/8/1
N2 - 3D human pose estimation is a very difficult task. In this paper we propose that this problem can be more easily solved by first finding the solutions to a set of easier sub-problems. These are to locally estimate pose conditioned on a fixed root node state, which defines the global position and orientation of the person. The global solution can then be found using information extracted during this procedure. This approach has two key benefits: The first is that each local solution can be found by modeling the articulated object as a kinematic chain, which has far less degrees of freedom than alternative models. The second is that by using this approach we can represent, or support, a much larger area of the posterior than is currently possible. This allows far more robust algorithms to be implemented since there is far less pressure to prune the search space to free up computational resources. We apply this approach to two problems: The first is single frame monocular 3D pose estimation, where we propose a method to directly extract 3D pose without first extracting any intermediate 2D representation or being dependent on strong spatial prior models. The second is multi-view 3D tracking where we show that using the above technique results in an approach that is far more robust than current approaches, without relying on strong temporal prior models. In both domains we demonstrate the strength and versatility of the proposed method
AB - 3D human pose estimation is a very difficult task. In this paper we propose that this problem can be more easily solved by first finding the solutions to a set of easier sub-problems. These are to locally estimate pose conditioned on a fixed root node state, which defines the global position and orientation of the person. The global solution can then be found using information extracted during this procedure. This approach has two key benefits: The first is that each local solution can be found by modeling the articulated object as a kinematic chain, which has far less degrees of freedom than alternative models. The second is that by using this approach we can represent, or support, a much larger area of the posterior than is currently possible. This allows far more robust algorithms to be implemented since there is far less pressure to prune the search space to free up computational resources. We apply this approach to two problems: The first is single frame monocular 3D pose estimation, where we propose a method to directly extract 3D pose without first extracting any intermediate 2D representation or being dependent on strong spatial prior models. The second is multi-view 3D tracking where we show that using the above technique results in an approach that is far more robust than current approaches, without relying on strong temporal prior models. In both domains we demonstrate the strength and versatility of the proposed method
KW - 3D pose estimation
KW - tracking
KW - local solutions
KW - root node
UR - http://hdl.handle.net/2160/43894
U2 - 10.1016/j.imavis.2016.05.010
DO - 10.1016/j.imavis.2016.05.010
M3 - Article
SN - 0262-8856
VL - 52
SP - 73
EP - 87
JO - Image and Vision Computing
JF - Image and Vision Computing
ER -