We present a multitask network that supports various deep neural network based pedestrian detection functions. Besides 2D and 3D human pose, it also supports body and head orientation estimation based on full body bounding box input. This eliminates the need for explicit face recognition. We show that the performance of 3D human pose estimation and orientation estimation is comparable to the state-of-the-art. Since very few data sets exist for 3D human pose and in particular body and head orientation estimation based on full body data, we further show the benefit of particular simulation data to train the network. The network architecture is relatively simple, yet powerful, and easily adaptable for further research and applications.
翻译:我们提出了一个多任务网络,支持各种基于深神经网络的行人探测功能。除了2D和3D人造外,它还支持基于全身捆绑框输入的体形和头部定向估计,这就不需要明确面部识别。我们表明3D人构成估计和方向估计的性能与最新技术相当。由于3D人构成的数据集很少,特别是基于完整身体数据的身体和头部定向估计,我们进一步展示了特定模拟数据对培训网络的好处。网络结构相对简单,但强大,便于进一步研究和应用。