Most of the existing works on pedestrian pose estimation do not consider estimating the pose of an occluded pedestrian, as the annotations of the occluded parts are not available in relevant automotive datasets. For example, CityPersons, a well-known dataset for pedestrian detection in automotive scenes does not provide pose annotations, whereas MS-COCO, a non-automotive dataset, contains human pose estimation. In this work, we propose a multi-task framework to extract pedestrian features through detection and instance segmentation tasks performed separately on these two distributions. Thereafter, an encoder learns pose specific features using an unsupervised instance-level domain adaptation method for the pedestrian instances from both distributions. The proposed framework has improved state-of-the-art performances of pose estimation, pedestrian detection, and instance segmentation.
翻译:现有关于行人的估计多数工程没有考虑估计隐蔽行人的形象,因为有关的汽车数据集中没有隐蔽部分的说明。例如,在汽车场景中行人探测的众所周知的数据集《CityPersons》没有提供说明,而非汽车数据集《MS-COCO》包含人造估计。在这项工作中,我们提议了一个多任务框架,通过检测和实例分割,分别执行这两种分布的行人特征。随后,一个编码员学习了使用未经监督的实例级域适应方法对两次分布的行人进行具体特征调整。拟议的框架改善了人造估计、行人探测和实例分割的最新性能。