Detection of human body and its parts (e.g., head or hands) has been intensively studied. However, most of these CNNs-based detectors are trained independently, making it difficult to associate detected parts with body. In this paper, we focus on the joint detection of human body and its corresponding parts. Specifically, we propose a novel extended object representation integrating center-offsets of body parts, and construct a dense one-stage generic Body-Part Joint Detector (BPJDet). In this way, body-part associations are neatly embedded in a unified object representation containing both semantic and geometric contents. Therefore, we can perform multi-loss optimizations to tackle multi-tasks synergistically. BPJDet does not suffer from error-prone post matching, and keeps a better trade-off between speed and accuracy. Furthermore, BPJDet can be generalized to detect any one or more body parts. To verify the superiority of BPJDet, we conduct experiments on three body-part datasets (CityPersons, CrowdHuman and BodyHands) and one body-parts dataset COCOHumanParts. While keeping high detection accuracy, BPJDet achieves state-of-the-art association performance on all datasets comparing with its counterparts. Besides, we show benefits of advanced body-part association capability by improving performance of two representative downstream applications: accurate crowd head detection and hand contact estimation. Code is released in https://github.com/hnuzhy/BPJDet.
翻译:摘要:人体及其部位(例如头或手)的检测已经得到了广泛的研究。然而,大多数基于卷积神经网络的检测器是独立训练的,这使得难以将检测到的部位与身体相关联。在本文中,我们专注于人体及其相应部位的联合检测。具体而言,我们提出了一种新颖的扩展对象表示,该表示集成了身体部位的中心偏移量,并构建了一种密集的单阶段通用身体部位联合检测器(BPJDet)。通过这种方式,身体部位联系被整齐地嵌入到统一的对象表示中,其中包含语义和几何内容。因此,我们可以执行多丢失优化来协同解决多任务。 BPJDet 不会受到容易出错的后处理匹配的影响,并保持更好的速度和准确性的折衷。此外,BPJDet 可以推广到检测任何一个或多个身体部位。为了验证 BPJDet 的优越性,我们在三个身体部位数据集(CityPersons、CrowdHuman 和BodyHands)和一个身体部位数据集 COCOHumanParts 上进行了实验。在保持高检测精度的同时,BPJDet 与其他相应方法相比,在所有数据集上都实现了最新的联合性能。此外,我们通过提高两个代表性下游应用的性能:准确的人群头部检测和手接触估计,展示了先进的身体部位关联能力的好处。代码已发布于https://github.com/hnuzhy/BPJDet。