Salient human detection (SHD) in dynamic 360{\deg} immersive videos is of great importance for various applications such as robotics, inter-human and human-object interaction in augmented reality. However, 360{\deg} video SHD has been seldom discussed in the computer vision community due to a lack of datasets with large-scale omnidirectional videos and rich annotations. To this end, we propose SHD360, the first 360{\deg} video SHD dataset which contains various real-life daily scenes. Our SHD360 provides six-level hierarchical annotations for 6,268 key frames uniformly sampled from 37,403 omnidirectional video frames at 4K resolution. Specifically, each collected frame is labeled with a super-class, a sub-class, associated attributes (e.g., geometrical distortion), bounding boxes and per-pixel object-/instance-level masks. As a result, our SHD360 contains totally 16,238 salient human instances with manually annotated pixel-wise ground truth. Since so far there is no method proposed for 360{\deg} image/video SHD, we systematically benchmark 11 representative state-of-the-art salient object detection (SOD) approaches on our SHD360, and explore key issues derived from extensive experimenting results. We hope our proposed dataset and benchmark could serve as a good starting point for advancing human-centric researches towards 360{\deg} panoramic data. Our dataset and benchmark is publicly available at https://github.com/PanoAsh/SHD360.
翻译:在动态360(deg) 暗淡视频中,360 人类探测(SHD) 动态360 (SHD) 在动态360 (deg) 中,对于各种应用,例如机器人、人与人之间和人与人之间的相互作用,在扩大现实中具有非常重要的意义。然而,由于缺少带有大规模全向视频和丰富的注释的数据集,360 视频 SHD 很少在计算机视觉界中讨论360 (SHD) 视频 SHD 。 为此,我们提议SHD360, 首个包含各种真实生活每天场景的360 视频 SHD360 视频数据集。 我们的SHD360 提供了6 268 个关键级说明, 统一样本来自 4K 分辨率的37 403 omnidial-bective 视频框。 具体来说, 所收集的每个框架都有一个超级类、 亚类、 相关属性的数据集(例如,几何扭曲)、 框和每平ixel 对象- descrical 目标事实。 因此,我们没有提出用于Sdrive-rental dressal ladeal rodeal rodustrational roductional rodustration lablegalal rodustrational roduction