Salient object detection (SOD), which simulates the human visual perception system to locate the most attractive object(s) in a scene, has been widely applied to various computer vision tasks. Now, with the advent of depth sensors, depth maps with affluent spatial information that can be beneficial in boosting the performance of SOD, can easily be captured. Although various RGB-D based SOD models with promising performance have been proposed over the past several years, an in-depth understanding of these models and challenges in this topic remains lacking. In this paper, we provide a comprehensive survey of RGB-D based SOD models from various perspectives, and review related benchmark datasets in detail. Further, considering that the light field can also provide depth maps, we review SOD models and popular benchmark datasets from this domain as well. Moreover, to investigate the SOD ability of existing models, we carry out a comprehensive evaluation, as well as attribute-based evaluation of several representative RGB-D based SOD models. Finally, we discuss several challenges and open directions of RGB-D based SOD for future research. All collected models, benchmark datasets, source code links, datasets constructed for attribute-based evaluation, and codes for evaluation will be made publicly available at https://github.com/taozh2017/RGBDSODsurvey
翻译:模拟人类视觉感知系统以在现场定位最有吸引力的物体的高度物体探测(SOD),已广泛应用于各种计算机视觉任务;现在,随着深度传感器的到来,可以很容易地捕捉到具有有利于提高SOD性能的丰富空间信息的深度地图;虽然在过去几年里提出了各种基于RGB-D的、具有良好性能的SOD模型,但在这个主题上仍然缺乏对这些模型和挑战的深入了解;在本文件中,我们从各种角度对基于RGB-D的SOD模型进行了全面调查,并详细审查了有关基准数据集;此外,考虑到光场还可以提供深度地图,我们审查SOD模型和能够有助于提高SOD性能的流行基准数据集;此外,为了调查现有模型的SOD能力,我们进行了全面评价,并对若干具有代表性的基于RGB-D的SOD模型进行了基于属性的评价;最后,我们讨论了基于RGB-DOD的SOD模型的若干挑战和公开方向。所有收集的模型、基准数据集、源码链接、来自该领域的、用于公开评价的ASUDRB/SU的数据集,将建立用于在公开的GB/SUDSUD/SUDSUD/G17。