Aiming at discovering and locating most distinctive objects from visual scenes, salient object detection (SOD) plays an essential role in various computer vision systems. Coming to the era of high resolution, SOD methods are facing new challenges. The major limitation of previous methods is that they try to identify the salient regions and estimate the accurate objects boundaries simultaneously with a single regression task at low-resolution. This practice ignores the inherent difference between the two difficult problems, resulting in poor detection quality. In this paper, we propose a novel deep learning framework for high-resolution SOD task, which disentangles the task into a low-resolution saliency classification network (LRSCN) and a high-resolution refinement network (HRRN). As a pixel-wise classification task, LRSCN is designed to capture sufficient semantics at low-resolution to identify the definite salient, background and uncertain image regions. HRRN is a regression task, which aims at accurately refining the saliency value of pixels in the uncertain region to preserve a clear object boundary at high-resolution with limited GPU memory. It is worth noting that by introducing uncertainty into the training process, our HRRN can well address the high-resolution refinement task without using any high-resolution training data. Extensive experiments on high-resolution saliency datasets as well as some widely used saliency benchmarks show that the proposed method achieves superior performance compared to the state-of-the-art methods.
翻译:在发现和定位视觉场景中最独特的物体时,显要物体探测(SOD)在各种计算机视觉系统中发挥着关键作用。在进入高分辨率时代的时代,SOD方法面临新的挑战。以前方法的主要局限性是,它们试图识别突出区域并同时估计精确的物体界限,同时进行低分辨率的单一回归任务。这种做法忽略了两个难题之间的内在差异,导致检测质量差。在本文件中,我们提议为高分辨率物体探测(SOD)任务建立一个全新的深层次学习框架,将任务分解成低分辨率分类网络和高分辨率改进网络(HRN)。作为一个像素分解的分类任务,LRSCN设计了以低分辨率捕捉足够的精度物体界限,以找出明确的显著、背景和不确定的图像区域。HRRN是一个回归任务,目的是准确地改善不确定区域内的像素的突出值,以便在高分辨率记忆中保持清晰的物体边界。值得注意的是,通过在高分辨率培训过程中引入不确定性,在高分辨率的高级分辨率试验过程中,我们使用高分辨率的精确度数据显示高分辨率的方法。