Deep convolutional neural networks have become a key element in the recent breakthrough of salient object detection. However, existing CNN-based methods are based on either patch-wise (region-wise) training and inference or fully convolutional networks. Methods in the former category are generally time-consuming due to severe storage and computational redundancies among overlapping patches. To overcome this deficiency, methods in the second category attempt to directly map a raw input image to a predicted dense saliency map in a single network forward pass. Though being very efficient, it is arduous for these methods to detect salient objects of different scales or salient regions with weak semantic information. In this paper, we develop hybrid contrast-oriented deep neural networks to overcome the aforementioned limitations. Each of our deep networks is composed of two complementary components, including a fully convolutional stream for dense prediction and a segment-level spatial pooling stream for sparse saliency inference. We further propose an attentional module that learns weight maps for fusing the two saliency predictions from these two streams. A tailored alternate scheme is designed to train these deep networks by fine-tuning pre-trained baseline models. Finally, a customized fully connected CRF model incorporating a salient contour feature embedding can be optionally applied as a post-processing step to improve spatial coherence and contour positioning in the fused result from these two streams. Extensive experiments on six benchmark datasets demonstrate that our proposed model can significantly outperform the state of the art in terms of all popular evaluation metrics.
翻译:深相神经网络已成为最近显著天体探测突破中的一个关键要素。 但是,现有CNN使用的方法要么基于补丁(区域)培训和推断,要么基于补丁(区域)培训和推断,要么基于完全进化网络。前一类方法通常耗时,因为相互重叠的偏差存在严重的储存和计算重复。为了克服这一缺陷,第二类方法试图直接将原始输入图像映射成一个预测的密度显著的单一网络前行图。这些方法虽然效率很高,但很难发现不同规模或突出区域中语言信息薄弱的突出对象。在本文件中,我们开发了以对比为导向的深层神经网络,以克服上述限制。我们每个深层网络由两个互补部分组成,包括用于密集预测的完全进化流和用于低度突出推断的分层空间集合流。我们进一步提议了一个关注模型,用来学习从这两条星流中测出的两个突出度预测的重量图。一个定制的替代方案旨在用精细的模型来训练这些深度网络,用精细的精细的精细度外向外选的深神经网络来克服上述限制的深度基准模型模型。最后,可以将一个完全定制的精确地将精确的精确的基模型用于将精确的精确的精确的基底基模化的基模化的基模模模化的基模化的模型用于。