As a natural way for human-computer interaction, fixation provides a promising solution for interactive image segmentation. In this paper, we focus on Personal Fixations-based Object Segmentation (PFOS) to address issues in previous studies, such as the lack of appropriate dataset and the ambiguity in fixations-based interaction. In particular, we first construct a new PFOS dataset by carefully collecting pixel-level binary annotation data over an existing fixation prediction dataset, such dataset is expected to greatly facilitate the study along the line. Then, considering characteristics of personal fixations, we propose a novel network based on Object Localization and Boundary Preservation (OLBP) to segment the gazed objects. Specifically, the OLBP network utilizes an Object Localization Module (OLM) to analyze personal fixations and locates the gazed objects based on the interpretation. Then, a Boundary Preservation Module (BPM) is designed to introduce additional boundary information to guard the completeness of the gazed objects. Moreover, OLBP is organized in the mixed bottom-up and top-down manner with multiple types of deep supervision. Extensive experiments on the constructed PFOS dataset show the superiority of the proposed OLBP network over 17 state-of-the-art methods, and demonstrate the effectiveness of the proposed OLM and BPM components. The constructed PFOS dataset and the proposed OLBP network are available at https://github.com/MathLee/OLBPNet4PFOS.
翻译:作为人类计算机互动的一种自然方式,固定为交互式图像分割提供了一个有希望的解决方案。在本文中,我们侧重于个人固定和边界保护(OLBP)的新网络,以分割凝视对象。具体地说,OLBP网络利用目标定位模块分析个人固定和根据解释定位凝视物体。然后,边界保护模块(BPM)旨在引入更多的边界信息,以保障凝视物体的完整性。此外,OLBP以混合的底部和上下部方式组织,并进行多种深度监督。 建设的全氟辛烷磺酸定位模块(OLM)和拟议建立的全氟辛烷磺酸定位网络17的优势。