Interactive segmentation enables users to extract masks by providing simple annotations to indicate the target, such as boxes, clicks, or scribbles. Among these interaction formats, scribbles are the most flexible as they can be of arbitrary shapes and sizes. This enables scribbles to provide more indications of the target object. However, previous works mainly focus on click-based configuration, and the scribble-based setting is rarely explored. In this work, we attempt to formulate a standard protocol for scribble-based interactive segmentation. Basically, we design diversified strategies to simulate scribbles for training, propose a deterministic scribble generator for evaluation, and construct a challenging benchmark. Besides, we build a strong framework ScribbleSeg, consisting of a Prototype Adaption Module(PAM) and a Corrective Refine Module (CRM), for the task. Extensive experiments show that ScribbleSeg performs notably better than previous click-based methods. We hope this could serve as a more powerful and general solution for interactive segmentation. Our code will be made available.
翻译:交互式分割使用户能够通过提供简单的注释来指示目标对象,例如框、点击或手写线条。在这些交互格式中,手写线条最为灵活,因为它们可以是任意形状和大小,这使得手写线条能够提供更多的目标对象指示信息。然而,先前的工作主要集中在基于点击的配置上,并且很少探索基于手写线条的设置。在这项工作中,我们尝试制定基于手写线条的交互式分割的标准协议。基本上,我们设计了多种策略来模拟训练手写线条,并提出了一个确定性的手写线条生成器进行评估,同时构建了一个具有挑战性的基准。此外,我们构建了一个强大的框架 ScribbleSeg,由原型适应模块(PAM)和校正细化模块(CRM)组成。经过广泛的实验证明,ScribbleSeg的性能显著优于先前基于点击的方法。我们希望这可以作为交互式分割的更强大和通用的解决方案。我们的代码将提供给大众使用。