Automatic surgical instrument segmentation of endoscopic images is a crucial building block of many computer-assistance applications for minimally invasive surgery. So far, state-of-the-art approaches completely rely on the availability of a ground-truth supervision signal, obtained via manual annotation, thus expensive to collect at large scale. In this paper, we present FUN-SIS, a Fully-UNsupervised approach for binary Surgical Instrument Segmentation. FUN-SIS trains a per-frame segmentation model on completely unlabelled endoscopic videos, by solely relying on implicit motion information and instrument shape-priors. We define shape-priors as realistic segmentation masks of the instruments, not necessarily coming from the same dataset/domain as the videos. The shape-priors can be collected in various and convenient ways, such as recycling existing annotations from other datasets. We leverage them as part of a novel generative-adversarial approach, allowing to perform unsupervised instrument segmentation of optical-flow images during training. We then use the obtained instrument masks as pseudo-labels in order to train a per-frame segmentation model; to this aim, we develop a learning-from-noisy-labels architecture, designed to extract a clean supervision signal from these pseudo-labels, leveraging their peculiar noise properties. We validate the proposed contributions on three surgical datasets, including the MICCAI 2017 EndoVis Robotic Instrument Segmentation Challenge dataset. The obtained fully-unsupervised results for surgical instrument segmentation are almost on par with the ones of fully-supervised state-of-the-art approaches. This suggests the tremendous potential of the proposed method to leverage the great amount of unlabelled data produced in the context of minimally invasive surgery.
翻译:内窥镜图像的自动外科仪器分解是许多计算机辅助应用程序中用于最低入侵性手术的关键组成部分。 到目前为止,最先进的方法完全依赖通过人工批注获得的地面真实性监督信号,因此收集成本很高。 在本文中,我们介绍了FUN-SIS, 一种完全由联合国监督的二进制外科仪器分解方法。 FUN-SIS在完全无标签的内窥镜视频上, 仅依靠隐含的运动信息和仪器元件底线。 我们把形状前缀定义为工具的现实性分解掩码, 不一定来自与视频相同的数据集/主页。 形状质谱可以以各种方便的方式收集, 例如回收其他数据集的现有图纸。 我们利用它们作为新颖的基因化对抗性对抗性方法的一部分, 允许在培训期间对光学流图像进行非超导式的仪表分解。 我们随后将获得的仪器面罩作为虚拟的内嵌, 用于对服务器内部内部结构进行完全意义上的分解, 用于从内部分析结构中, 向内部结构中进行完全的货币缩缩缩略性分析。