Single-frame infrared small target (SIRST) detection aims at separating small targets from clutter backgrounds on infrared images. Recently, deep learning based methods have achieved promising performance on SIRST detection, but at the cost of a large amount of training data with expensive pixel-level annotations. To reduce the annotation burden, we propose the first method to achieve SIRST detection with single-point supervision. The core idea of this work is to recover the per-pixel mask of each target from the given single point label by using clustering approaches, which looks simple but is indeed challenging since targets are always insalient and accompanied with background clutters. To handle this issue, we introduce randomness to the clustering process by adding noise to the input images, and then obtain much more reliable pseudo masks by averaging the clustered results. Thanks to this "Monte Carlo" clustering approach, our method can accurately recover pseudo masks and thus turn arbitrary fully supervised SIRST detection networks into weakly supervised ones with only single point annotation. Experiments on four datasets demonstrate that our method can be applied to existing SIRST detection networks to achieve comparable performance with their fully supervised counterparts, which reveals that single-point supervision is strong enough for SIRST detection. Our code will be available at: https://github.com/YeRen123455/SIRST-Single-Point-Supervision.
翻译:单帧红外小目标(SIRST)检测旨在从红外图像中的杂波背景中分离出小目标。最近,基于深度学习的方法在SIRST检测方面取得了有前途的性能,但代价是需要大量的训练数据,并伴随着昂贵的像素级标注。为了减少注释负担,我们提出了第一种使用单点监督实现SIRST检测的方法。这项工作的核心思想是通过聚类方法从给定的单点标签恢复每个目标的每像素掩模,这看起来很简单,但实际上是具有挑战性的,因为目标始终是不显著的,并且伴随着背景杂波。为了解决这个问题,我们通过向输入图像添加噪声,将随机性引入到聚类过程中,然后通过平均聚类结果来获得更可靠的伪掩模。由于这种“蒙特卡罗”聚类方法,我们的方法能够准确地恢复伪掩模,从而将任意全监督的SIRST检测网络转换为只有单点注释的弱监督网络。在四个数据集上的实验证明,我们的方法可以应用于现有的SIRST检测网络中,以实现与其全监督对应物可比较的性能,这表明单点监督足以用于SIRST检测。我们的代码将在以下网址提供:https://github.com/YeRen123455/SIRST-Single-Point-Supervision。