Unsupervised image segmentation is an important task in many real-world scenarios where labelled data is of scarce availability. In this paper we propose a novel approach that harnesses recent advances in unsupervised learning using a combination of Mutual Information Maximization (MIM), Neural Superpixel Segmentation and Graph Neural Networks (GNNs) in an end-to-end manner, an approach that has not been explored yet. We take advantage of the compact representation of superpixels and combine it with GNNs in order to learn strong and semantically meaningful representations of images. Specifically, we show that our GNN based approach allows to model interactions between distant pixels in the image and serves as a strong prior to existing CNNs for an improved accuracy. Our experiments reveal both the qualitative and quantitative advantages of our approach compared to current state-of-the-art methods over four popular datasets.
翻译:在许多被贴上标签的数据难以获得的现实情景中,未经监督的图像分割是一个重要的任务。 在本文中,我们提出一种新的方法,利用以端对端方式结合相互信息最大化(MIM)、神经超像分解和图形神经网络(GNNS),利用无监督学习的最新进展,这一方法尚未得到探讨。我们利用超级像素的缩缩写,并将其与GNNs相结合,以便学习强有力和语义上有意义的图像表述。具体地说,我们基于GNN的方法可以模拟图像中遥远的像素之间的相互作用,并成为现有CNN之前的强大工具,以便提高准确性。我们的实验揭示了我们方法在质量和数量上与目前最先进的方法相比在四个流行数据集上的优势。