In this paper, we introduce a unique variant of the denoising Auto-Encoder and combine it with the perceptual loss to classify images in an unsupervised manner. The proposed method, called Pseudo Labelling, consists of first applying a randomly sampled set of data augmentation transformations to each training image. As a result, each initial image can be considered as a pseudo-label to its corresponding augmented ones. Then, an Auto-Encoder is used to learn the mapping between each set of the augmented images and its corresponding pseudo-label. Furthermore, the perceptual loss is employed to take into consideration the existing dependencies between the pixels in the same neighbourhood of an image. This combination encourages the encoder to output richer encodings that are highly informative of the input's class. Consequently, the Auto-Encoder's performance on unsupervised image classification is improved in terms of stability, accuracy and consistency across all tested datasets. Previous state-of-the-art accuracy on the MNIST, CIFAR-10 and SVHN datasets is improved by 0.3\%, 3.11\% and 9.21\% respectively.
翻译:在本文中, 我们引入了取消自动编码器的独特变体, 并将其与感知损失结合起来, 以不受监督的方式对图像进行分类。 名为 Pseudo Goal 的拟议方法包括首先对每张培训图像随机抽选一组数据增强变异。 因此, 每张初始图像都可以被视作与其相应的增强图像的假标签。 然后, 一个自动编码器被用于学习每套增强图像及其对应的伪标签之间的映射。 此外, 使用概念损失来考虑同一图像周边的像素之间的现有依赖性。 这种合并鼓励编码器输出输入输入类中信息丰富的编码。 因此, 自动编码器在未经监督的图像分类上的性能在稳定性、 准确性和一致性方面得到了改进。 MNIST、 CIFAR- 10 和 SVHN 数据集的先前状态精确度分别由 0. 3 / / / / / / / / / / / / / / / / / / / / 9 / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / / /