学习一个适应功能以评估图像视觉相似性 (Learning an Adaptation Function to Assess Image Visual Similarities)

Human perception is routinely assessing the similarity between images, both for decision making and creative thinking. But the underlying cognitive process is not really well understood yet, hence difficult to be mimicked by computer vision systems. State-of-the-art approaches using deep architectures are often based on the comparison of images described as feature vectors learned for image categorization task. As a consequence, such features are powerful to compare semantically related images but not really efficient to compare images visually similar but semantically unrelated. Inspired by previous works on neural features adaptation to psycho-cognitive representations, we focus here on the specific task of learning visual image similarities when analogy matters. We propose to compare different supervised, semi-supervised and self-supervised networks, pre-trained on distinct scales and contents datasets (such as ImageNet-21k, ImageNet-1K or VGGFace2) to conclude which model may be the best to approximate the visual cortex and learn only an adaptation function corresponding to the approximation of the the primate IT cortex through the metric learning framework. Our experiments conducted on the Totally Looks Like image dataset highlight the interest of our method, by increasing the retrieval scores of the best model @1 by 2.25x. This research work was recently accepted for publication at the ICIP 2021 international conference [1]. In this new article, we expand on this previous work by using and comparing new pre-trained feature extractors on other datasets.

翻译：人类的感知是常规地评估图像之间的相似性,既用于决策,也用于创造性思维。但基本的认知过程尚未真正被人们很好地理解,因此难以被计算机视觉系统模仿。使用深层结构的最先进的方法往往基于比较被描述为为为图像分类任务而学习的特质矢量的图像。因此,这些特征对于比较与语义相关的图像是强大的,但对于比较视觉相似但与语义无关的图像来说,效果并不十分有效。受先前关于神经特征适应心理认知表征的工作的启发,我们在此集中研究在类比问题时学习视觉图像相似性的具体任务。我们提议比较不同的受监督、半监督和自我监督的网络。我们建议对不同的尺度和内容数据集(例如图像Net-21k、图像Net-1K或VGGGFace2)进行预先训练,从而得出哪种模型可能是最接近视觉皮质的图像,并且只学习与通过新的计量框架对原始信息技术皮质进行近似的适应性功能。我们在这里进行的实验是在完全使用像图像特性的图象性图象模型上进行对比。我们最近通过改进了国际上的图象数据的图象数据的图象学研究1,通过增加了我们之前的图象学的图象学的图象学研究,从而增加了我们的图象学的图象学的图象学的图象学研究。通过最近的方法,从而增加了我们对20的图象的图象学的图象学的图象学的图象学的图象学的图象学的图理学的图象研究,从而增加了了我们对了我们的图象。