In contrast to human vision, common recognition algorithms often fail on partially occluded images. We propose characterizing, empirically, the algorithmic limits by finding a minimal recognizable patch (MRP) that is by itself sufficient to recognize the image. A specialized deep network allows us to find the most informative patches of a given size, and serves as an experimental tool. A human vision study recently characterized related (but different) minimally recognizable configurations (MIRCs) [1], for which we specify computational analogues (denoted cMIRCs). The drop in human decision accuracy associated with size reduction of these MIRCs is substantial and sharp. Interestingly, such sharp reductions were also found for the computational versions we specified.
翻译:与人类的视觉不同,共同的认知算法往往在部分隐蔽图像上失败。 我们通过寻找一个起码的可识别补丁(MRP)来提出算法限制的定性(MRP),该补丁本身足以识别图像。一个专门的深层网络让我们能找到一个特定尺寸的最丰富的信息补丁,并作为一种实验工具。 人类的视觉研究最近将相关的(但不同的)最低可识别配置[1]定性为(我们为此指定了计算类比(注意的cMIRCs ) 。 与这些MIRC的体积缩小相关的人类决策准确性下降幅度巨大且明显。 有趣的是,我们所指定的计算版本也出现了这种急剧的缩减。