视觉搜索不对称:深网和人类共享相似的固有分界线 (Visual Search Asymmetry: Deep Nets and Humans Share Similar Inherent Biases)

Visual search is a ubiquitous and often challenging daily task, exemplified by looking for the car keys at home or a friend in a crowd. An intriguing property of some classical search tasks is an asymmetry such that finding a target A among distractors B can be easier than finding B among A. To elucidate the mechanisms responsible for asymmetry in visual search, we propose a computational model that takes a target and a search image as inputs and produces a sequence of eye movements until the target is found. The model integrates eccentricity-dependent visual recognition with target-dependent top-down cues. We compared the model against human behavior in six paradigmatic search tasks that show asymmetry in humans. Without prior exposure to the stimuli or task-specific training, the model provides a plausible mechanism for search asymmetry. We hypothesized that the polarity of search asymmetry arises from experience with the natural environment. We tested this hypothesis by training the model on an augmented version of ImageNet where the biases of natural images were either removed or reversed. The polarity of search asymmetry disappeared or was altered depending on the training protocol. This study highlights how classical perceptual properties can emerge in neural network models, without the need for task-specific training, but rather as a consequence of the statistical properties of the developmental diet fed to the model. All source code and stimuli are publicly available https://github.com/kreimanlab/VisualSearchAsymmetry

翻译：视觉搜索是一种无处不在且往往具有挑战性的日常任务,例如在家里寻找汽车钥匙或者人群中的朋友。一些古典搜索任务的一个令人感兴趣的属性是不对称的,因此在分流器B中找到目标A比在A中找到目标B容易。为了阐明对视觉搜索不对称负责的机制,我们提议了一个计算模型,将一个目标和搜索图像作为投入,并生成一个视觉运动序列,直到找到目标。模型将偏心依赖直观的视觉识别与取决于目标的自上而下提示结合起来。我们在显示人类不对称的六种模式搜索任务中比较了人类行为模型。在不事先接触刺激或任务特定培训的情况下,该模型为搜索不对称提供了一种貌似合理的机制。我们假设搜索不对称的极性产生于自然环境的经验。我们测试了这一假设,在图像网络的扩大版本中,自然图像的偏差被删除或逆转。搜索偏差的极性消失或根据培训协议而改变。我们比较了人类的模型/偏差性,但需要根据培训协议。本研究强调,在不事先接触刺激或任务培训模型的情况下,A级的直观/直观性特性可以形成一个统计模型,因此,所有统计学特性,从而可以形成一个统计学模型。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【SIGIR2020】策略感知的无偏排序学习—Top-K排序，Policy-Aware Unbiased Learning to Rank for Top-𝑘 Rankings

专知会员服务

27+阅读 · 2020年6月10日