Neural networks today often recognize objects as well as people do, and thus might serve as models of the human recognition process. However, most such networks provide their answer after a fixed computational effort, whereas human reaction time varies, e.g. from 0.2 to 10 s, depending on the properties of stimulus and task. To model the effect of difficulty on human reaction time, we considered a classification network that uses early-exit classifiers to make anytime predictions. Comparing human and MSDNet accuracy in classifying CIFAR-10 images in added Gaussian noise, we find that the network equivalent input noise SD is 15 times higher than human, and that human efficiency is only 0.6\% that of the network. When appropriate amounts of noise are present to bring the two observers (human and network) into the same accuracy range, they show very similar dependence on duration or FLOPS, i.e. very similar speed-accuracy tradeoff. We conclude that Anytime classification (i.e. early exits) is a promising model for human reaction time in recognition tasks.
翻译:今天,神经网络往往与人们一样,都承认物体,因此可以作为人类识别过程的模型。然而,大多数这类网络在固定的计算努力之后提供答案,而人类反应时间则各不相同,例如,根据刺激和任务的性质,从0.2到10秒不等。为模拟困难对人类反应时间的影响,我们考虑了一个分类网络,使用提前出境分类器进行随时预测。在将CIFAR-10图像分类时,将人类和MSDNet的准确性与加载高山噪音作比较,我们发现网络等同的输入噪音SD比人类高15倍,而人类的效率仅为网络的0.6 ⁇ 。当出现适当数量的噪音,使两个观察者(人类和网络)进入相同的准确范围时,它们表现出非常相似地依赖时间或FLOPS,即非常相似的速度-准确性交换。我们的结论是,在识别任务中,任何时间分类(即早期退出)都是人类反应时间的一个很有希望的模式。