Few-shot recognition involves training an image classifier to distinguish novel concepts at test time using few examples (shot). Existing approaches generally assume that the shot number at test time is known in advance. This is not realistic, and the performance of a popular and foundational method has been shown to suffer when train and test shots do not match. We conduct a systematic empirical study of this phenomenon. In line with prior work, we find that shot sensitivity is broadly present across metric-based few-shot learners, but in contrast to prior work, larger neural architectures provide a degree of built-in robustness to varying test shot. More importantly, a simple, previously known but greatly overlooked class of approaches based on cosine distance consistently and greatly improves robustness to shot variation, by removing sensitivity to sample noise. We derive cosine alternatives to popular and recent few-shot classifiers, broadening their applicability to realistic settings. These cosine models consistently improve shot-robustness, outperform prior shot-robust state of the art, and provide competitive accuracy on a range of benchmarks and architectures, including notable gains in the very-low-shot regime.
翻译:少见的识别涉及培训一个图像分类员,用几个例子(照片)来区分试验时的新概念。现有方法一般假定试验时的射击次数是事先知道的。这是不现实的,在火车和试射不匹配时,流行和基本方法的性能被证明会受到影响。我们对此现象进行了系统的实证研究。根据以往的工作,我们发现射击敏感度广泛存在于以光学为基础的少发学生中,但与以前的工作不同,更大的神经结构为不同的试射提供了一定程度的内在强健性。更重要的是,基于直径的直径的简单、以前已知但被大大忽视的一类方法,通过消除对抽样噪音的敏感度,极大地改进了拍摄变化的稳健性。我们从流行和最近几发式的分类人员那里获取了可选的替代方法,将其应用范围扩大到现实环境。这些焦素模型不断改进射击-破裂性,超越了先前的射出紫色状态,并为一系列基准和结构提供了竞争性的准确性,包括非常低镜头制度取得的显著成果。