Image classification with small datasets has been an active research area in the recent past. However, as research in this scope is still in its infancy, two key ingredients are missing for ensuring reliable and truthful progress: a systematic and extensive overview of the state of the art, and a common benchmark to allow for objective comparisons between published methods. This article addresses both issues. First, we systematically organize and connect past studies to consolidate a community that is currently fragmented and scattered. Second, we propose a common benchmark that allows for an objective comparison of approaches. It consists of five datasets spanning various domains (e.g., natural images, medical imagery, satellite data) and data types (RGB, grayscale, multispectral). We use this benchmark to re-evaluate the standard cross-entropy baseline and ten existing methods published between 2017 and 2021 at renowned venues. Surprisingly, we find that thorough hyper-parameter tuning on held-out validation data results in a highly competitive baseline and highlights a stunted growth of performance over the years. Indeed, only a single specialized method dating back to 2019 clearly wins our benchmark and outperforms the baseline classifier.
翻译:利用小型数据集进行图像分类是最近一个积极的研究领域。然而,由于这一范围的研究仍处于初级阶段,因此缺乏确保可靠和真实进展的两个关键要素:对最新技术状况的系统而广泛的概览,以及用于对已公布方法进行客观比较的共同基准。这一条涉及这两个问题。首先,我们系统地组织和连接过去的研究,以巩固一个目前分散和分散的社区。第二,我们提出了一个共同的基准,以便客观地比较各种方法。它由五个数据集组成(例如自然图像、医疗图像、卫星数据)和数据类型(RGB、灰度、多光谱)。我们利用这一基准重新评估2017至2021年期间公布的标准跨作物基线和10种现有方法。令人惊讶的是,我们发现对搁置的验证数据进行彻底的超参数调整的结果是一个高度竞争的基线,并突出多年来业绩的缓慢增长。事实上,只有一种可追溯到2019年的单一专门方法,明确赢得了我们的基准,超越了基线的分类。