Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a "long-tailed" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR.
翻译:胸前X光片等成像测试将产生一小套共同发现和一系列大得多的不常见发现。 受过训练的放射学家可以通过研究几个具有代表性的例子来了解稀有条件的直观展示, 教一台机器从这种“ 长尾” 分布中学习, 困难得多, 因为标准方法很容易偏向于最经常的班级。 在本文中, 我们对胸前X光片特定领域的胸腔X光病特定领域的长尾学习问题进行综合基准研究。 我们的重点是从自然分发的胸前X光片数据中学习, 不仅在常见的“头”类中, 而且在稀有但至关重要的“尾”类中, 优化分类准确性。 为了实现这一点, 我们引入了具有挑战性的新型长尾X光线的机器, 以便利研究制定医学图像分类的长尾的学习方法。 基准包括两套用于19和20个胸X光轴疾病分类的胸X光谱数据集, 包括多达53 000个班级的班级, 以及只有7个标签的培训图像。 我们评估了标准与州- R- 最高级的直径直观/直观的长的图表, 分析了这些长期的图表, 分析方法, 用于这个医学/直观的医学图表, 分析。