The classification of histopathology images fundamentally differs from traditional image classification tasks because histopathology images naturally exhibit a range of diagnostic features, resulting in a diverse range of annotator agreement levels. However, examples with high annotator disagreement are often either assigned the majority label or discarded entirely when training histopathology image classifiers. This widespread practice often yields classifiers that do not account for example difficulty and exhibit poor model calibration. In this paper, we ask: can we improve model calibration by endowing histopathology image classifiers with inductive biases about example difficulty? We propose several label smoothing methods that utilize per-image annotator agreement. Though our methods are simple, we find that they substantially improve model calibration, while maintaining (or even improving) accuracy. For colorectal polyp classification, a common yet challenging task in gastrointestinal pathology, we find that our proposed agreement-aware label smoothing methods reduce calibration error by almost 70%. Moreover, we find that using model confidence as a proxy for annotator agreement also improves calibration and accuracy, suggesting that datasets without multiple annotators can still benefit from our proposed label smoothing methods via our proposed confidence-aware label smoothing methods. Given the importance of calibration (especially in histopathology image analysis), the improvements from our proposed techniques merit further exploration and potential implementation in other histopathology image classification tasks.
翻译:病理学图像的分类与传统的图像分类任务有根本的不同,因为病理学图像自然会显示一系列诊断特征,从而产生不同的诊断特征,因此,病理学图像的分类与传统的图像分类任务不同,因为病理学图像的分类与传统的图像分类有着根本的不同,因为病理学图像的分类自然会显示一系列诊断特征,从而产生各种不同的说明性协议水平。然而,在培训病理学图像分类师时,高注解分歧的例子往往被分配为多数标签,或者被完全抛弃。这种普遍的做法往往使分类人员不考虑困难,而且显示模型校正方法差差的模型校正方法。在本文中,我们问:我们能否通过将病理病理学图像分类的分类方法与对实例难度产生偏差,来改进模型的校正方法?我们建议采用模型信任方法来取代分类协议协议协议协议的难度。我们发现,用模型信任作为鉴定协议的代谢方法也能改善每个模型的校正和准确性,我们虽然方法很简单,但我们发现它们大大改进了模型的模型校正方法,但是,我们提出的修顺地分析方法仍然有利于我们提出的。