Deep learning-based classification of rare anemia disorders is challenged by the lack of training data and instance-level annotations. Multiple Instance Learning (MIL) has shown to be an effective solution, yet it suffers from low accuracy and limited explainability. Although the inclusion of attention mechanisms has addressed these issues, their effectiveness highly depends on the amount and diversity of cells in the training samples. Consequently, the poor machine learning performance on rare anemia disorder classification from blood samples remains unresolved. In this paper, we propose an interpretable pooling method for MIL to address these limitations. By benefiting from instance-level information of negative bags (i.e., homogeneous benign cells from healthy individuals), our approach increases the contribution of anomalous instances. We show that our strategy outperforms standard MIL classification algorithms and provides a meaningful explanation behind its decisions. Moreover, it can denote anomalous instances of rare blood diseases that are not seen during the training phase.
翻译:由于缺乏培训数据和实例说明,稀有贫血症的深层学习分类受到挑战。多例学习(MIL)已证明是一个有效的解决方案,但缺乏准确性和解释有限。虽然纳入关注机制已解决这些问题,但其有效性在很大程度上取决于培训样本中的细胞数量和多样性。因此,血液样本中稀有贫血症分类的机器学习表现不佳,尚未解决。在本文件中,我们建议MIL为处理这些限制提供一个可解释的集合方法。通过受益于负面袋(即健康个人的同质良性细胞)的例级信息,我们的方法增加了反常情况的贡献。我们表明,我们的战略超过了MIL标准分类算法,并为它的决定提供了有意义的解释。此外,它可以说明在培训阶段所看不到的稀有血病的异常情况。