Imbalanced training data is a significant challenge for medical image classification. In this study, we propose a novel Progressive Class-Center Triplet (PCCT) framework to alleviate the class imbalance issue particularly for diagnosis of rare diseases, mainly by carefully designing the triplet sampling strategy and the triplet loss formation. Specifically, the PCCT framework includes two successive stages. In the first stage, PCCT trains the diagnosis system via a class-balanced triplet loss to coarsely separate distributions of different classes. In the second stage, the PCCT framework further improves the diagnosis system via a class-center involved triplet loss to cause a more compact distribution for each class. For the class-balanced triplet loss, triplets are sampled equally for each class at each training iteration, thus alleviating the imbalanced data issue. For the class-center involved triplet loss, the positive and negative samples in each triplet are replaced by their corresponding class centers, which enforces data representations of the same class closer to the class center. Furthermore, the class-center involved triplet loss is extended to the pair-wise ranking loss and the quadruplet loss, which demonstrates the generalization of the proposed framework. Extensive experiments support that the PCCT framework works effectively for medical image classification with imbalanced training images. On two skin image datasets and one chest X-ray dataset, the proposed approach respectively obtains the mean F1 score 86.2, 65.2, and 90.66 over all classes and 81.4, 63.87, and 81.92 for rare classes, achieving state-of-the-art performance and outperforming the widely used methods for the class imbalance issue.
翻译:86. 在本研究中,我们提出一个新的进步级Center Triplet(PCCT)框架,以缓解班级不平衡问题,特别是诊断罕见疾病的班级不平衡问题,主要是仔细设计三重抽样战略和三重损失形成。具体地说,PCCT框架包括两个连续阶段。在第一阶段,PCCT通过阶级平衡三重损失将诊断系统培训成分级分布粗略的三重损失。在第二阶段,PCCT框架通过涉及三重损失的班级中心进一步改进诊断系统,为每类带来更紧凑的分布。对于班级平衡的三重损失,每个班均进行同等的抽样检查,从而缓解了三重损失的数据问题。对于班中心来说,每三重损失通过班平衡的三重损失将诊断系统训练系统培训。 PCC框架通过相应的班中心将同一班级的数据表示出更接近班中心的数据分布。 此外,涉及三重损失的班级系统系统将三重损失扩大到对齐排名损失的班级分布,每个班级分布更紧紧紧的班级分布。 对于班级三重的三重的班级损失,对于每个班级损失的班级,三重的班级,对等分级,对级,对班级进行抽样损失进行抽样分析,对每个班,对班级进行抽样分析,对班级进行抽样分析,分别进行抽样分析,分别进行抽样分析。