We propose performing imbalanced classification by regrouping majority classes into small classes so that we turn the problem into balanced multiclass classification. This new idea is dramatically different from popular loss reweighting and class resampling methods. Our preliminary result on imbalanced medical image classification shows that this natural idea can substantially boost the classification performance as measured by average precision (approximately area-under-the-precision-recall-curve, or AUPRC), which is more appropriate for evaluating imbalanced classification than other metrics such as balanced accuracy.
翻译:我们建议通过将多数类重新组合成小类来进行不平衡的分类,这样我们就可以把问题变成平衡的多级分类。 这个新想法与流行的流失重排和类重抽方法大不相同。 我们关于不平衡的医疗图像分类的初步结果显示,这个自然想法可以大大提升以平均精确度(即AUPRC)衡量的分类性能,后者比平衡准确性等其他指标更适合评估不平衡的分类。