Deep neural networks (DNNs) are notorious for making more mistakes on classes that have substantially fewer training samples than the others. Such class imbalance is ubiquitous in clinical applications and crucial to handle because the classes with fewer samples most often correspond to critical cases (e.g., cancer), where misclassifications can have severe consequences. To avoid missing such cases, binary classifiers need to be operated at high True Positive Rates (TPRs) by adjusting the decision threshold, but under class imbalance this comes at the cost of very high False Positive Rates (FPRs). Existing methods for learning under class imbalance most often do not take this into account. We argue that, for problems where misclassification of the positive, i.e., critical, class samples carries a higher cost, prediction accuracy should be improved by emphasizing the reduction of FPRs at high TPRs. To this end, we pose the training of a DNN for binary classification as a constrained optimization problem and introduce a novel constraint that can be used with existing loss functions to enforce a maximal area under the ROC curve (AUC) by prioritizing FPR reduction at high TPRs. We solve the resulting constrained optimization problem using an Augmented Lagrangian method (ALM). Going beyond the binary case, we also propose two possible extensions of the proposed constraint for multi-class classification problems. We present experimental results for image-based binary and multi-class classification using an in-house medical imaging dataset, CIFAR10, and CIFAR100. Our results demonstrate that the proposed method improves upon the baselines in the majority of cases, attaining higher accuracy on critical classes while reducing the misclassification rate for non-critical class samples.
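To make the approach concrete, the following is a minimal, illustrative sketch of constrained training with an Augmented Lagrangian method, where the constraint bounds a differentiable surrogate of the FPR at a high-TPR operating point. It is not the authors' exact constraint or loss; the network, the soft FPR surrogate, the target TPR, the FPR budget, and all hyperparameters are assumptions made for this example only.

```python
# Sketch: ALM training of a binary classifier with a constraint on (soft) FPR at high TPR.
# All model/data/hyperparameter choices below are illustrative assumptions.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Synthetic imbalanced data: 1000 negatives, 50 positives, 2-D features.
neg = torch.randn(1000, 2)
pos = torch.randn(50, 2) + torch.tensor([1.5, 1.5])
x = torch.cat([neg, pos])
y = torch.cat([torch.zeros(1000), torch.ones(50)])

model = nn.Sequential(nn.Linear(2, 16), nn.ReLU(), nn.Linear(16, 1))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
bce = nn.BCEWithLogitsLoss()

target_tpr = 0.95    # operate the classifier at a high TPR (assumed value)
fpr_budget = 0.20    # constraint: soft FPR at that TPR should stay below this budget
lam, rho = 0.0, 1.0  # ALM multiplier and penalty weight

def soft_fpr_at_tpr(scores, labels, tpr):
    """Differentiable surrogate for the FPR at the threshold achieving the given TPR."""
    pos_scores = scores[labels == 1]
    neg_scores = scores[labels == 0]
    # Threshold = (1 - tpr) quantile of positive scores (detached: treated as an operating point).
    thr = torch.quantile(pos_scores.detach(), 1.0 - tpr)
    # Sigmoid relaxation of the 0/1 indicator "negative score exceeds the threshold".
    return torch.sigmoid((neg_scores - thr) * 5.0).mean()

for epoch in range(200):
    opt.zero_grad()
    logits = model(x).squeeze(1)
    loss = bce(logits, y)
    g = soft_fpr_at_tpr(logits, y, target_tpr) - fpr_budget  # inequality constraint g <= 0
    # Augmented Lagrangian term for the inequality constraint.
    aug = (rho / 2) * torch.clamp(lam / rho + g, min=0.0) ** 2 - lam ** 2 / (2 * rho)
    (loss + aug).backward()
    opt.step()
    if (epoch + 1) % 20 == 0:  # periodic multiplier update (outer ALM step)
        with torch.no_grad():
            g_val = soft_fpr_at_tpr(model(x).squeeze(1), y, target_tpr) - fpr_budget
            lam = max(0.0, lam + rho * g_val.item())
```

The quadratic-penalty-plus-multiplier form and the update `lam = max(0, lam + rho * g)` are the standard ALM treatment of an inequality constraint; in the paper the constraint is instead designed to prioritize FPR reduction at high TPRs so as to maximize AUC, and the multi-class extensions modify this constraint accordingly.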