Area under the ROC curve, a.k.a. AUC, is a measure of choice for assessing the performance of a classifier for imbalanced data. AUC maximization refers to a learning paradigm that learns a predictive model by directly maximizing its AUC score. It has been studied for more than two decades dating back to late 90s and a huge amount of work has been devoted to AUC maximization since then. Recently, stochastic AUC maximization for big data and deep AUC maximization for deep learning have received increasing attention and yielded dramatic impact for solving real-world problems. However, to the best our knowledge there is no comprehensive survey of related works for AUC maximization. This paper aims to address the gap by reviewing the literature in the past two decades. We not only give a holistic view of the literature but also present detailed explanations and comparisons of different papers from formulations to algorithms and theoretical guarantees. We also identify and discuss remaining and emerging issues for deep AUC maximization, and provide suggestions on topics for future work.
翻译:ACU的最大化是指一种学习模式,通过直接最大限度地增加ACU的得分来学习预测模型;20多年来,一直对它进行研究,自1990年代后期以来,它一直致力于AUC的最大化;最近,AUC对大数据最大化和AUC深度最大化以进行深层次学习的问题日益受到关注,对解决现实世界问题产生了重大影响;然而,据我们所知,AUC的最大化是指一种学习模式,它通过直接最大限度地增加ACU的得分来学习预测模型;20多年来,它一直被研究到90年代后期,并且自那时起,它一直致力于AUC的最大化。我们还查明并讨论从公式到算法和理论保证的不同论文,并详细解释和比较了从拟订到算法和理论保证的不同论文。我们还查明和讨论了其余的和正在出现的问题,以便深入实现ACU的最大化,并就未来工作的专题提出建议。