The Receiver Operating Characteristic (ROC) curve is a useful tool that measures the discriminating power of a continuous variable or the accuracy of a pharmaceutical or medical test to distinguish between two conditions or classes. In certain situations, the practitioner may be able to measure some covariates related to the diagnostic variable which can increase the discriminating power of the ROC curve. To protect against the existence of atypical data among the observations, a procedure to obtain robust estimators for the ROC curve in presence of covariates is introduced. The considered proposal focusses on a semiparametric approach which fits a location-scale regression model to the diagnostic variable and considers empirical estimators of the regression residuals distributions. Robust parametric estimators are combined with adaptive weighted empirical distribution estimators to down-weight the influence of outliers. The uniform consistency of the proposal is derived under mild assumptions. A Monte Carlo study is carried out to compare the performance of the robust proposed estimators with the classical ones both, in clean and contaminated samples. A real data set is also analysed.
翻译:接收器操作特征(ROC)曲线是一个有用的工具,用来衡量连续变量的区别力或药物或医学测试的准确性,以区分两种条件或类别;在某些情况下,执业者可能能够测量与诊断变量有关的一些共变变量,这种变量可以增加ROC曲线的差别性力量;为避免观测中存在非典型数据,引入了一个程序,以便在有共变的情况下获得对ROC曲线的稳健估计器;经过审议的提案侧重于一种半参数法,该半参数法适合诊断变量的定位尺度回归模型,并考虑回归残留分布的经验性估测器。强度准估计器与适应性加权实验性分布估计器相结合,以降低外部效应的影响。建议的统一一致性是在温和假设下得出的。开展蒙特卡洛研究的目的是将拟议的稳健的估测器的性能与清洁和受污染的样本中的典型的典型的性能进行比较。还分析了真实的数据集。