项目名称: 基于复合分位数回归和最大秩相关想法的ROC回归曲线估计
项目编号: No.11301031
项目类型: 青年科学基金项目
立项/批准年度: 2014
项目学科: 数理科学和化学
项目作者: 段小刚
作者单位: 北京师范大学
项目金额: 22万元
中文摘要: 受试者操作特征(ROC)回归曲线广泛用于评估连续型诊断检验的分类精度。文献上常使用间接建模方法估计ROC回归曲线:首先对诊断检验结果关于协变量和疾病状态建立位置刻度模型,并通过拟似然或局部多项式回归估计模型参数;然后基于获得的拟合残差使用非参或核光滑方法估计目标ROC曲线。这类方法对模型随机误差分布有很强的依赖性,且对观测数据中的异常值特别敏感。本课题中,我们拟使用复合分位数及最大秩相关等技巧估计ROC回归曲线。我们分别考虑数据完全观测和含随机缺失两种情况。为便于使用上述方法,我们针对一类参数位置刻度模型提出一种新的模型描述方法。在新的模型描述框架下,目标ROC回归曲线形式更加简洁,且更便于理论上比较各种估计的渐近方差。考虑到ROC曲线估计表达式的复杂性,我们拟借助M-估计理论及经验过程知识证明估计量的渐近正态性。我们拟通过大量数值模拟及老年痴呆症方面的实际数据考察各类方法的有限样本表现。
中文关键词: 复合分位数回归;诊断精度;位置刻度模型;随机缺失;重参数化
英文摘要: The covariate-specific receiver operating characteristic curve or ROC regression is frequently used to evaluate the classification accuracy of a diagnostic test when it is associated with certain covariates. Indirect modeling of ROC regression is widely discussed in the literature. This method first builds a disease-specific regression model between test results and covariates as well as disease status, and estimates model parameters based on quasilikelihood method or local polynomial regression; then it estimates the targeted ROC curve based on nonparametric or kernel smoothing estimates of the disease-specific survival function using fitted residuals obtained previously. Considering the limitations of this method (e.g.: depending heavily on properties of the random errors, and is sensitive to outliers), we propose in this project a new series of methods for ROC regression based on the ideas of composite quantile regression and maximum rank correlation. We discuss these methods both in the absense or presence of missing data at random. These include missing test result, missing covariates, missing gold standard, or a mixture of them. To facilitate the use and comparison of different regression methods, we also propose a new formulation for a class of traditional location-scale model. Under the new framework,the
英文关键词: composite quantile regression;diagnostic accuracy;location scale model;missing at random;reparameterization