In many biomedical problems, data are often heterogeneous, with samples spanning multiple patient subgroups, where different subgroups may have different disease subtypes, stages, or other medical contexts. These subgroups may be related, but they are also expected to have differences with respect to the underlying biology. The heterogeneous data presents a precious opportunity to explore the heterogeneities and commonalities between related subgroups. Unfortunately, effective statistical analysis methods are still lacking. Recently, several novel methods based on integrative analysis have been proposed to tackle this challenging problem. Despite promising results, the existing studies are still limited by ignoring data contamination and making strict assumptions of linear effects of covariates on response. As such, we develop a robust nonparametric integrative analysis approach to identify heterogeneity and commonality, as well as select important covariates and estimate covariate effects. Possible data contamination is accommodated by adopting the Cauchy loss function, and a nonparametric model is built to accommodate nonlinear effects. The proposed approach is based on a sparse boosting technique. The advantages of the proposed approach are demonstrated in extensive simulations. The analysis of The Cancer Genome Atlas data on glioblastoma multiforme and lung adenocarcinoma shows that the proposed approach makes biologically meaningful findings with satisfactory prediction.
翻译:在许多生物医学问题中,数据往往是多种多样的,其样本涉及多个病人分组,不同分组可能具有不同的疾病子子类型、阶段或其他医疗背景,不同分组可能具有不同的疾病子类型、阶段或其他医疗背景。这些分组可能相互关联,但预计它们也会在基本生物学方面有差异。各种数据为探索相关分组之间的异异异性和共性和共性提供了宝贵的机会,探索相关分组之间的异异异性和共性和共性提供了宝贵的机会。不幸的是,有效的统计分析方法仍然缺乏。最近,根据综合分析提出了几种新方法,以解决这一具有挑战性的问题。尽管取得了可喜的成果,但现有的研究仍然有限,因为忽视数据污染和严格假设同异变对反应的线性影响。因此,我们开发了一种强有力的、非对等综合综合分析方法,以找出异异异性和共性和共性和共性和共性,以及选择重要的同性和估计相异性,这是一次宝贵的机会,以探索相关分组之间的异异性和共性和共性和共性;不幸的是,仍然缺乏有效的统计分析方法,以非线效应为基础,以非线性影响为基础。拟议方法的基础是稀少推进技术,其优点。拟议方法的优点体现在。拟议方法的优点在广泛的模拟中展示了拟议方法的优点。因此显示拟议方法的优点。我们为此的优点的优点。我们开发了广泛模拟,分析了拟议的方法的系统分析了拟议的方法的系统,分析,其提议方法的图的图的图的优点是,其图的图的图的图的图的图,其分析显示的图的图的图的图,用,其分析显示,其图的图的图的图的图的图的图的图的图的图的图的图的图的图,以,用,用,用是,用的图的图的图的图的图的图的图显示的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图,以。。。分析显示,分析显示,分析显示,用的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图的图,是,分析显示,分析显示的图的图的图