We propose a new method for supervised learning with multiple sets of features ("views"). The multi-view problem is especially important in biology and medicine, where "-omics" data such as genomics, proteomics and radiomics are measured on a common set of samples. Cooperative learning combines the usual squared error loss of predictions with an "agreement" penalty to encourage the predictions from different data views to agree. By varying the weight of the agreement penalty, we get a continuum of solutions that include the well-known early and late fusion approaches. Cooperative learning chooses the degree of agreement (or fusion) in an adaptive manner, using a validation set or cross-validation to estimate test set prediction error. One version of our fitting procedure is modular, where one can choose different fitting mechanisms (e.g. lasso, random forests, boosting, neural networks) appropriate for different data views. In the setting of cooperative regularized linear regression, the method combines the lasso penalty with the agreement penalty. The method can be especially powerful when the different data views share some underlying relationship in their signals that can be exploited to strengthen signal, while each view has its idiosyncratic noise that needs to be reduced. We show that cooperative learning achieves higher predictive accuracy on simulated data and real multiomics examples of cancer stage and treatment response prediction. Leveraging aligned signals and allowing flexible fitting mechanisms for different modalities, cooperative learning offers a powerful approach to multiomics data fusion.
翻译:我们提出了一套有多种特征(“视图”)的监督性学习新方法。多观点问题在生物学和医学中特别重要,在生物和医学中,“工程”数据,如基因组学、蛋白质组学和放射学等,在一组共同样本中测量“工程学”数据。合作学习结合了通常的平方错误的预测损失和“协议”惩罚,以鼓励不同数据观点的预测,从而达成一致。通过改变协议处罚的权重,我们得到了一系列的解决方案,其中包括早期和晚期著名的聚合方法。合作学习以适应的方式选择了协议(或聚合)的程度,使用一个验证集或交叉校准来估计预测错误。我们一个适合程序的版本是模块化的,可以选择适合不同数据观点的不同机制(如:lasso、随机森林、提振动、神经网络)的“协议”惩罚。在确定合作性定期线性回归的设置中,将拉素治疗方法与协议处罚结合起来。当不同的数据观点以适应方式分享某些基本关系时,方法会特别强大,使用一种验证或交叉校准的方式来估计预测错误错误错误。我们可以使用不同的指标,同时学习各种指标,以显示一种更精确的预言的预言的预言,我们可以展示。