The reduced-rank regression model is a popular model to deal with multivariate response and multiple predictors, and is widely used in biology, chemometrics, econometrics, engineering, and other fields. In the reduced-rank regression modelling, a central objective is to estimate the rank of the coefficient matrix that represents the number of effective latent factors in predicting the multivariate response. Although theoretical results such as rank estimation consistency have been established for various methods, in practice rank determination still relies on information criterion based methods such as AIC and BIC or subsampling based methods such as cross validation. Unfortunately, the theoretical properties of these practical methods are largely unknown. In this paper, we present a novel method called StARS-RRR that selects the tuning parameter and then estimates the rank of the coefficient matrix for reduced-rank regression based on the stability approach. We prove that StARS-RRR achieves rank estimation consistency, i.e., the rank estimated with the tuning parameter selected by StARS-RRR is consistent to the true rank. Through a simulation study, we show that StARS-RRR outperforms other tuning parameter selection methods including AIC, BIC, and cross validation as it provides the most accurate estimated rank. In addition, when applied to a breast cancer dataset, StARS-RRR discovers a reasonable number of genetic pathways that affect the DNA copy number variations and results in a smaller prediction error than the other methods with a random-splitting process.
翻译:降级回归模型是处理多变反应和多个预测器的流行模式,在生物学、化学、计量经济学、计量经济学、工程和其他领域广泛使用。在降级回归模型中,一个中心目标是估计系数矩阵的等级,该模型代表了预测多变反应的有效潜在因素的数量。虽然为各种方法确定了等级估算一致性等理论结果,但在实践中,等级确定仍然依赖于信息标准依据的方法,如ACIC和BIC或跨级验证等基于分级法的方法。不幸的是,这些实用方法的理论性质基本上不为人所知。在本文件中,我们提出了一个名为SRIS-RRR的新方法,该方法选择调制参数,然后根据稳定性方法估算降低降级后回归率的系数矩阵的等级。我们证明,SARIS-RR的等级估计一致性,即与SARIS-RRR选定的调控参数的等级与真实等级相一致。我们通过模拟研究发现,SARIS-RRRRR的理论属性超越了其他最准确的路径变数,在AIC的测算中,对RISRRRRR的测算结果的测算中,提供了其他测算方法。