We propose a method for variable selection and basis learning for high-dimensional classification with ordinal responses. The proposed method extends sparse multiclass linear discriminant analysis, with the aim of identifying not only the variables relevant to discrimination but also the variables that are order-concordant with the responses. For this purpose, we compute for each variable an ordinal weight, where larger weights are given to variables with ordered group-means, and penalize the variables with smaller weights more severely in the proposed sparse basis learning. A two-step construction for ordinal weights is developed, and we show that the ordinal weights correctly separate ordinal variables from non-ordinal variables with high probability. The resulting sparse ordinal basis learning method is shown to consistently select either the discriminant variables or the ordinal and discriminant variables, depending on the choice of a tunable parameter. Such asymptotic guarantees are given under a high-dimensional asymptotic regime where the dimension grows much faster than the sample size. Simulated and real data analyses further confirm that the proposed basis learning provides more sparse basis, mostly consisting of ordinal variables, than other basis learning methods, including those developed for ordinal classification.
翻译:我们建议一种方法,用于为高维分类选择变量和基础学习,并有正反调。拟议方法扩展了稀少多级线性线性分辨分析,目的是不仅查明与歧视有关的变量,而且查明与答复相符合的变量。为此,我们为每个变量计算一个交点权重,其中对有定序组合值的变量给予较大的权重,并在拟议的稀疏基础学习中更严厉地惩罚重量较轻的变量。为正反调重量开发了两步制结构,并且我们表明,正反调权重从非正反调变量中正确地分离或分辨变量,而且概率高。由此产生的稀疏基或非正反调学习方法显示,始终选择相左变量或分辨变量,这主要取决于选择金枪鱼分量参数。在高维度的单调制度下提供保障,其维度比样本大小增长快得多。模拟和真实数据分析进一步证实,拟议的基础学习方法提供了更隐性的基础,其中大多包括学习的变量或其它方法。