We investigate the feature compression of high-dimensional ridge regression using the optimal subsampling technique. Specifically, based on the basic framework of random sampling algorithm on feature for ridge regression and the A-optimal design criterion, we first obtain a set of optimal subsampling probabilities. Considering that the obtained probabilities are uneconomical, we then propose the nearly optimal ones. With these probabilities, a two step iterative algorithm is established which has lower computational cost and higher accuracy. We provide theoretical analysis and numerical experiments to support the proposed methods. Numerical results demonstrate the decent performance of our methods.
翻译:具体地说,根据山脊回归特征随机抽样算法的基本框架和A-最佳设计标准,我们首先获得一套最佳次抽样概率。考虑到获得的概率是不经济的,我们然后提出近乎最佳的概率。有了这些概率,我们建立了两步迭代算法,其计算成本较低,精确度更高。我们提供了理论分析和数字实验以支持拟议方法。数字结果显示了我们方法的恰当性能。