In the era of big data, variable selection is a key technology for handling high-dimensional problems with a small sample size but a large number of covariables. Different variable selection methods were proposed for different models, such as linear model, logistic model and generalized linear model. However, fewer works focused on variable selection for single index models, especially, for single index logistic model, due to the difficulty arose from the unknown link function and the slow mixing rate of MCMC algorithm for traditional logistic model. In this paper, we proposed a Bayesian variable selection procedure for single index logistic model by taking the advantage of Gaussian process and data augmentation. Numerical results from simulations and real data analysis show the advantage of our method over the state of arts.
翻译:在大数据时代,可变选择是处理高维问题的关键技术,抽样规模小,但有多种可变因素,对线性模型、后勤模型和通用线性模型等不同模型提出了不同的可变选择方法,但是,由于联系功能不明和传统后勤模型的MCMC算法混合速度缓慢造成的困难,重点选择单一指数模型、特别是单一指数后勤模型的可变选择方法较少。在本文中,我们建议利用高西亚进程和数据增强,对单一指数后勤模型采用巴耶西亚变量选择程序。模拟和实际数据分析的数值结果表明,我们的方法优于艺术状态。