Conditional selective inference (SI) has been actively studied as a new statistical inference framework for data-driven hypotheses. The basic idea of conditional SI is to make inferences conditional on the selection event characterized by a set of linear and/or quadratic inequalities. Conditional SI has been mainly studied in the context of feature selection such as stepwise feature selection (SFS). The main limitation of the existing conditional SI methods is the loss of power due to over-conditioning, which is required for computational tractability. In this study, we develop a more powerful and general conditional SI method for SFS using the homotopy method which enables us to overcome this limitation. The homotopy-based SI is especially effective for more complicated feature selection algorithms. As an example, we develop a conditional SI method for forward-backward SFS with AIC-based stopping criteria and show that it is not adversely affected by the increased complexity of the algorithm. We conduct several experiments to demonstrate the effectiveness and efficiency of the proposed method.
翻译:作为数据驱动假设的新的统计推论框架,对有条件的选择性推断进行了积极研究,这是数据驱动假设的新统计推论框架。有条件的SI的基本想法是,以一系列线性和/或二次不平等为特点的选择活动为条件推论;有条件的SI主要在特征选择方面进行了研究,如分级特征选择(SFS),现有有条件的SI方法的主要局限性是,由于超调而丧失了权力,而这是计算可移动性的必要条件。在本研究中,我们用同质调法为SFS开发了一种更强大和一般的有条件的SI方法,使我们能够克服这一限制。基于同质调制的SI对于更复杂的特征选择算法特别有效。举例来说,我们开发了一种条件性SI方法,即前向后向SFSFS,采用基于AC的停止标准,并表明它不会受到算法日益复杂的不利影响。我们进行了几次实验,以证明拟议方法的有效性和效率。