多重分析(PIMA):基于标志翻转得分测试的推断框架 (Post-selection Inference in Multiverse Analysis (PIMA): an inferential framework based on the sign flipping score test)

When analyzing data researchers make some decisions that are either arbitrary, based on subjective beliefs about the data generating process, or for which equally justifiable alternative choices could have been made. This wide range of data-analytic choices can be abused, and has been one of the underlying causes of the replication crisis in several fields. Recently, the introduction of multiverse analysis provides researchers with a method to evaluate the stability of the results across reasonable choices that could be made when analyzing data. Multiverse analysis is confined to a descriptive role, lacking a proper and comprehensive inferential procedure. Recently, specification curve analysis adds an inferential procedure to multiverse analysis, but this approach is limited to simple cases related to the linear model, and only allows researchers to infer whether at least one specification rejects the null hypothesis, but not which specifications should be selected. In this paper we present a Post-selection Inference approach to Multiverse Analysis (PIMA) which is a flexible and general inferential approach that accounts for all possible models, i.e., the multiverse of reasonable analyses. The approach allows for a wide range of data specifications (i.e. pre-processing) and any generalized linear model; it allows testing the null hypothesis of a given predictor not being associated with the outcome, by merging information from all reasonable models of multiverse analysis, and provides strong control of the family-wise error rate such that it allows researchers to claim that the null-hypothesis can be rejected for each specification that shows a significant effect. The inferential proposal is based on a conditional resampling procedure. To be continued...

翻译：当分析数据研究人员根据对数据生成过程的主观信念作出一些武断的决定时,如果分析数据研究人员根据对数据生成过程的主观信念作出一些武断的决定,或者可以作出同样合理的替代选择。这种广泛的数据分析选择可能会被滥用,并且一直是若干领域复制危机的根本原因之一。最近,采用多角度分析为研究人员提供了一种方法,用以评价分析数据时可能作出的合理选择的结果的稳定性。多角度分析限于描述作用,缺乏适当和全面的推断程序。最近,规格曲线分析增加了多角度分析的推论程序,但这种方法仅限于与线性模型有关的简单案例,而且只能让研究人员推断至少一个规格是否拒绝无效假设,而不是应该选择哪些规格。在本文件中,我们提出了一个选择后推论方法,用以评价分析在分析数据分析时可能作出的合理选择。多角度分析的灵活和一般推论方法可以说明所有可能的模型,即:合理分析的多角度分析程序可以使广泛的数据规格(即:与线性模型有关的简单、预选前推论)允许进行一系列的比较(即逻辑),并且任何直线性分析的模型都允许进行彻底的模拟的模拟分析。