Replication studies are increasingly conducted but there is no established statistical criterion for replication success. We propose a novel approach combining reverse-Bayes analysis with Bayesian hypothesis testing: a sceptical prior is determined for the effect size such that the original finding is no longer convincing in terms of a Bayes factor. This prior is then contrasted to an advocacy prior (the reference posterior of the effect size based on the original study), and replication success is declared if the replication data favour the advocacy over the sceptical prior at a higher level than the original data favoured the sceptical prior over the null hypothesis. The sceptical Bayes factor is the highest level where replication success can be declared. A comparison to existing methods reveals that the sceptical Bayes factor combines several notions of replicability: it ensures that both studies show sufficient evidence against the null and penalises incompatibility of their effect estimates. Analysis of asymptotic properties and error rates, as well as case studies from the Social Sciences Replication Project show the advantages of the method for the assessment of replicability.
翻译:复制研究越来越多地进行,但没有关于复制成功的既定统计标准。我们建议采用新颖的方法,将反向贝耶斯分析与巴耶斯假设测试相结合:先怀疑先验,其影响大小使原始调查结果不再具有贝耶斯系数的说服力,然后将前者与先前的倡导工作作对比(根据原始研究报告,其影响规模的参考后背),如果复制数据比原始数据更有利于先前怀疑者,先验数据更有利于怀疑者,然后宣布无效假设,则推广成功。怀疑性湾系数是可宣布复制成功的最高水平。与现有方法进行比较后发现,怀疑性海湾系数结合了几种可复制性概念:它确保两项研究都足以证明无效和惩罚其影响估计不相容的无效和惩罚性。分析无症状特性和错误率,以及社会科学复制项目的案例研究显示评估可复制性方法的优点。