Statisticians are largely focused on developing methods that perform well in a frequentist sense -- even the Bayesians. But the widely-publicized replication crisis suggests that these performance guarantees alone are not enough to instill confidence in scientific discoveries. In addition to reliably detecting hypotheses that are (in)compatible with data, investigators require methods that can probe for hypotheses that are actually supported by the data. In this paper, we demonstrate that valid inferential models (IMs) achieve both performance and probativeness properties and we offer a powerful new result that ensures the IM's probing is reliable. We also compare and contrast the IM's dual performance and probativeness abilities with that of Deborah Mayo's severe testing framework.
翻译:基于可能性理论的统计推断提供性能和可靠性保证
翻译后的摘要:
统计学家主要关注在经频率意义下表现良好的方法的开发——甚至包括贝叶斯派。但是广为人知的复制危机表明,仅有这些性能保证还不足以为科学发现带来信心。除了能够可靠地检测和数据兼容或不兼容的假设之外,研究人员还需要能够探究哪些假设实际上被数据所支持的方法。在本文中,我们展示了有效的推断模型(IMs)可以实现性能保证和可靠性保证,并提供了一个强大的新结果,确保了IM的探测是可靠的。我们还将IM的双重性能保证和可靠性能力与Deborah Mayo的严格检验框架进行了比较和对比。