How can we draw trustworthy scientific conclusions? One criterion is that a study can be replicated by independent teams. While replication is critically important, it is arguably insufficient. If a study is biased for some reason and other studies recapitulate the approach then findings might be consistently incorrect. It has been argued that trustworthy scientific conclusions require disparate sources of evidence. However, different methods might have shared biases, making it difficult to judge the trustworthiness of a result. We formalize this issue by introducing a "distributional uncertainty model", which captures biases in the data collection process. Distributional uncertainty is related to other concepts in statistics, ranging from correlated data to selection bias and confounding. We show that a stability analysis on a single data set allows to construct confidence intervals that account for both sampling uncertainty and distributional uncertainty.
翻译:我们如何得出值得信赖的科学结论?一个标准是,一项研究可以由独立小组复制。虽然复制至关重要,但可能不够充分。如果一项研究出于某种原因有偏向,而其他研究则总结了这种方法,那么研究结果可能始终不正确。有人认为,可靠的科学结论需要不同的证据来源。然而,不同的方法可能有共同的偏见,难以判断结果的可信度。我们通过引入“分布不确定性模型”将这一问题正式化,该模型捕捉数据收集过程中的偏差。分布不确定性与统计中的其他概念有关,从相关数据到选择偏差和混杂。我们表明,对单一数据集的稳定分析可以建立信任间隔,既考虑到抽样不确定性,又考虑到分布不确定性。