Experiments adhering to the same protocol can nonetheless lead to different conclusions, for instance, due to batch effects or lab effects. A statistical test applied to measurements from one experiment may yield a vanishingly small $p$-value, yet applying the same test to measurements from a replicate experiment may yield a large $p$-value. Recent work has highlighted this lack of reproducibility in cell-perturbation experiments. We introduce the Reproducible Sign Rate (RSR), a new reproducibility metric for settings in which each hypothesis test has two alternatives (e.g., upregulation and downregulation of gene expression). The RSR identifies the proportion of discoveries that are expected to reproduce in a future replicate. We provide conditions under which the RSR can be estimated accurately -- even when as few as two experimental replicates are available. We also provide conditions under which high RSR implies a low Type S error rate. We demonstrate the uses of RSR with experiments based on several high-throughput technologies, including L1000, Sci-Plex, and CRISPR.
翻译:然而,遵守同一协议的实验仍可得出不同的结论,例如,由于分批效应或实验室效应,对一项实验的测量应用统计测试,可能会产生消失的小价值,但对复制实验的测量应用同样的测试,可能会产生大价值。最近的工作突出表明了细胞扰动实验中缺乏再复制能力的现象。我们引入了可复制信号率(RSR),这是一个新的可复制性指标,在每种假设测试都有两种替代方法(例如,基因表达方式的上调和下调)的环境下,每种假设测试都有两种可复制性(例如,对基因表达方式的上调和下调)。RSR可以确定预期在未来复制中复制的发现比例。我们提供了可以准确估计RSR的条件 -- -- 即使只有两种实验性复制件,我们也提供了高RSR意味着低S型错误率的条件。我们展示了RSR在几种高通量技术,包括L1000、Sci-Plex和CRISPR的实验中的用途。