We develop E variables for testing whether two data streams come from the same source or not, and more generally, whether the difference between the sources is larger than some minimal effect size. These E variables lead to tests that remain safe, i.e. keep their Type-I error guarantees, under flexible sampling scenarios such as optional stopping and continuation. We also develop the corresponding always-valid confidence intervals. In special cases our E variables also have an optimal `growth' property under the alternative. We illustrate the generic construction through the special case of 2x2 contingency tables, where we also allow for the incorporation of different restrictions on a composite alternative. Comparison to p-value analysis in simulations and a real-world example show that E variables, through their flexibility, often allow for early stopping of data collection, thereby retaining similar power as classical methods.
翻译:我们开发E变量,用于测试两个数据流是否来自同一来源,以及更笼统地说,两个来源之间的差异是否大于某些最小影响大小。这些E变量导致的测试仍然安全,即根据灵活的抽样假设,如选择性停止和继续,保留其类型I错误保证。我们还开发了相应的始终有效的信任间隔。在特殊情况下,我们的E变量在替代情况下也具有最佳的“增长”属性。我们通过2x2应急表的特例来说明通用结构,我们允许对复合替代表采用不同的限制。在模拟和现实世界实例中,与P值分析进行比较表明,E变量通过其灵活性,往往允许早期停止数据收集,从而保留与传统方法相似的力量。