We develop E variables for testing whether two data streams come from the same source or not, and more generally, whether the difference between the sources is larger than some minimal effect size. These E variables lead to tests that remain safe, i.e. keep their Type-I error guarantees, under flexible sampling scenarios such as optional stopping and continuation. In special cases our E variables also have an optimal `growth' property under the alternative. We illustrate the generic construction through the special case of 2x2 contingency tables, where we also allow for the incorporation of different restrictions on a composite alternative. Comparison to p-value analysis in simulations and a real-world example show that E variables, through their flexibility, often allow for early stopping of data collection, thereby retaining similar power as classical methods.
翻译:我们开发了E变量,用于测试两个数据流是否来自同一来源,以及更一般地测试来源之间的差别是否大于某些最小影响大小。这些E变量导致的测试仍然安全,即根据灵活抽样假设,如选择性停止和继续,保持其类型I错误保证。在特殊情况下,我们的E变量在替代情况下也具有最佳的“增长”属性。我们通过2x2应急表这一特殊案例来说明通用结构,我们允许对复合替代表采用不同的限制。在模拟和现实世界实例中,与P价值分析进行比较表明,E变量通过灵活性,往往允许早期停止数据收集,从而保留与传统方法相似的力量。