We introduce sequential $F$-tests and confidence sequences for subsets of coefficients of a linear model. This generalizes standard univariate Gaussian confidence sequences that are often used to perform sequential A/B tests. When performing inference on treatment effects, the ability to include covariate and treatment-covariate interaction terms reduces the stopping time of the sequential test and the width of the confidence sequences. Our sequential $F$-tests also have other practical applications concerning sequential tests of treatment effect heterogeneity and model selection. Our approach is based on a mixture martingale, using a Gaussian mixture over the coefficients of interest and the right-Haar mixture over the remaining model parameters. This exploits group invariance properties of the linear model to ensure that our procedure possesses a time-uniform Type I error probability of $\alpha$ and time-uniform $1-\alpha$ coverage for all values of the nuisance parameters. This allows experiments to be continuously monitored and stopped using data dependant stopping rules. While our contributions are to provide anytime-valid guarantees, that is, time-uniform frequentist guarantees in terms of Type I error and coverage, our approach is motivated through a Bayesian framework. More specifically, our test statistic can be interpreted as a Bayes factor, which can be readily used in a traditional Bayesian analysis. Our contributions can also be viewed as providing frequentist guarantees to the Bayesian sequential test.
翻译:我们为线性模型的子系数子引入了连续的美元测试和信任序列。 这概括了通常用于进行连续 A/B 测试的标准的单象数高萨信任序列。 当对治疗效果进行推断时, 包含共变和治疗- 共变互动条件的能力会减少连续测试的停止时间和信任序列的宽度。 我们的连续的美元测试还有关于连续测试治疗效果异质和模型选择的其他实际应用。 我们的方法基于混合的马丁格, 使用高萨混合的利息系数和对其余模型参数的右- 豪尔混合。 这利用线性模型的不变化特性来确保我们的程序具有时间- 一致的I 误差概率和时间- 单一的1美元覆盖范围。 这允许不断监测和停止使用数据依赖规则。 虽然我们的贡献是提供时间- 直线性系数系数系数的混合混合物, 具体地说, 测试性标准框架可以提供我们有动机的I- 标准框架 。