We propose a sequential, anytime valid method to test the conditional independence of a response $Y$ and a predictor $X$ given a random vector $Z$. The proposed test is based on $E$-statistics and test martingales, which generalize likelihood ratios and allow valid inference at arbitrary stopping times. In accordance with the recently introduced model-X setting, our test depends on the availability of the conditional distribution of $X$ given $Z$, or at least a sufficiently sharp approximation thereof. Within this setting, we derive a full characterization of $E$-statistics for testing conditional independence, investigate growth-rate optimal $E$-statistics and their power properties, and show that our method yields tests with asymptotic power one in the special case of a logistic regression model. A simulation study is done to demonstrate that the approach is robust with respect to violations of the model-X assumption and competitive in terms of power when compared to established sequential and non-sequential testing methods.
翻译:我们建议采用连续、随时有效的方法来测试一个应答美元和预测美元(X)的有条件独立性,给一个随机矢量(Z)美元。拟议的测试基于以美元为单位的统计学和测试马丁酸,该测试将概率比率普遍化,允许在任意停留时进行有效推断。根据最近采用的模型-X设定,我们的测试取决于有条件分配的美元(X)是否为Z美元,或至少是足够精确的近似值。在这个背景下,我们得出了以美元为单位的统计学的完整特征,用于测试有条件独立,调查增长率-最佳的美元统计学及其功率特性,并表明我们的方法在物流回归模型的特殊情况下,以无药力进行测试。进行了模拟研究,以证明在违反模式-X假设和与既定的顺序和非顺序测试方法相比,该方法在实力方面具有很强的竞争力。