We propose a sequential, anytime valid method to test the conditional independence of a response $Y$ and a predictor $X$ given a random vector $Z$. The proposed test is based on e-statistics and test martingales, which generalize likelihood ratios and allow valid inference at arbitrary stopping times. In accordance with the recently introduced model-X setting, our test depends on the availability of the conditional distribution of $X$ given $Z$, or at least a sufficiently sharp approximation thereof. Within this setting, we derive a full characterization of e-statistics for testing conditional independence, investigate growth-rate optimal e-statistics and their power properties, and show that our method yields tests with asymptotic power one in the special case of a logistic regression model. A simulation study is done to demonstrate that the approach is robust with respect to violations of the model-X assumption and competitive in terms of power when compared to established sequential and non-sequential testing methods.
翻译:我们建议采用连续、随时有效的方法来测试一个应答美元和预测美元(X)的有条件独立性,并给一个随机矢量(Z)美元。拟议的测试基于电子统计和测试马丁格,该测试将概率比率普遍化,允许在任意停留时进行有效推断。根据最近采用的模型X设定,我们的测试取决于有条件分配X美元给Z美元,或至少是足够尖锐的近似值。在这个背景下,我们得出了测试有条件独立的电子统计的充分特征,调查了增长率最佳电子统计及其功率特性,并表明我们的方法在物流回归模型的特殊情况下,以无药力进行测试。进行了模拟研究,以证明在与既定的顺序和非顺序测试方法相比,该方法在违反模型X假设和在实力方面具有竞争力。