Recently, generalization on out-of-distribution (OOD) data with correlation shift has attracted great attention. The correlation shift is caused by the spurious attributes that correlate to the class label, as the correlation between them may vary in training and test data. For such a problem, we show that given the class label, the conditionally independent models of spurious attributes are OOD generalizable. Based on this, a metric Conditional Spurious Variation (CSV) which controls OOD generalization error, is proposed to measure such conditional independence. To improve the OOD generalization, we regularize the training process with the proposed CSV. Under mild assumptions, our training objective can be formulated as a nonconvex-concave mini-max problem. An algorithm with provable convergence rate is proposed to solve the problem. Extensive empirical results verify our algorithm's efficacy in improving OOD generalization.
翻译:最近,与相关变化相关的分配(OOD)数据普遍化引起了极大关注。相关变化是由与阶级标签相关的虚假属性引起的,因为两者在培训和测试数据方面的相关性可能各不相同。对于这样一个问题,我们表明,根据等级标签,有条件独立的虚假属性模型是可普遍适用的OOD。在此基础上,建议采用控制OOOD一般化误差的通用条件(CSV)标准标准测量这种有条件的独立性。为了改进OOOD一般化,我们把培训过程与拟议的CSV规范化。在轻度假设下,我们的培训目标可以被确定为非convex-conculcave mini-max问题。建议采用具有可辨别一致率的算法来解决这个问题。广泛的经验结果可以验证我们的算法在改进 OOD一般化方面的功效。