When monitoring machine learning systems, two-sample tests of homogeneity form the foundation upon which existing approaches to drift detection build. They are used to test for evidence that the distribution underlying recent deployment data differs from that underlying the historical reference data. Often, however, various factors such as time-induced correlation mean that batches of recent deployment data are not expected to form an i.i.d. sample from the historical data distribution. Instead we may wish to test for differences in the distributions conditional on \textit{context} that is permitted to change. To facilitate this we borrow machinery from the causal inference domain to develop a more general drift detection framework built upon a foundation of two-sample tests for conditional distributional treatment effects. We recommend a particular instantiation of the framework based on maximum conditional mean discrepancies. We then provide an empirical study demonstrating its effectiveness for various drift detection problems of practical interest, such as detecting drift in the distributions underlying subpopulations of data in a manner that is insensitive to their respective prevalences. The study additionally demonstrates applicability to ImageNet-scale vision problems.
翻译:当监测机器学习系统时,对同质性进行两类抽样测试,形成现有漂移探测方法的基础,用于测试最近部署数据的分布与历史参考数据的基础不同的证据。然而,由于时间因素引起的关联,往往意味着最近部署数据的批次不会形成从历史数据分布中提取的i.d.样本。相反,我们不妨测试允许改变的基于\textit{context}的分布差异。为了便利这一测试,我们从因果推断域借用了机器,以开发一个更普遍的漂移探测框架,这个框架建立在有条件分布处理效果的两类抽样测试的基础上。我们建议根据最大有条件平均差异对框架进行特别的即时化。然后,我们提供一项实验性研究,表明它对于各种实际感兴趣的漂移探测问题的有效性,例如以对数据亚群分布中的漂移情况不敏感的方式检测这些数据。该研究还表明对图像网络规模的视觉问题的适用性。