The statistical machine learning community has demonstrated considerable resourcefulness over the years in developing highly expressive tools for estimation, prediction, and inference. The bedrock assumptions underlying these developments are that the data comes from a fixed population and displays little heterogeneity. But reality is significantly more complex: statistical models now routinely fail when released into real-world systems and scientific applications, where such assumptions rarely hold. Consequently, we pursue a different path in this paper vis-a-vis the well-worn trail of developing new methodology for estimation and prediction. In this paper, we develop tools and theory for detecting and identifying regions of the covariate space (subpopulations) where model performance has begun to degrade, and study intervening to fix these failures through refitting. We present empirical results with three real-world data sets -- including a time series involving forecasting the incidence of COVID-19 -- showing that our methodology generates interpretable results, is useful for tracking model performance, and can boost model performance through refitting. We complement these empirical results with theory proving that our methodology is minimax optimal for recovering anomalous subpopulations as well as refitting to improve accuracy in a structured normal means setting.
翻译:多年来,统计机学习界在开发高清晰度的估算、预测和推算工具方面表现出相当的智慧。这些发展的基本假设是数据来自固定人口,并显示出很少异质性。但现实则更为复杂:统计模型在向现实世界体系和科学应用发布时,通常会失败,而这种假设很少能维持。因此,我们在本文中寻求不同的道路,以发展新的估算和预测方法的老路。在本文中,我们开发了用于探测和确定模型性能开始退化的共变空间(子群)区域的工具和理论,并研究如何通过重新校正来弥补这些失败。我们用三个真实世界数据集介绍实证结果 -- -- 包括预测COVID-19发生率的时间序列 -- -- 表明,我们的方法产生可解释的结果,有助于跟踪模型性能,并通过重新校正来提高模型性能。我们用这些实证结果补充了这些理论,证明我们的方法在恢复形形形貌子群(亚群)方面最优于微速率,通过结构正常手段的精确度加以调整。