结构模型的近似交叉估价 (Approximate Cross-Validation for Structured Models)

from arxiv, 25 pages, 8 figures. NeurIPS 2020 camera ready. v2 fixes typos and provides additional empirical results. Code: https://github.com/SoumyaTGhosh/structured-infinitesimal-jackknife

Many modern data analyses benefit from explicitly modeling dependence structure in data -- such as measurements across time or space, ordered words in a sentence, or genes in a genome. A gold standard evaluation technique is structured cross-validation (CV), which leaves out some data subset (such as data within a time interval or data in a geographic region) in each fold. But CV here can be prohibitively slow due to the need to re-run already-expensive learning algorithms many times. Previous work has shown approximate cross-validation (ACV) methods provide a fast and provably accurate alternative in the setting of empirical risk minimization. But this existing ACV work is restricted to simpler models by the assumptions that (i) data across CV folds are independent and (ii) an exact initial model fit is available. In structured data analyses, both these assumptions are often untrue. In the present work, we address (i) by extending ACV to CV schemes with dependence structure between the folds. To address (ii), we verify -- both theoretically and empirically -- that ACV quality deteriorates smoothly with noise in the initial fit. We demonstrate the accuracy and computational benefits of our proposed methods on a diverse set of real-world applications.

翻译：许多现代数据分析得益于数据中明确的依赖性结构模型化 -- -- 例如不同时间或空间的测量、句子中的定单词、或基因组中的基因。金标准评价技术是结构化的交叉验证技术,每个折叠中留有某些数据子集(例如一个时间间隔内的数据或地理区域的数据),但是,由于需要重新运行已经很昂贵的学习算法,这里的分类分析可能过于缓慢。以前的工作显示交叉验证(ACV)方法近似于交叉验证(ACV)方法,为尽量减少实证风险提供了一个快速和可辨别的准确的替代方法。但是,现有的ACV工作限于更简单的模型,其假设是:(一) 跨CV折叠的数据是独立的,和(二) 完全的初始模型是合适的。在结构化数据分析中,这两种假设往往都是不真实的。在目前的工作中,我们处理的是(一) 通过将ACV扩大为CV计划,在折叠叠中具有依赖性结构。为了解决(二),我们从理论上和实验上都核查 -- -- -- ACV质量会随着最初适合的噪音而平稳地恶化。我们提出的应用方法的精确和计算。

相关内容

MoDELS

关注 0

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【微软亚洲研究院】CodeBERT:用于编程和自然语言的预训练模型，CodeBERT: A Pre-Trained Model for Programming and Natural Languages

专知会员服务

32+阅读 · 2020年2月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日