mlr3spatatiotempcv: R 中机器学习的模拟时间抽样方法 (mlr3spatiotempcv: Spatiotemporal resampling methods for machine learning in R)

Spatial and spatiotemporal machine-learning models require a suitable framework for their model assessment, model selection, and hyperparameter tuning, in order to avoid error estimation bias and over-fitting. This contribution reviews the state-of-the-art in spatial and spatiotemporal cross-validation, and introduces the {R} package {mlr3spatiotempcv} as an extension package of the machine-learning framework {mlr3}. Currently various {R} packages implementing different spatiotemporal partitioning strategies exist: {blockCV}, {CAST}, {skmeans} and {sperrorest}. The goal of {mlr3spatiotempcv} is to gather the available spatiotemporal resampling methods in {R} and make them available to users through a simple and common interface. This is made possible by integrating the package directly into the {mlr3} machine-learning framework, which already has support for generic non-spatiotemporal resampling methods such as random partitioning. One advantage is the use of a consistent nomenclature in an overarching machine-learning toolkit instead of a varying package-specific syntax, making it easier for users to choose from a variety of spatiotemporal resampling methods. This package avoids giving recommendations which method to use in practice as this decision depends on the predictive task at hand, the autocorrelation within the data, and the spatial structure of the sampling design or geographic objects being studied.

翻译：空间和空间时空机学习模型需要一个适合模型评估、模型选择和超参数调整的框架,以避免错误估计偏差和过度配置。此贡献会审查空间和空间时空交叉校准方面的最新技术, 并引入 {R} 软件包 {mlr3spatotempcv} 作为机器学习框架 {mlr3} 的扩展包。目前存在各种执行不同时空分配战略的 {R} 软件包 : {bockCV}, {CAST}, {skmeys} 和{sperroest} 。贡献会审查空间和空间时空交叉校验交叉校准的状态。 {ml3spatotempcv} 的目标是在{R} 中收集可用的时空再版套件包 {ml3spatototempcv}, 通过一个简单和通用的界面向用户提供。通过将软件包直接整合到 {mlr3} 机器学习框架, 它已经支持通用的不值得忍受的物体。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日