Assessing predictive models can be challenging. Modelers must navigate a wide array of evaluation methodologies implemented with incompatible interfaces across multiple packages which may give different or even contradictory results, while ensuring that their chosen approach properly estimates the performance of their model when generalizing to new observations. Assessing models fit to spatial data can be particularly difficult, given that model errors may exhibit spatial autocorrelation, model predictions are often aggregated to multiple spatial scales by end users, and models are often tasked with generalizing into spatial regions outside the boundaries of their initial training data. The waywiser package for the R language attempts to make assessing spatial models easier by providing an ergonomic toolkit for model evaluation tasks, with functions for multiple assessment methodologies sharing a unified interface. Functions from waywiser share standardized argument names and default values, making the user-facing interface simple and easy to learn. These functions are additionally designed to be easy to integrate into a wide variety of modeling workflows, accepting standard classes as inputs and returning size- and type-stable outputs, ensuring that their results are of consistent and predictable data types and dimensions. Additional features make it particularly easy to use waywiser along packages and workflows in the tidymodels ecosystem.
翻译:评估预测模型可能具有挑战性。建模者必须跨多个包实现不兼容界面的广泛评估方法,并确保他们选择的方法在一般化到新观测时正确估计其模型的性能。评估拟合到空间数据的模型可能特别困难,因为模型误差可能具有空间自相关性,模型预测通常被最终用户聚合到多种空间尺度中,并且模型通常需要推广到超出其初始训练数据边界的空间区域。R语言的waywiser包试图通过为模型评估任务提供人体工程学工具包来使评估空间模型更加容易。waywiser的函数共享统一的界面,支持多个评估方法。waywiser函数共享标准化的参数名称和默认值,使用户面向接口简单易学。这些功能还被设计为易于集成到各种建模工作流程中,接受标准类作为输入并返回大小和类型稳定的输出,确保其结果具有一致和可预测的数据类型和维度。其他功能使waywiser非常容易与tidymodels生态系统中的包和工作流程一起使用。