How can we quantify uncertainty if our favorite computational tool - be it a numerical, a statistical, or a machine learning approach, or just any computer model - provides single-valued output only? In this article, we introduce the Easy Uncertainty Quantification (EasyUQ) technique, which transforms real-valued model output into calibrated statistical distributions, based solely on training data of model output-outcome pairs, without any need to access model input. In its basic form, EasyUQ is a special case of the recently introduced Isotonic Distributional Regression (IDR) technique that leverages the pool-adjacent-violators algorithm for nonparametric isotonic regression. EasyUQ yields discrete predictive distributions that are calibrated and optimal in finite samples, subject to stochastic monotonicity. The workflow is fully automated, without any need for tuning. The Smooth EasyUQ approach supplements IDR with kernel smoothing, to yield continuous predictive distributions that preserve key properties of the basic form, including both, stochastic monotonicity with respect to the original model output, and asymptotic consistency. For the selection of kernel parameters, we introduce multiple one-fit grid search, a computationally much less demanding approximation to leave-one-out cross-validation. We use simulation examples and the WeatherBench challenge in data-driven weather prediction to illustrate the techniques. In a study of benchmark problems from machine learning, we show how EasyUQ and Smooth EasyUQ can be integrated into the workflow of modern neural network learning and hyperparameter tuning, and find EasyUQ to be competitive with more elaborate input-based approaches.
翻译:如果我们最喜欢的计算工具 — 无论是数字工具、统计工具或机器学习方法,还是任何计算机模型 — 仅提供单价产出,我们如何量化不确定性呢?在本篇文章中,我们引入了“简单不确定性量化(EasyUQ)”技术,该技术将实际价值模型产出转化为校准的统计分布,仅以模型输出结果配对的培训数据为基础,无需调整模式输入。在基本形式中,“简单UQ”是一个特殊的例子,它最近引入了“高压分布分析(IDR)”技术,将组合对齐机机算的机算计算法用于非等值回归。“简单UQUQ”技术将实际价值模型输出的模型输出转化为经校准的统计分布,而无需调整。“简便UQ”方法以“内流”方法补充了“内流”系统,从而产生“持续预测方法”,维护了基本形式的关键特性,包括“静态对调”和“机算”的计算方法。 Q- 简单化的计算,在原始输出的模型中, 以最不易的计算方式显示原始输出的模型和计算中, 显示一个模型的精确的模型的精确的模型的模型,可以找到一个稳定的计算。