The practice of uncertainty quantification (UQ) validation, notably in machine learning for the physico-chemical sciences, rests on several graphical methods (scattering plots, calibration curves, reliability diagrams and confidence curves) which explore complementary aspects of calibration, without covering all the desirable ones. For instance, none of these methods deals with the reliability of UQ metrics across the range of input features. Based on three complementary concepts, calibration, consistency and adaptivity, the toolbox of common validation methods for variance- and intervals- based metrics is revisited with the aim to provide a better grasp on their capabilities. This study is conceived as an introduction to UQ validation, and all methods are derived from a few basic rules. The methods are illustrated and tested on synthetic datasets and examples extracted from the recent physico-chemical machine learning UQ literature.
翻译:不确定性量化(UQ)验证做法,特别是在物理化学科学的机器学习中,依靠几种图形方法(缩略图、校准曲线、可靠性图表和信心曲线)来探索校准的互补方面,而不涵盖所有可取方面,例如,这些方法没有一个涉及各种输入特征的UQ度量的可靠性,根据三个互补概念(校准、一致性和适应性),重新审视了差异和间隔基度通用校准方法的工具箱,目的是更好地掌握它们的能力,这一研究设想为UQ验证的导言,所有方法都取自少数基本规则,这些方法以合成数据集和从最近的物理化学机器学习UQ文献中提取的例子加以说明和测试。</s>