To improve uncertainty quantification of variance networks, we propose a novel tree-structured local neural network model that partitions the feature space into multiple regions based on uncertainty heterogeneity. A tree is built upon giving the training data, whose leaf nodes represent different regions where region-specific neural networks are trained to predict both the mean and the variance for quantifying uncertainty. The proposed Uncertainty-Splitting Neural Regression Tree (USNRT) employs novel splitting criteria. At each node, a neural network is trained on the full data first, and a statistical test for the residuals is conducted to find the best split, corresponding to the two sub-regions with the most significant uncertainty heterogeneity. USNRT is computationally friendly because very few leaf nodes are sufficient and pruning is unnecessary. On extensive UCI datasets, in terms of both calibration and sharpness, USNRT shows superior performance compared to some recent popular methods for variance prediction, including vanilla variance network, deep ensemble, dropout-based methods, tree-based models, etc. Through comprehensive visualization and analysis, we uncover how USNRT works and show its merits.
翻译:为了改进差异网络的不确定性量化,我们提议了一个新的树结构本地神经网络模型,根据不确定性的异质性,将特征空间分割到多个区域。一棵树建在提供培训数据的基础上,其叶节代表不同区域,这些区域特有的神经网络经过培训,可以预测不确定性的平均值和差异。拟议的不确定分解神经回退树(USNRT)采用了新的分解标准。在每个节点,神经网络首先接受关于完整数据的培训,然后对残余物进行统计测试,以找到与两个具有最显著不确定性异性的区域相对应的最佳分解方法。美国国家遥感中心在计算上是友好的,因为很少有叶节点足够且运行不必要。在广泛的UCI数据集中,在校准和清晰度方面,美国国家遥感中心显示的优劣性,与最近一些流行的差异预测方法相比,包括香草差异网络、深层合金、基于辍学的方法、树基模型等等。通过全面的视觉化和分析,我们展示了美国的工作和优势。