多维度不确定度量化方法应用于深度神经网络 (Multidimensional Uncertainty Quantification for Deep Neural Networks)

Deep neural networks (DNNs) have received tremendous attention and achieved great success in various applications, such as image and video analysis, natural language processing, recommendation systems, and drug discovery. However, inherent uncertainties derived from different root causes have been realized as serious hurdles for DNNs to find robust and trustworthy solutions for real-world problems. A lack of consideration of such uncertainties may lead to unnecessary risk. For example, a self-driving autonomous car can misdetect a human on the road. A deep learning-based medical assistant may misdiagnose cancer as a benign tumor. In this work, we study how to measure different uncertainty causes for DNNs and use them to solve diverse decision-making problems more effectively. In the first part of this thesis, we develop a general learning framework to quantify multiple types of uncertainties caused by different root causes, such as vacuity (i.e., uncertainty due to a lack of evidence) and dissonance (i.e., uncertainty due to conflicting evidence), for graph neural networks. We provide a theoretical analysis of the relationships between different uncertainty types. We further demonstrate that dissonance is most effective for misclassification detection and vacuity is most effective for Out-of-Distribution (OOD) detection. In the second part of the thesis, we study the significant impact of OOD objects on semi-supervised learning (SSL) for DNNs and develop a novel framework to improve the robustness of existing SSL algorithms against OODs. In the last part of the thesis, we create a general learning framework to quantity multiple uncertainty types for multi-label temporal neural networks. We further develop novel uncertainty fusion operators to quantify the fused uncertainty of a subsequence for early event detection.

翻译：深度神经网络在图像和视频分析、自然语言处理、推荐系统和药物发现等各方面应用广泛，但来自不同根源的内在不确定性成为深度神经网络寻找强健、可信解决方案的严重障碍。忽略这种不确定性可能会带来无谓的风险，例如自动驾驶汽车可能发生人行道上不明人物检测错误，深度学习为基础的医疗助手可能将癌症诊断为良性肿瘤。本研究旨在研究如何衡量DNN中的不同不确定性因素并利用它们更有效地解决各种决策问题。第一部分中，我们提出了针对图神经网络多种不同根源造成的多个不确定度量化方法的学习框架，如空虚（即由于缺乏证据导致的不确定性）和冲突（即由于相互冲突的证据导致的不确定性）。我们提供了不同不确定性类型之间关系的理论分析。我们进一步证明了，与其他类型不确定相关，冲突对于误分类检测最为有效，而空虚对于超出分布范围的检测最为有效。在论文的第二部分中，我们研究了过度的类别对深度学习半监督学习（SSL）的重要影响，并开发了一种新的框架来提高现有SSL算法对超出分布范围对象的鲁棒性。在该论文的最后部分，我们创建了一个学习框架，针对多标签时间神经网络量化多种不确定性类型，并制定了新型不确定度融合操作符，以量化子序列的融合不确定性，以实现早期事件检测。