深入学习不确定性工具包调查 (A Survey on Uncertainty Toolkits for Deep Learning)

The success of deep learning (DL) fostered the creation of unifying frameworks such as tensorflow or pytorch as much as it was driven by their creation in return. Having common building blocks facilitates the exchange of, e.g., models or concepts and makes developments easier replicable. Nonetheless, robust and reliable evaluation and assessment of DL models has often proven challenging. This is at odds with their increasing safety relevance, which recently culminated in the field of "trustworthy ML". We believe that, among others, further unification of evaluation and safeguarding methodologies in terms of toolkits, i.e., small and specialized framework derivatives, might positively impact problems of trustworthiness as well as reproducibility. To this end, we present the first survey on toolkits for uncertainty estimation (UE) in DL, as UE forms a cornerstone in assessing model reliability. We investigate 11 toolkits with respect to modeling and evaluation capabilities, providing an in-depth comparison for the three most promising ones, namely Pyro, Tensorflow Probability, and Uncertainty Quantification 360. While the first two provide a large degree of flexibility and seamless integration into their respective framework, the last one has the larger methodological scope.

翻译：深层次学习的成功(DL)促进了统一框架的建立,如龙卷风或热火炉等,而这种框架的创建正是由它们所驱动的。共同的构件有助于交换模型或概念,使发展更容易复制。然而,对DL模型的有力和可靠的评价和评估往往证明具有挑战性。这与这些模型日益增加的安全相关性不相符合,最近在“可信赖的ML”领域达到了高潮。我们认为,除其他外,进一步统一评估和维护工具包方面的方法,即小型和专门的框架衍生物,可能会对可信任性和可复制性产生积极的影响。为此目的,我们介绍DL的不确定性估算工具包(UE)的第一次调查,作为评估模型可靠性的基石。我们调查了11个关于建模和评价能力的工具包,为三种最有前途的工具包,即Pyro、Tensorpro Probility和不确定性量化360提供了深入的比较。前两个工具包提供了较大程度的灵活性和无缝合的各自框架。最后一种是更大的方法。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日