The use of deep neural networks (DNNs) in safety-critical applications such as mobile health and autonomous driving is challenging due to numerous model-inherent shortcomings. These shortcomings are diverse, ranging from a lack of generalization and insufficient interpretability to vulnerability to malicious inputs. Cyber-physical systems employing DNNs are therefore likely to suffer from safety concerns. In recent years, a zoo of state-of-the-art techniques aiming to address these safety concerns has emerged. This work provides a structured and broad overview of them. We first identify categories of insufficiencies and then describe research activities aiming at their detection, quantification, or mitigation. Our paper addresses both machine learning experts and safety engineers: the former may profit from the broad range of machine learning topics covered and the discussions of the limitations of recent methods; the latter may gain insights into the specifics of modern ML methods. We moreover hope that our contribution fuels discussions on desiderata for ML systems and on strategies for advancing existing approaches accordingly.