Deep neural networks are known to be vulnerable to security attacks. One particular threat is the Trojan attack, in which an attacker stealthily manipulates a model's behavior through Trojaned training samples that can later be exploited at test time. Guided by basic neuroscientific principles, we discover subtle yet critical structural deviations that characterize Trojaned models. Our analysis uses topological tools, which allow us to model high-order dependencies in the networks, robustly compare different networks, and localize structural abnormalities. One interesting observation is that Trojaned models develop short-cuts from the input to the output layers. Inspired by these observations, we devise a strategy for the robust detection of Trojaned models, which outperforms standard baselines on multiple benchmarks.
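The abstract leaves the topological pipeline unspecified. As one minimal sketch, assume the high-order dependencies are summarized by correlations between neuron activations on a probe set, converted into a distance matrix, and analyzed with 0-dimensional persistent homology; the function `zero_dim_persistence` and the synthetic `activations` below are illustrative assumptions, not the paper's exact method. The 0-dimensional case can be computed from scratch, since the death times of connected components in a Vietoris-Rips filtration coincide with the edge weights of the minimum spanning tree (Kruskal's algorithm).

```python
import numpy as np

def zero_dim_persistence(dist):
    """0-dimensional persistent homology of a Vietoris-Rips filtration.

    Every point is born at filtration value 0; components merge along
    minimum-spanning-tree edges, so the finite death times are exactly
    the MST edge weights found by Kruskal's algorithm.
    """
    n = dist.shape[0]
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Enumerate all pairwise edges, sorted by filtration value (distance).
    i, j = np.triu_indices(n, k=1)
    order = np.argsort(dist[i, j])
    deaths = []
    for k in order:
        a, b = find(i[k]), find(j[k])
        if a != b:                    # two components merge: one class dies
            parent[a] = b
            deaths.append(dist[i[k], j[k]])
    return np.array(deaths)           # n-1 finite death times

# Hypothetical usage: activations of shape (num_samples, num_neurons),
# e.g. recorded by feeding a probe set through the network under test.
rng = np.random.default_rng(0)
activations = rng.normal(size=(256, 32))
corr = np.corrcoef(activations.T)     # neuron-neuron correlation matrix
dist = 1.0 - np.abs(corr)             # correlation distance
np.fill_diagonal(dist, 0.0)
deaths = zero_dim_persistence(dist)
print("persistence summary:", deaths.mean(), deaths.max())
```

Under this reading, summary statistics of such persistence values, computed per model, could serve as features for a downstream classifier that separates Trojaned from clean networks; unusually strong correlations between early- and late-layer neurons would surface as abnormal persistence structure, consistent with the short-cut observation above.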