量化投影梯度下降对抗训练中模型梯度的优先方向 (Quantifying the Preferential Direction of the Model Gradient in Adversarial Training With Projected Gradient Descent) - 专知论文

会员服务 ·

0

对抗训练 · 对抗 · 梯度 · 投影 · 度量 ·

2023 年 4 月 20 日

Quantifying the Preferential Direction of the Model Gradient in Adversarial Training With Projected Gradient Descent

翻译：量化投影梯度下降对抗训练中模型梯度的优先方向

Ricardo Bigolin Lanfredi,Joyce D. Schroeder,Tolga Tasdizen

from arxiv, This paper was published in Pattern Recognition

Adversarial training, especially projected gradient descent (PGD), has proven to be a successful approach for improving robustness against adversarial attacks. After adversarial training, gradients of models with respect to their inputs have a preferential direction. However, the direction of alignment is not mathematically well established, making it difficult to evaluate quantitatively. We propose a novel definition of this direction as the direction of the vector pointing toward the closest point of the support of the closest inaccurate class in decision space. To evaluate the alignment with this direction after adversarial training, we apply a metric that uses generative adversarial networks to produce the smallest residual needed to change the class present in the image. We show that PGD-trained models have a higher alignment than the baseline according to our definition, that our metric presents higher alignment values than a competing metric formulation, and that enforcing this alignment increases the robustness of models.

翻译：---- 对抗训练，尤其是投影梯度下降（PGD），已被证明是提高对抗攻击鲁棒性的成功方法。经过对抗训练后，模型相对于输入的梯度具有优先方向。然而，对齐的方向在数学上并未得到很好的证明，因此难以进行定量评估。我们提出了一种新的定义，将该方向定义为指向决策空间中最接近的错误类别的支持点的向量方向。为了评估对抗性训练后该方向的对齐情况，我们应用了一个度量标准，利用生成对抗网络产生最小的残差以更改图像中存在的类别。我们表明，PGD训练出的模型根据我们的定义具有更高的对齐度，我们的度量标准与竞争度量公式相比具有更高的对齐度值，并且强制实现此对齐度可以提高模型的稳健性。

0

相关内容

对抗训练

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【斯坦福&Facebook】生成式对抗变换器，Generative Adversarial Transformers

专知会员服务

21+阅读 · 2021年4月21日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

专知会员服务

34+阅读 · 2020年2月27日

【论文】生成式教学网络:通过学习生成合成训练数据来加速神经结构搜索（Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data）

【论文】生成式教学网络:通过学习生成合成训练数据来加速神经结构搜索（Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data）

专知会员服务

14+阅读 · 2019年11月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

ICLR 2022 | 基于对抗自注意力机制的预训练语言模型

ICLR 2022 | 基于对抗自注意力机制的预训练语言模型

PaperWeekly

1+阅读 · 2022年7月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

专知

31+阅读 · 2018年6月4日

OpenAI ICLR 2018论文汇总：主要兴趣为强化学习

OpenAI ICLR 2018论文汇总：主要兴趣为强化学习

论智

19+阅读 · 2018年5月1日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

分布式优化算法及其隐私保护策略研究

国家自然科学基金

2+阅读 · 2013年12月31日

基于系统参数化理论的信号稀疏表示和观测系统优化设计

国家自然科学基金

0+阅读 · 2013年12月31日

基于叠加训练（ST）信道估计的相干光正交频分复用系统研究

国家自然科学基金

0+阅读 · 2013年12月31日

OFDM传输体制的MIMO雷达自通信系统研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于结构模型及统计模型的图像质量评价算法的研究

国家自然科学基金

0+阅读 · 2012年12月31日

MIMO认知无线电系统的最优线性联合收发机设计的统一框架研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于虚拟成阵技术的DIFAR浮标网络测向方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

RICCI流的整体解和收敛性

国家自然科学基金

0+阅读 · 2012年12月31日

Nrf2-AREs和NF-κB信号通路及其交互作用在低水平砷诱导细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

基于Grouplet变换的SAR图像压缩感知编码

国家自然科学基金

0+阅读 · 2009年12月31日

Online Learning under Adversarial Nonlinear Constraints

Arxiv

0+阅读 · 2023年6月6日

Robust Universal Adversarial Perturbations

Arxiv

0+阅读 · 2023年6月6日

Understanding the Effectiveness of Early Weight Averaging for Training Large Language Models

Arxiv

0+阅读 · 2023年6月5日

On the complexity of isomorphism problems for tensors, groups, and polynomials III: actions by classical groups

Arxiv

0+阅读 · 2023年6月5日

Practical Differentially Private Hyperparameter Tuning with Subsampling

Arxiv

0+阅读 · 2023年6月4日

On the Reduction in Accuracy of Finite Difference Schemes on Manifolds without Boundary

Arxiv

0+阅读 · 2023年6月2日

Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense

Arxiv

0+阅读 · 2023年6月2日

Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization

Arxiv

0+阅读 · 2023年6月2日

Extragradient SVRG for Variational Inequalities: Error Bounds and Increasing Iterate Averaging

Arxiv

0+阅读 · 2023年6月1日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

VIP会员

文章信息

相关主题

相关VIP内容

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【斯坦福&Facebook】生成式对抗变换器，Generative Adversarial Transformers

专知会员服务

21+阅读 · 2021年4月21日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

专知会员服务

34+阅读 · 2020年2月27日

【论文】生成式教学网络:通过学习生成合成训练数据来加速神经结构搜索（Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data）

【论文】生成式教学网络:通过学习生成合成训练数据来加速神经结构搜索（Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data）

专知会员服务

14+阅读 · 2019年11月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】基础模型训练中网络规模数据的负责任与高效使用

《俄乌战争背景下俄罗斯的战略性海军分析（2022-2025年）》最新100页报告

人工智能时代背景下的未来海战

相关资讯

ICLR 2022 | 基于对抗自注意力机制的预训练语言模型

ICLR 2022 | 基于对抗自注意力机制的预训练语言模型

PaperWeekly

1+阅读 · 2022年7月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

【论文推荐】最新四篇CVPR2018 视频描述生成相关论文—双向注意力、Transformer、重构网络、层次强化学习

专知

31+阅读 · 2018年6月4日

OpenAI ICLR 2018论文汇总：主要兴趣为强化学习

OpenAI ICLR 2018论文汇总：主要兴趣为强化学习

论智

19+阅读 · 2018年5月1日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Online Learning under Adversarial Nonlinear Constraints

Arxiv

0+阅读 · 2023年6月6日

Robust Universal Adversarial Perturbations

Arxiv

0+阅读 · 2023年6月6日

Understanding the Effectiveness of Early Weight Averaging for Training Large Language Models

Arxiv

0+阅读 · 2023年6月5日

On the complexity of isomorphism problems for tensors, groups, and polynomials III: actions by classical groups

Arxiv

0+阅读 · 2023年6月5日

Practical Differentially Private Hyperparameter Tuning with Subsampling

Arxiv

0+阅读 · 2023年6月4日

On the Reduction in Accuracy of Finite Difference Schemes on Manifolds without Boundary

Arxiv

0+阅读 · 2023年6月2日

Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense

Arxiv

0+阅读 · 2023年6月2日

Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization

Arxiv

0+阅读 · 2023年6月2日

Extragradient SVRG for Variational Inequalities: Error Bounds and Increasing Iterate Averaging

Arxiv

0+阅读 · 2023年6月1日

Prime Sample Attention in Object Detection

Arxiv

13+阅读 · 2019年4月9日

相关基金

分布式优化算法及其隐私保护策略研究

国家自然科学基金

2+阅读 · 2013年12月31日

基于系统参数化理论的信号稀疏表示和观测系统优化设计

国家自然科学基金

0+阅读 · 2013年12月31日

基于叠加训练（ST）信道估计的相干光正交频分复用系统研究

国家自然科学基金

0+阅读 · 2013年12月31日

OFDM传输体制的MIMO雷达自通信系统研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于结构模型及统计模型的图像质量评价算法的研究

国家自然科学基金

0+阅读 · 2012年12月31日

MIMO认知无线电系统的最优线性联合收发机设计的统一框架研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于虚拟成阵技术的DIFAR浮标网络测向方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

RICCI流的整体解和收敛性

国家自然科学基金

0+阅读 · 2012年12月31日

Nrf2-AREs和NF-κB信号通路及其交互作用在低水平砷诱导细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

基于Grouplet变换的SAR图像压缩感知编码

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员