We study automated intrusion response and formulate the interaction between an attacker and a defender as an optimal stopping game in which attack and defense strategies evolve through reinforcement learning and self-play. The game-theoretic modeling enables us to find defender strategies that are effective against a dynamic attacker, i.e., an attacker that adapts its strategy in response to the defender's strategy. Further, the optimal stopping formulation allows us to prove that optimal strategies have threshold properties. To obtain near-optimal defender strategies, we develop Threshold Fictitious Self-Play (T-FP), a fictitious self-play algorithm that learns Nash equilibria through stochastic approximation. We show that T-FP outperforms a state-of-the-art algorithm on our use case. The experimental part of this investigation includes two systems: a simulation system, where defender strategies are incrementally learned, and an emulation system, where statistics are collected that drive simulation runs and where learned strategies are evaluated. We argue that this approach can produce effective defender strategies for a practical IT infrastructure.
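The core idea behind learning equilibria through self-play can be illustrated with classical fictitious play: each player repeatedly best-responds to the opponent's empirical (time-averaged) strategy, and in zero-sum games the empirical averages converge to a Nash equilibrium. The toy sketch below runs fictitious play on matching pennies; it is an illustrative assumption-laden stand-in, not the paper's T-FP algorithm (which learns threshold strategies via stochastic approximation rather than computing exact best responses on a matrix game).

```python
import numpy as np

# Fictitious self-play on matching pennies (zero-sum).
# Each iteration, both players best-respond to the opponent's
# empirical average strategy; the averages converge to the
# Nash equilibrium (0.5, 0.5).
A = np.array([[1.0, -1.0], [-1.0, 1.0]])  # row player's payoffs

counts_row = np.ones(2)  # empirical action counts (uniform prior)
counts_col = np.ones(2)

for _ in range(5000):
    avg_col = counts_col / counts_col.sum()
    avg_row = counts_row / counts_row.sum()
    br_row = int(np.argmax(A @ avg_col))     # row maximizes its payoff
    br_col = int(np.argmax(-(avg_row @ A)))  # column minimizes row's payoff
    counts_row[br_row] += 1
    counts_col[br_col] += 1

print(counts_row / counts_row.sum())  # approaches [0.5, 0.5]
```

In the paper's setting, the strategy space is restricted to threshold policies (justified by the proven threshold properties of optimal strategies), which reduces the best-response computation to a low-dimensional search.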