Strategic Defense of Feedback-Controlled Parallel Queues against Reliability and Security Failures - 专知论文

Markov · threshold · scenario · controller · dynamic programming

January 27, 2023

Qian Xie, Jiayi Wang, Li Jin

From arXiv; submitted to Automatica

Parallel traffic service systems such as transportation, manufacturing, and computer systems typically involve feedback control (e.g., dynamic routing) to ensure stability and to improve throughput. Such control relies on connected cyber components for computation and communication. These components are susceptible to random malfunctions and malicious attacks, which motivates the design of strategic defenses that are both traffic-stabilizing and cost-efficient under reliability/security failures. In this paper, we consider a parallel queuing system with dynamic routing subject to such failures. For the reliability setting, we consider an infinite-horizon Markov decision process where the system operator strategically activates the protection mechanism upon each job arrival based on the traffic state. We use the Hamilton-Jacobi-Bellman equation to show that the optimal protection strategy is a deterministic threshold policy. For the security setting, we extend the model to an infinite-horizon stochastic game where the attacker strategically manipulates routing assignments. We show that a Markov perfect equilibrium of this game always exists and that both players follow a threshold strategy at each equilibrium. For both settings, we also consider the stability of the traffic queues in the face of failures. Finally, we develop approximate dynamic programming algorithms to compute the optimal/equilibrium policies and present numerical examples for validation and illustration.
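To make the reliability setting more concrete, the following is a minimal sketch of a protection MDP on two parallel queues. The abstract does not give the model details, so everything here is an assumption made for illustration: the function name `solve_toy_protection_mdp`, the parameters (`lam`, `mu`, `p_fail`, `c_protect`, `rho`, `n_max`), the failure model (an unprotected arrival is misrouted to a uniformly random queue with probability `p_fail`), the linear holding cost, and the truncation of queue lengths at `n_max`. It uses plain value iteration on the uniformized discounted problem, not the approximate dynamic programming algorithms developed in the paper.

```python
import numpy as np


def solve_toy_protection_mdp(lam=1.0, mu=1.0, p_fail=0.5, c_protect=0.2,
                             rho=0.3, n_max=20, tol=1e-9, max_iter=10_000):
    """Value iteration for a toy two-queue protection MDP (illustrative only).

    Assumed model, not taken from the paper:
      * Poisson arrivals at rate lam, two exponential servers at rate mu each.
      * Protected arrival (cost c_protect): job joins the shorter queue.
      * Unprotected arrival: with probability p_fail the job joins a
        uniformly random queue, otherwise it joins the shorter queue.
      * Holding cost x1 + x2 per unit time, discount rate rho.
      * Queue lengths are clipped at n_max to keep the state space finite.
    """
    idx = np.arange(n_max + 1)
    x1, x2 = np.meshgrid(idx, idx, indexing="ij")
    up1, up2 = np.minimum(x1 + 1, n_max), np.minimum(x2 + 1, n_max)
    dn1, dn2 = np.maximum(x1 - 1, 0), np.maximum(x2 - 1, 0)
    holding = (x1 + x2).astype(float)
    total_rate = lam + 2.0 * mu  # uniformization constant

    def q_values(V):
        """Cost-to-go right after an arrival, for each action."""
        v_join1, v_join2 = V[up1, x2], V[x1, up2]
        v_short = np.where(x1 <= x2, v_join1, v_join2)
        q_open = p_fail * 0.5 * (v_join1 + v_join2) + (1.0 - p_fail) * v_short
        q_protect = c_protect + v_short
        return q_open, q_protect

    V = np.zeros((n_max + 1, n_max + 1))
    for _ in range(max_iter):
        q_open, q_protect = q_values(V)
        arrival = np.minimum(q_open, q_protect)
        # A service event at an empty queue is a dummy self-transition.
        service = mu * (V[dn1, x2] + V[x1, dn2])
        V_new = (holding + lam * arrival + service) / (total_rate + rho)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break

    q_open, q_protect = q_values(V)
    policy = (q_protect < q_open).astype(int)  # 1 = protect, 0 = do not
    return V, policy


if __name__ == "__main__":
    V, policy = solve_toy_protection_mdp()
    # Rows index the length of queue 1, columns the length of queue 2.
    print(policy[:8, :8])
```

In this toy model, protection never pays off when the two queues are equally long, because a misrouted job then lands in a symmetric state. Printing `policy` shows in which unbalanced states the computed policy switches protection on, which can be compared informally with the threshold structure stated in the abstract. The sketch does not verify that structure.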

