Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient - Zhuanzhi (专知) Papers


Deterministic policy · Policy gradient · Vaccine · Epidemic model · Gradient

April 10, 2023

Epidemic Control on a Large-Scale-Agent-Based Epidemiology Model using Deep Deterministic Policy Gradient


Gaurav Deshkar, Jayanta Kshirsagar, Harshal Hayatnagarkar, Janani Venugopalan

To mitigate the impact of the pandemic, several measures have been used, including lockdowns, rapid vaccination programs, school closures, and economic stimulus. These interventions can have positive or unintended negative consequences. Current research to model and determine an optimal intervention automatically through round-tripping is limited by the simulation objectives, the scale (a few thousand individuals), model types that are not suited for intervention studies, and the number of intervention strategies that can be explored (discrete vs. continuous). We address these challenges using a Deep Deterministic Policy Gradient (DDPG) based policy optimization framework on a large-scale (100,000-individual) epidemiological agent-based simulation, on which we perform multi-objective optimization. We determine the optimal policy for lockdown and vaccination in a minimalist age-stratified multi-vaccine scenario with a basic simulation of economic activity. With no lockdown and with vaccination of the mid-age and elderly groups, results show an optimal economy (measured by individuals below the poverty line) with balanced health objectives (infection and hospitalization). An in-depth simulation is needed to further validate our results and to open-source our framework.

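The abstract combines three ideas: continuous intervention actions, several competing objectives, and DDPG. The sketch below illustrates one way these pieces could fit together; it is not the authors' implementation. The action layout (lockdown level plus vaccination shares for two age groups), the weighted-sum reward, the equal weights, and the Gaussian exploration noise are all assumptions made for illustration. Only the soft target-network update is a standard part of DDPG.

```python
import random


def scalarized_reward(infected_frac, hospitalized_frac, below_poverty_frac,
                      weights=(1.0, 1.0, 1.0)):
    """Collapse three costs into one reward via a weighted sum.

    Hypothetical scalarization: every input is a fraction of the population
    that we want to keep small, so the weighted sum is negated.
    """
    w_inf, w_hosp, w_pov = weights
    return -(w_inf * infected_frac
             + w_hosp * hospitalized_frac
             + w_pov * below_poverty_frac)


def noisy_action(policy_action, noise_std=0.1, rng=random):
    """Add Gaussian exploration noise to a deterministic action.

    Each component is clipped to [0, 1] so it stays a valid intensity/share.
    Assumed layout: [lockdown_level, vacc_share_mid_age, vacc_share_elderly].
    """
    return [min(1.0, max(0.0, a + rng.gauss(0.0, noise_std)))
            for a in policy_action]


def soft_update(target_params, online_params, tau=0.005):
    """DDPG-style soft update: target <- (1 - tau) * target + tau * online."""
    return [(1.0 - tau) * t + tau * o
            for t, o in zip(target_params, online_params)]


if __name__ == "__main__":
    # No lockdown, vaccinate mid-age and elderly (illustrative values only).
    action = noisy_action([0.0, 0.5, 0.5])
    reward = scalarized_reward(0.10, 0.02, 0.30)
    print(action, reward)
```

In a full training loop, the actor network would produce `policy_action` from the simulation state, and the critic would be trained on rewards such as `scalarized_reward`. Changing `weights` shifts the trade-off between health and economic objectives.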
