Given an environment (e.g., a simulator) for evaluating samples in a specified design space and a set of weighted evaluation metrics, one can use Theta-Resonance, a single-step Markov Decision Process (MDP), to train an intelligent agent that produces progressively better samples. In Theta-Resonance, a neural network consumes a constant input tensor and produces a policy as a set of conditional probability density functions (PDFs) for sampling each design dimension. We specialize existing policy-gradient algorithms from deep reinforcement learning (D-RL) to use evaluation feedback (in terms of cost, penalty, or reward) to update our policy network with robust algorithmic stability and a minimal number of design evaluations. We study multiple neural architectures for our policy network in the context of a simple SoC design space, and we propose a method for constructing synthetic space-exploration problems to compare and improve design space exploration (DSE) algorithms. Although we present only categorical design spaces, we also outline how Theta-Resonance can be used to explore continuous and mixed continuous-discrete design spaces.
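To make the formulation concrete, the following is a minimal sketch, not the authors' implementation, assuming PyTorch. All names (ThetaResonancePolicy, reinforce_step, toy_evaluate) are hypothetical illustrations of the abstract's idea: a network with a constant input tensor emits one categorical PDF per design dimension, and a REINFORCE-style policy-gradient update consumes the evaluation feedback. The synthetic objective at the end is an illustrative stand-in, not the paper's proposed construction.

import torch
import torch.nn as nn

class ThetaResonancePolicy(nn.Module):
    """Maps a constant input tensor to one categorical PDF per design dimension."""
    def __init__(self, dim_sizes, hidden=64):
        super().__init__()
        self.register_buffer("const_input", torch.ones(1, hidden))  # constant input tensor
        self.trunk = nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU())
        self.heads = nn.ModuleList(nn.Linear(hidden, n) for n in dim_sizes)

    def forward(self):
        h = self.trunk(self.const_input)
        # One categorical distribution per design dimension.
        return [torch.distributions.Categorical(logits=head(h)) for head in self.heads]

def reinforce_step(policy, optimizer, evaluate, baseline=0.0):
    """One episode of the single-step MDP: sample a design, evaluate it, update the policy."""
    dists = policy()
    design = [d.sample() for d in dists]                     # one choice per dimension
    log_prob = torch.stack([d.log_prob(c) for d, c in zip(dists, design)]).sum()
    reward = evaluate([int(c) for c in design])              # environment/simulator feedback
    loss = -(reward - baseline) * log_prob                   # REINFORCE objective
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return design, reward

# Toy synthetic objective (hypothetical): reward counts how many dimensions
# match a hidden optimal design in a 4-dimensional categorical space.
dim_sizes = [4, 4, 4, 4]
optimum = [1, 3, 0, 2]
def toy_evaluate(design):
    return float(sum(int(a == b) for a, b in zip(design, optimum)))

policy = ThetaResonancePolicy(dim_sizes)
opt = torch.optim.Adam(policy.parameters(), lr=1e-2)
for _ in range(200):
    design, reward = reinforce_step(policy, opt, toy_evaluate)

Because the MDP is single-step, there is no state transition: each episode is one sample-evaluate-update cycle, and the constant input makes the policy network itself the only learned object.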