Reliable Offline Model-based Optimization for Industrial Process Control - Zhuanzhi (专知) Papers


May 15, 2022

Reliable Offline Model-based Optimization for Industrial Process Control


Cheng Feng, Jinyan Guan

In the research area of offline model-based optimization, novel and promising methods are frequently developed. However, implementing such methods in real-world industrial systems, such as production lines for process control, is often a frustrating process. In this work, we address two important problems in order to extend the current success of offline model-based optimization to industrial process control: 1) how to learn a reliable dynamics model from offline data for industrial processes, and 2) how to learn a reliable but not over-conservative control policy from offline data by utilizing existing model-based optimization algorithms. Specifically, we propose a dynamics model based on an ensemble of conditional generative adversarial networks to achieve accurate reward calculation in industrial scenarios. Furthermore, we propose an epistemic-uncertainty-penalized reward evaluation function which can effectively avoid giving over-estimated rewards to out-of-distribution inputs during the learning/searching of the optimal control policy. We provide extensive experiments with the proposed method on two representative cases (a discrete control case and a continuous control case), showing that our method compares favorably to several baselines in offline policy learning for industrial process control.
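The idea of an epistemic-uncertainty-penalized reward can be illustrated with a minimal Python sketch. The abstract does not give the exact form of the evaluation function, so everything here is an assumption made for illustration: the penalty is taken to be the standard deviation of the ensemble's predictions, the names `penalized_reward`, `best_action`, `penalty_weight`, and `toy_models` are hypothetical, and simple closed-form functions stand in for the trained conditional GAN ensemble.

```python
from statistics import mean, pstdev
from typing import Callable, Sequence

# Hypothetical type: each ensemble member maps (state, action) to a predicted reward.
RewardModel = Callable[[float, float], float]


def penalized_reward(models: Sequence[RewardModel], state: float, action: float,
                     penalty_weight: float = 1.0) -> float:
    """Ensemble-mean reward minus a penalty proportional to ensemble disagreement.

    Assumption: disagreement (population standard deviation across members)
    is used as a proxy for epistemic uncertainty.
    """
    predictions = [model(state, action) for model in models]
    disagreement = pstdev(predictions)
    return mean(predictions) - penalty_weight * disagreement


def best_action(models: Sequence[RewardModel], state: float,
                candidates: Sequence[float], penalty_weight: float = 1.0) -> float:
    """Pick the candidate action with the highest penalized reward."""
    return max(candidates,
               key=lambda a: penalized_reward(models, state, a, penalty_weight))


# Toy stand-ins for trained ensemble members (not the paper's models).
# They agree for |action| <= 1 (treated as "in distribution") and
# diverge for larger actions (treated as "out of distribution").
toy_models = [
    lambda s, a, c=c: a + c * max(0.0, abs(a) - 1.0)
    for c in (-2.0, 0.0, 2.0)
]

if __name__ == "__main__":
    candidates = [0.0, 1.0, 3.0]
    print(best_action(toy_models, 0.0, candidates, penalty_weight=0.0))
    print(best_action(toy_models, 0.0, candidates, penalty_weight=1.0))
```

In this toy setting, a search without the penalty prefers the action where the members disagree most, while the penalized search falls back to an action on which the ensemble agrees. The weight on the penalty controls how conservative the resulting policy is.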

