Constraint Inference in Control Tasks from Expert Demonstrations via Inverse Optimization - 专知论文

Inference · Constraints · Demonstrations · Inverse Optimization · Robotics Applications

April 6, 2023

Dimitris Papadimitriou, Jingqi Li

Inferring unknown constraints is a challenging and crucial problem in many robotics applications. When only expert demonstrations are available, it becomes essential to infer the unknown domain constraints to deploy additional agents effectively. In this work, we propose an approach to infer affine constraints in control tasks after observing expert demonstrations. We formulate the constraint inference problem as an inverse optimization problem, and we propose an alternating optimization scheme that infers the unknown constraints by minimizing a KKT residual objective. We demonstrate the effectiveness of our method in a number of simulations, and show that our method can infer less conservative constraints than a recent baseline method while maintaining comparable safety guarantees.
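
To make the idea of minimizing a KKT residual by alternating optimization concrete, here is a minimal toy sketch in Python. It is not the authors' algorithm or problem setup. It assumes a much simpler setting chosen for illustration: each expert minimizes a known quadratic cost ||x - g||^2 toward a known goal g, subject to a single unknown half-space constraint a·x <= b. The function names (`project_halfspace`, `infer_affine_constraint`) and the closed-form update steps are hypothetical choices for this toy problem. The sketch alternates between the multipliers and the constraint normal to shrink the stationarity residual, then fits the offset b from complementary slackness.

```python
import math


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def project_halfspace(g, a, b):
    # Toy "expert": argmin_x ||x - g||^2 subject to a.x <= b, assuming ||a|| = 1.
    slack = max(0.0, dot(a, g) - b)
    return [gi - slack * ai for gi, ai in zip(g, a)]


def infer_affine_constraint(goals, demos, iters=20):
    """Estimate (a, b) of a.x <= b by alternating minimization of a KKT residual.

    Stationarity for demo i: grad_i + lam_i * a = 0, with lam_i >= 0.
    Complementary slackness: lam_i * (a.x_i - b) = 0.
    """
    n = len(goals[0])
    # Gradient of the assumed cost ||x - g||^2 at each demonstration.
    grads = [[2.0 * (xi - gi) for xi, gi in zip(x, g)]
             for x, g in zip(demos, goals)]

    # Initial normal: direction of the summed negative gradients.
    a = [-sum(r[k] for r in grads) for k in range(n)]
    norm = math.sqrt(dot(a, a))
    if norm == 0.0:
        raise ValueError("demonstrations carry no information about a constraint")
    a = [ak / norm for ak in a]

    for _ in range(iters):
        # Step 1: fix a (unit norm), minimize ||grad_i + lam_i a||^2 over lam_i >= 0.
        lams = [max(0.0, -dot(a, r)) for r in grads]
        denom = sum(l * l for l in lams)
        if denom == 0.0:
            break
        # Step 2: fix lam, minimize sum_i ||grad_i + lam_i a||^2 over a,
        # then renormalize to remove the scale ambiguity between a and lam.
        a = [-sum(l * r[k] for l, r in zip(lams, grads)) / denom for k in range(n)]
        norm = math.sqrt(dot(a, a))
        a = [ak / norm for ak in a]

    # Multipliers for the final normal, then b from complementary slackness:
    # minimize sum_i (lam_i * (a.x_i - b))^2 over b.
    lams = [max(0.0, -dot(a, r)) for r in grads]
    denom = sum(l * l for l in lams)
    if denom > 0.0:
        b = sum(l * l * dot(a, x) for l, x in zip(lams, demos)) / denom
    else:
        b = max(dot(a, x) for x in demos)
    return a, b, lams


if __name__ == "__main__":
    a_true, b_true = [0.6, 0.8], 1.0
    goals = [[2.0, 2.0], [3.0, 1.0], [0.0, 0.0], [1.0, 3.0]]
    demos = [project_halfspace(g, a_true, b_true) for g in goals]
    a_hat, b_hat, lams = infer_affine_constraint(goals, demos)
    print(a_hat, b_hat, lams)
```

In this noiseless toy case the demonstrations that touch the constraint have gradients aligned with the true normal, so the estimate settles almost immediately. With noisy demonstrations, several constraints, or system dynamics, the subproblems no longer have such simple closed forms and would need a proper solver.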

