Learning from demonstration (LfD) has succeeded in tasks featuring a long time horizon. However, when the problem complexity also includes human-in-the-loop perturbations, state-of-the-art approaches do not guarantee the successful reproduction of a task. In this work, we identify the roots of this challenge as the failure of a learned continuous policy to satisfy the discrete plan implicit in the demonstration. By utilizing modes (rather than subgoals) as the discrete abstraction and motion policies with both mode invariance and goal reachability properties, we prove our learned continuous policy can simulate any discrete plan specified by a linear temporal logic (LTL) formula. Consequently, an imitator is robust to both task- and motion-level perturbations and guaranteed to achieve task success. Project page: https://sites.google.com/view/ltl-ds
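To make the central claim more concrete, below is a minimal sketch (not the paper's implementation) of the idea that a continuous rollout, abstracted into a sequence of discrete modes, can be checked against a discrete plan specified by an LTL formula. The mode names, the hand-compiled automaton, and the recovery transition are illustrative assumptions.

```python
from typing import Iterable

# Discrete plan as a deterministic automaton over mode symbols, standing in
# for an LTL formula that requires grasping before transporting and
# transporting before releasing. Mode names and transitions are assumptions
# for illustration, not taken from the paper.
PLAN = {
    ("q0", "REACH"): "q0",
    ("q0", "GRASP"): "q1",
    ("q1", "TRANSPORT"): "q1",
    ("q1", "REACH"): "q0",      # recovery: object dropped, go back and re-grasp
    ("q1", "RELEASE"): "q_acc",
}
ACCEPTING = {"q_acc"}


def satisfies_plan(mode_trace: Iterable[str]) -> bool:
    """Check whether the mode sequence induced by a continuous rollout
    simulates the discrete plan encoded in PLAN."""
    state, prev_mode = "q0", None
    for mode in mode_trace:
        if mode == prev_mode:
            continue  # staying inside a mode (mode invariance) never falsifies the plan
        if (state, mode) not in PLAN:
            return False  # the rollout has left every plan-satisfying path
        state, prev_mode = PLAN[(state, mode)], mode
    return state in ACCEPTING


# A nominal rollout and a perturbed rollout that recovers by re-grasping both
# satisfy the plan; a rollout that skips grasping cannot.
assert satisfies_plan(["REACH", "GRASP", "TRANSPORT", "RELEASE"])
assert satisfies_plan(["REACH", "GRASP", "REACH", "GRASP", "TRANSPORT", "RELEASE"])
assert not satisfies_plan(["REACH", "TRANSPORT", "RELEASE"])
```

In this toy view, mode invariance corresponds to self-loops (remaining in a mode never violates the plan), while goal reachability guarantees that each mode transition demanded by the plan can actually be driven to completion, even after perturbations.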