Robust Subtask Learning for Compositional Generalization

June 8, 2023

Kishor Jothimurugan, Steve Hsu, Osbert Bastani, Rajeev Alur

Compositional reinforcement learning is a promising approach for training policies to perform complex long-horizon tasks. Typically, a high-level task is decomposed into a sequence of subtasks, and a separate policy is trained to perform each subtask. In this paper, we focus on the problem of training subtask policies so that they can be used to perform any task; here, a task is given by a sequence of subtasks. We aim to maximize the worst-case performance over all tasks rather than the average-case performance. We formulate the problem as a two-agent zero-sum game in which the adversary picks the sequence of subtasks. We propose two RL algorithms to solve this game: one is an adaptation of existing multi-agent RL algorithms to our setting, and the other is an asynchronous version that enables parallel training of subtask policies. We evaluate our approach on two multi-task environments with continuous states and actions and demonstrate that our algorithms outperform state-of-the-art baselines.
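The difference between the worst-case and average-case objectives can be illustrated with a toy calculation. The sketch below is not the paper's algorithm: the `SUCCESS` table, the subtask names, and the assumption that a task's value is the product of per-subtask success probabilities are all hypothetical, chosen only to show how an adversary that picks the subtask sequence exposes a weak transition that the average hides.

```python
from itertools import product
from math import prod

# Hypothetical numbers: SUCCESS[prev][nxt] is the chance that the policy for
# subtask `nxt` succeeds when it starts where subtask `prev` ended.
SUCCESS = {
    "start": {"a": 0.9, "b": 0.9},
    "a": {"a": 0.9, "b": 0.5},
    "b": {"a": 0.8, "b": 0.9},
}
SUBTASKS = ("a", "b")


def task_value(task):
    """Success probability of a task, where a task is a sequence of subtasks."""
    prevs = ("start",) + tuple(task[:-1])
    return prod(SUCCESS[p][n] for p, n in zip(prevs, task))


def evaluate(length):
    """Return (average value, worst-case value, adversary's task) for a given length."""
    tasks = list(product(SUBTASKS, repeat=length))
    values = [task_value(t) for t in tasks]
    # The adversary in this toy model simply enumerates tasks and takes the minimum.
    worst = min(range(len(tasks)), key=values.__getitem__)
    return sum(values) / len(values), values[worst], tasks[worst]


if __name__ == "__main__":
    average, worst_value, worst_task = evaluate(2)
    print(f"average={average:.4f} worst={worst_value:.4f} task={worst_task}")
```

In this toy table the average over length-2 tasks looks reasonable, while the adversary's choice of `a` followed by `b` isolates the one poorly handled transition. That is the kind of gap a worst-case objective is meant to close.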
