元强化学习论文汇总

Challenges of meta-RL

  • design a set of tasks that are interrelated
  • find the inter-representation
  • fast adaptation to new tasks

Papers

environment

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

model-based meta-RL

[Learning to reinforcement learn]({% post_url 2021-06-01-Learning-to-reinforcement-learn %})

[RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning]({% post_url 2021-06-02-RL2 %})

[Prefrontal cortex as a meta-reinforcement learning system]({% post_url 2021-06-07-prefrontal %})

A Simple Neural Attentive Meta-Learner]({% post_url 2021-06-08-snail %})

PixelSNAIL: An Improved Autoregressive Generative Model

Concurrent Meta Reinforcement Learning

  • source: arXiv:1903.02710 preprint
  • method: CMRL
  • environment:
    • N-Monty-Hall
    • N-Color-Choice
    • N-Reacher (Reacher-V2 from gym)
  • paper link: https://arxiv.org/pdf/1903.02710v1.pdf
  • code:
  • interpretation:

Reinforcement Learning, Fast and Slow

Improving Generalization in Meta Reinforcement Learning using Learned Objectives

Discovering Reinforcement Learning Algorithms

Model-based Adversarial Meta-Reinforcement Learning

optimization-based meta-RL

[Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks]({{ site.url }}/2021/06/03/MAML.html)

[On First-Order Meta-Learning Algorithms]({{ site.url }}/2021/06/04/Reptile.html)

Meta-Reinforcement Learning of Structured Exploration Strategies

Some Considerations on Learning to Explore via Meta-Reinforcement Learning

Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning

[Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning]({{ site.url}}/2021/06/06/savn.html)

Meta-Q-Learning

[Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices]({{ site.url }}/2021/06/05/dream.html)

Meta Learning via Learned Loss

未分类

Alchemy: A structured task distribution for meta-reinforcement learning

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

Meta reinforcement learning as task inference

MELD: Meta-Reinforcement Learning from Images via Latent State Models

Meta Reinforcement Learning with Task Embedding and Shared Policy

Fast Adaptive Task Offloading in Edge Computing based on Meta Reinforcement Learning

Learning Associative Inference Using Fast Weight Memory

Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning

Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies

Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning

Causal Reasoning from Meta-reinforcement Learning

Introducing Neuromodulation in Deep Neural Networks to Learn Adaptive Behaviours

Policy Gradient RL Algorithms as Directed Acyclic Graphs

Evolving Inborn Knowledge For Fast Adaptation in Dynamic POMDP Problems

Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads

Hierarchical Meta Reinforcement Learning for Multi-Task Environments

Modeling and Optimization Trade-off in Meta-learning

Meta-Learning of Structured Task Distributions in Humans and Machines

Offline Meta Learning of Exploration

Meta-Reinforcement Learning for Reliable Communication in THz/VLC Wireless VR Networks

posted @ 2021-07-17 22:42  tianyma的技术博客  阅读(411)  评论(0编辑  收藏  举报