Explicitly show the relationships between various techniques of deep reinforcement learning methods.
Dedicated for learning and researching on DRL.
Policy gradient methods
Equivalence Between Policy Gradients and Soft Q-Learning
Trust Region Policy Optimization
Reinforcement Learning with Deep Energy-Based Policies
Q-PROP: SAMPLE-EFFICIENT POLICY GRADIENT WITH AN OFF-POLICY CRITIC
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning 1 Jun 2017
Explorations in DRL
The Reactor: A Sample-Efficient Actor-Critic Architecture 15 Apr 2017
SAMPLE EFFICIENT ACTOR-CRITIC WITH EXPERIENCE REPLAY
REINFORCEMENT LEARNING WITH UNSUPERVISED AUXILIARY TASKS
Continuous control with deep reinforcement learning
Connection with other methods
Connecting value and policy methods
Apply RL to other domains