Reinforcement Learning
Overview
Value-based, policy-based, and actor-critic methods for sequential decision making.
Introduction
Value-Based RL
- Rescorla-Wagner Learning Rule
- Temporal Difference Learning
- Q-Learning
- Successor Representation
Policy-Based RL
Actor-Critic RL
Hierarchical RL (HRL)
Options
Feudal
- Feudal RL (Dayan 1992)
- Feudal Networks for HRL (Vezhnevets 2017)
Bisimulation
Distributional RL
- Introduction
- Categorical (C51)
- Expectile Regression
- Connection to Neuroscience: Dabney et al. 2020
TODO
- https://proceedings.neurips.cc/paper/2020/file/9dd16e049becf4d5087c90a83fea403b-Paper.pdf
