Rylan Schaeffer

Resume
Publications
Learning
Blog
Teaching
Jokes
Kernel Papers

Reinforcement Learning

Machine Learning RL

Overview

Value-based, policy-based, and actor-critic methods for sequential decision making.

Introduction

Markov Process

Value-Based RL

Rescorla-Wagner Learning Rule
Temporal Difference Learning
Q-Learning
Successor Representation

Policy-Based RL

Actor-Critic RL

Introduction

Hierarchical RL (HRL)

Options

Feudal

Feudal RL (Dayan 1992)
Feudal Networks for HRL (Vezhnevets 2017)

Bisimulation

Distributional RL

Introduction
Categorical (C51)
Expectile Regression
Quantile Regression
Connection to Neuroscience: Dabney et al. 2020

TODO

https://proceedings.neurips.cc/paper/2020/file/9dd16e049becf4d5087c90a83fea403b-Paper.pdf

Distributed RL