Rylan Schaeffer

Logo
Resume
Publications
Learning
Blog
Teaching
Jokes
Kernel Papers


Reinforcement Learning

Overview

Value-based, policy-based, and actor-critic methods for sequential decision making.

Introduction

Value-Based RL

Policy-Based RL

Actor-Critic RL

Hierarchical RL (HRL)

Options

Feudal

  • Feudal RL (Dayan 1992)
  • Feudal Networks for HRL (Vezhnevets 2017)

Bisimulation

Distributional RL

TODO

  • https://proceedings.neurips.cc/paper/2020/file/9dd16e049becf4d5087c90a83fea403b-Paper.pdf

Distributed RL