Introduction to Reinforcement Learning

Spring 2022

This page presents lecture materials for CS 4789/5789: Introduction to Reinforcement Learning taught by Sarah Dean at Cornell University in spring 2022. For the most recent materials look here . This course was first taught by Wen Sun in spring 2021.

Schedule

Date no. Lecture Title Materials
1/24 1 Introduction to RL Lecture Notes
Slides, Live Notes, Video
1/26 2 MDPs and Bellman Equations Lecture Notes
Slides, Live Notes, Video
1/31 3 MDPs, Optimal Policies, and Value Iteration Lecture Notes
Slides, Live Notes, Video
2/2 4 Policy Iteration and Dynamic Programming Lecture Notes
Slides, Live Notes, Video
2/7 5 Continuous Control Lecture Notes
Slides, Live Notes, Video
2/9 6 Linear Quadratic Regulation Lecture Notes
Slides, Live Notes, Video
2/14 7 Nonlinear Control Lecture Notes
Slides, Live Notes, Video
2/16 8 Limitations in Control and Observation Lecture Notes
Slides, Live Notes, Video
2/21 9 Prediction and Estimation Lecture Notes
Slides, Live Notes, Video
2/23 10 Model-based RL Lecture Notes
Slides, Live Notes, Video
2/28 February Break
3/2 11 Approximate and Conservative Policy Iteration Lecture Notes
Slides, Live Notes, Video
3/7 12 Supervision via Bellman Lecture Notes
Slides, Live Notes, Video
3/9 13 Optimization Background Lecture Notes
Slides, Live Notes, Video
3/14 14 Policy Optimization: Random Search and Policy Gradient Lecture Notes
Slides, Live Notes, Video
3/16 15 Policy Optimization: Trust Region and Natural PG Lecture Notes
Slides, Live Notes, Video
3/21 16 Prelim Review Slides, Video
3/23 17 Exploration: Multi-Armed Bandits Lecture Notes
Slides, Live Notes, Video
Code, Notebook
3/28 18 Upper Confidence Bound Algorithm Lecture Notes
Slides, Live Notes, Video
3/30 19 Contextual Bandits Lecture Notes
Slides, Live Notes, Video
4/4 Spring Break
4/6 Spring Break
4/11 20 Linear Contextual Bandits Lecture Notes
Slides, Live Notes, Video
Code, Notebook
4/13 21 Exploration in MDPs Lecture Notes
Slides, Live Notes, Video
4/18 22 Imitation Learning with BC Lecture Notes
Slides, Live Notes, Video
4/21 23 Interactive Imitation Learning Lecture Notes
Slides, Live Notes, Video
4/25 24 Inverse RL Lecture Notes
Slides, Live Notes, Video
4/27 25 Max Entropy IRL Lecture Notes
Slides, Live Notes, Video
5/2 26 Specification and Societal Implications Slides, Video
5/4 27 AlphaGo Case Study Slides, Video
5/9 28 Review Slides, Video