CS 294-67, Spring 2011: Sequential Decisions: Planning and Reinforcement Learning
Reading list




This list is still under construction. An empty bullet item indicates more readings to come for that week.

Books


Week 1 (1/19): Agents, environments, Markov decision processes


Week 2 (1/26): Dynamic programming


Week 3 (2/2): Dynamic programming contd.


Week 4 (2/9): Partially observable MDPs


Week 5 (2/16): Early history, Monte Carlo RL


Week 6 (2/23): Basic RL algorithms: TD, Q-learning, SARSA


Week 7 (3/2): Convergence of Q-learning; function approximation

Project proposals due.

Week 8 (3/9): Function approximation: convergence properties and proofs


Week 9 (3/16): LSTD/LSPI; policy search methods


Week 10 (3/23):

Spring Break

Week 11 (3/30): Factored MDPs and symbolic dynamic programming


Week 12 (4/6): First-order MDPs and relational RL


Week 13 (4/13): Hierarchical RL


Week 14 (4/20): Exploration, bandits, and metalevel RL


Week 15 (4/27): Inverse RL


Week 16:

Reading/Review/Recitation

Week 17 (TBD):

Project Presentations