Lecture |
Topic |
Readings |
Optional/Additional Readings |
Th Aug 27 |
Course Introduction |
|
|
Tu Sep 1 |
MDPs, Exact Methods: Value Iteration, Policy Iteration, Linear Programming, LP notes |
Sutton and Barto, Reinforcement Learning, Chapters 3 and 4
|
|
Th Sep 3 |
Discretization of Continuous State Space MDPs (v2), Code for Discretization Examples |
|
Moore and Atkeson, 1993
Munos and Moore, MLJ 2001
|
Tu Sep 8 |
Function Approximation / Feature-based Representations |
|
Chow and Tsitsiklis, 1991
Gordon, 1995
Tsitsiklis and Van Roy, 1996
Kushner and Dupuis, 1992/2001
|
Th Sep 10 |
LQR, iterative LQR / Differential Dynamic Programming |
|
|
Tu Sep 15 |
Convex Optimization |
cvx_example.m |
Boyd and Vandenberghe, Chapters 9-11 |
Th Sep 17 |
Convex Optimization (part II) (same slides as previous lecture) |
|
|
Tu Sep 22 |
Non-Convex Optimization through Sequential Convex Programming (SCP), Locally Optimal Control through Optimization: Collocation, Shooting, Model Predictive Control (MPC), Trajectory Optimization for Motion Planning |
code examples |
Nocedal and Wright, Chapter 18 |
Th Sep 24 |
Inverse Optimal Control |
|
|
Tu Sep 29 (PM only) |
Guest Lecture: Adam Bry (skyd.io) |
Adam Bry (Skydio) Lecture Video |
|
Th Oct 1 |
Motion Planning: PRM, RRT + variants |
Steven M. LaValle, Motion Planning, Chapters 5, 14, RRT*, Karaman and Frazzoli, LQR trees, Tedrake, code example |
|
Tu Oct 6 |
Inverse Optimal Control (part II) (same slides as previous lecture) |
|
|
Th Oct 8 |
Probability Review, Bayes Filters |
Intro: PR 1; Probability Review and Bayes Filters: PR 2 |
|
Tu Oct 13 |
Multivariate Gaussians |
PR 3 |
|
Th Oct 15 |
Kalman Filtering |
PR 3 |
From Gauss to Kalman |
Tu Oct 20 |
EKF, UKF |
PR 3 |
Julier and Uhlmann, the UKF |
Th Oct 22 |
Smoother, MAP |
|
|
Fr Oct 23 |
Bay Area Robotics Symposium |
Chevron Auditorium at the International House |
|
Tu Oct 27 |
Maximum Likelihood, EM |
|
|
Th Oct 29 |
POMDPs |
|
|
Tu Nov 3 (PM only) |
Guest Lecturers: Buddy Michini (Airware), Brad Neumann (Anki), and Liz Murphy (Savioke) |
Buddy Michini (Airware) Lecture Video, Brad Neumann (Anki) Lecture Video, Liz Murphy (Savioke) Lecture Video |
|
Th Nov 5 |
Policy Gradients |
|
|
Tu Nov 10 |
Guided Policy Search (Guest Lecturer: Sergey Levine) |
|
|
Th Nov 12 |
Hierarchical Planning (Guest Lecturer: Dylan Hadfield-Menell) |
|
|
Tu Nov 17 |
Learning from Demonstrations (Guest Lecturer: Sandy Huang) |
|
|
Optimization for Estimation, GraphSLAM, Particle Filters and Localization
SEIF, EnKF, EKF-SLAM, Motion Models, Beam Sensor Models, Scan Matching
Mapping with Known Poses, Rao-Blackwellized Particle Filters, gMapping
Th Nov 19 |
Particle Filters |
|
|
Tu Nov 24 |
Projects speed-dating |
|
|
Th Nov 26 |
Happy Thanksgiving! |
|
|
Tu Dec 1 |
EKF-SLAM, Graph-SLAM, Beam Sensor Model, Current Directions |
|
|
Th Dec 3 |
Autonomous Helicopters and Course Wrap-Up |
|
Abbeel, Coates, Ng, IJRR 2010, videos and data |
Fr Dec 4, 1:30pm, Cory 540A/B |
Project Presentations Session 1 |
|
|
Mon Dec 7, 1:15pm, 405 Soda |
Project Presentations Session 2 |
|
|
Mon Dec 7, 3:30pm, 242 SDH |
Project Presentations Session 3 |
|
|