Publications


Apprenticeship Learning and Reinforcement Learning with Application to Robotic Control,
Pieter Abbeel
Ph.D. Dissertation, Stanford University, Computer Science, August 2008
pdf





Pre-prints

Learning to Adapt: Meta-Learning for Model-Based Control,
Ignasi Clavera, Anusha Nagabandi, Ronald S. Fearing, Pieter Abbeel, Sergey Levine, Chelsea Finn.
arXiv 1803.11347, videos

Meta-Reinforcement Learning of Structured Exploration Strategies,
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine.
arXiv 1802.07245

Evolved Policy Gradients,
Rein Houthooft, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip Wolski, Jonathan Ho, Pieter Abbeel.
arXiv 1802.04821

One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning,
Tianhe Yu*, Chelsea Finn*, Annie Xie, Sudeep Dasari, Tianhao Zhang, Pieter Abbeel, Sergey Levine.
arXiv 1802.01557, video

RL²: Fast Reinforcement Learning via Slow Reinforcement Learning,
Yan (Rocky) Duan, John Schulman, Xi (Peter) Chen, Peter L. Bartlett, Ilya Sutskever, Pieter Abbeel.
arXiv 1611.02779, videos


Publications


[179] A Simple Neural Attentive Meta-Learner,
Nikhil Mishra*, Mostafa Rohaninejad*, Xi (Peter) Chen, Pieter Abbeel.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018 (pdf forthcoming)

[177] Meta Learning Shared Hierarchies,
Kevin Frans, Jonathan Ho, Xi Chen, Pieter Abbeel, John Schulman.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018 (arXiv 1710.09767)

[176] Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments, Best Paper Award,
Maruan Al-Shedivat, Trapit Bansal, Yuri Burda, Ilya Sutskever, Igor Mordatch, Pieter Abbeel.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018 (arXiv 1710.03641, videos)

[162] One-Shot Imitation Learning,
Yan (Rocky) Duan, Marcin Andrychowicz, Bradly Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba.
In Neural Information Processing Systems (NIPS), Long Beach, CA, December 2017. (arXiv 1703.07326, videos)

[160] One-Shot Visual Imitation Learning via Meta-Learning,
Chelsea Finn*, Tianhe (Kevin) Yu*, Tianhao Zhang, Pieter Abbeel, Sergey Levine.
In the proceedings of the 1st Annual Conference on Robot Learning (CoRL), Mountain View, CA, November 2017. (arXiv 1709.04905, videos)

[152] Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks,
Chelsea Finn, Pieter Abbeel, Sergey Levine.
In the proceedings of the International Conference on Machine Learning (ICML), Sydney, Australia, August 2017. (arXiv 1703.03400)