Publications


Apprenticeship Learning and Reinforcement Learning with Application to Robotic Control,
Pieter Abbeel
Ph.D. Dissertation, Stanford University, Computer Science, August 2008
pdf



[ALL | Deep RL | Learning-to-Learn | Apprentice | Optimization-based Planning | Belief Space Planning | Hierarchical Planning | Perception | Deformable Objects | Medical Robotics | Helicopter | Connectomics ]


Pre-prints

Meta-Reinforcement Learning of Structured Exploration Strategies,
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine.
arXiv 1802.07245

Evolved Policy Gradients,
Rein Houthooft, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip Wolski, Jonathan Ho, Pieter Abbeel.
arXiv 1802.04821

One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning,
Tianhe Yu*, Chelsea Finn*, Annie Xie, Sudeep Dasari, Tianhao Zhang, Pieter Abbeel, Sergey Levine.
arXiv 1802.01557, video

Meta-Learning with Temporal Convolutions,
Nikhil Mishra*, Mostafa Rohaninejad*, Xi (Peter) Chen, Pieter Abbeel.
arXiv 1707.03141

RL2: Fast Reinforcement Learning via Slow Reinforcement Learning,
Yan (Rocky) Duan, John Schulman, Xi (Peter) Chen, Peter L. Bartlett, Ilya Sutskever, Pieter Abbeel.
arXiv 1611.02779, videos


Publications

bibtex

[179] A Simple Neural Attentive Meta-Learner,
Nikhil Mishra*, Mostafa Rohaninejad*, Xi (Peter) Chen, Pieter Abbeel.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018 (pdf forthcoming)

[177] Meta Learning Shared Hierarchies,
Kevin Frans, Jonathan Ho, Xi Chen, Pieter Abbeel, John Schulman.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018 (arXiv 1710.09767)

[176] Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments,
Maruan Al-Shedivat, Trapit Bansal, Yuri Burda, Ilya Sutskever, Igor Mordatch, Pieter Abbeel.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018 (arXiv 1710.03641, videos)

[162] One-Shot Imitation Learning,
Yan (Rocky) Duan, Marcin Andrychowicz, Bradly Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba.
In Neural Information Processing Systems (NIPS), Long Beach, CA, December 2017. (arXiv 1703.07326, videos)

[160] One-Shot Visual Imitation Learning via Meta-Learning,
Chelsea Finn*, Tianhe (Kevin) Yu*, Tianhao Zhang, Pieter Abbeel, Sergey Levine.
In the proceedings of the 1st Annual Conference on Robot Learning (CoRL), Mountain View, CA, November 2017. (arXiv 1709.04905, videos)

[152] Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks,
Chelsea Finn, Pieter Abbeel, Sergey Levine.
In the proceedings of the International Conference on Machine Learning, Sydney, Australia, August 2017. (arXiv 1703.03400)