Publications


Apprenticeship Learning and Reinforcement Learning with Application to Robotic Control,
Pieter Abbeel
Ph.D. Dissertation, Stanford University, Computer Science, August 2008
pdf





Pre-prints

One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks,
Tianhe Yu, Pieter Abbeel, Sergey Levine, Chelsea Finn.
arXiv 1810.11043

Variational Option Discovery Algorithms,
Joshua Achiam, Harrison Edwards, Dario Amodei, Pieter Abbeel.
arXiv 1807.10299

RL²: Fast Reinforcement Learning via Slow Reinforcement Learning,
Yan (Rocky) Duan, John Schulman, Xi (Peter) Chen, Peter L. Bartlett, Ilya Sutskever, Pieter Abbeel.
arXiv 1611.02779, videos


Publications


[206] Guiding Policies with Language via Meta-Learning,
John D. Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, John DeNero, Pieter Abbeel, Sergey Levine.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1811.07882

[205] ProMP: Proximal Meta-Policy Search,
Jonas Rothfuss*, Dennis Lee*, Ignasi Clavera*, Tamim Asfour, Pieter Abbeel.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1810.06784

[203] Learning to Adapt: Meta-Learning for Model-Based Control,
Ignasi Clavera, Anusha Nagabandi, Ronald S. Fearing, Pieter Abbeel, Sergey Levine, Chelsea Finn.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1803.11347, videos

[200] Some Considerations on Learning to Explore via Meta-Reinforcement Learning,
Bradly C. Stadie, Ge Yang, Rein Houthooft, Xi Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1803.01118

[199] Meta-Reinforcement Learning of Structured Exploration Strategies,
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1802.07245

[198] Evolved Policy Gradients,
Rein Houthooft, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip Wolski, Jonathan Ho, Pieter Abbeel.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1802.04821

[195] Model-Based Reinforcement Learning via Meta-Policy Optimization,
Ignasi Clavera*, Jonas Rothfuss*, John Schulman, Yasuhiro Fujita, Tamim Asfour, Pieter Abbeel.
In the proceedings of the Conference on Robot Learning (CoRL), Zurich, Switzerland, October 2018.
arXiv 1809.05214

[184] One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning,
Tianhe Yu*, Chelsea Finn*, Annie Xie, Sudeep Dasari, Tianhao Zhang, Pieter Abbeel, Sergey Levine.
In the proceedings of Robotics: Science and Systems (RSS), Pittsburgh, PA, USA, June 2018.
arXiv 1802.01557, video

[179] A Simple Neural Attentive Meta-Learner,
Nikhil Mishra*, Mostafa Rohaninejad*, Xi (Peter) Chen, Pieter Abbeel.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018.
pdf forthcoming

[177] Meta Learning Shared Hierarchies,
Kevin Frans, Jonathan Ho, Xi Chen, Pieter Abbeel, John Schulman.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018.
arXiv 1710.09767

[176] Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments (Best Paper Award),
Maruan Al-Shedivat, Trapit Bansal, Yuri Burda, Ilya Sutskever, Igor Mordatch, Pieter Abbeel.
In the proceedings of the 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, April 2018.
arXiv 1710.03641, videos

[162] One-Shot Imitation Learning,
Yan (Rocky) Duan, Marcin Andrychowicz, Bradly Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba.
In Neural Information Processing Systems (NIPS), Long Beach, CA, December 2017.
arXiv 1703.07326, videos

[160] One-Shot Visual Imitation Learning via Meta-Learning,
Chelsea Finn*, Tianhe (Kevin) Yu*, Tianhao Zhang, Pieter Abbeel, Sergey Levine.
In the proceedings of the 1st Annual Conference on Robot Learning (CoRL), Mountain View, CA, November 2017.
arXiv 1709.04905, videos

[152] Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks,
Chelsea Finn, Pieter Abbeel, Sergey Levine.
In the proceedings of the International Conference on Machine Learning (ICML), Sydney, Australia, August 2017.
arXiv 1703.03400