Publications


Apprenticeship Learning and Reinforcement Learning with Application to Robotic Control,
Pieter Abbeel
Ph.D. Dissertation, Stanford University, Computer Science, August 2008





Pre-prints

Stochastic Adversarial Video Prediction,
Alex X. Lee, Richard Zhang, Frederik Ebert, Pieter Abbeel, Chelsea Finn, Sergey Levine.
arXiv 1804.01523, videos

Safer Classification by Synthesis,
William Wang, Angelina Wang, Aviv Tamar, Xi Chen, Pieter Abbeel.
arXiv 1711.08534


Publications


[207] The Implicit Preference Information in an Initial State,
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.

[206] Guiding Policies with Language via Meta-Learning,
John D. Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, John DeNero, Pieter Abbeel, Sergey Levine.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1811.07882

[205] ProMP: Proximal Meta-Policy Search,
Jonas Rothfuss*, Dennis Lee*, Ignasi Clavera*, Tamim Asfour, Pieter Abbeel.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1810.06784

[204] Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow,
Xue Bin Peng, Angjoo Kanazawa, Sam Toyer, Pieter Abbeel, Sergey Levine.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1810.00821

[203] Learning to Adapt: Meta-Learning for Model-Based Control,
Ignasi Clavera, Anusha Nagabandi, Ronald S. Fearing, Pieter Abbeel, Sergey Levine, Chelsea Finn.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1803.11347, videos

[202] SFV: Reinforcement Learning of Physical Skills from Videos,
Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine.
In the proceedings of SIGGRAPH ASIA, Tokyo, Japan, December 2018.
arXiv 1810.03599

[201] Learning Plannable Representations with Causal InfoGAN,
Thanard Kurutach, Aviv Tamar, Ge Yang, Stuart Russell, Pieter Abbeel.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1807.09341

[200] Some Considerations on Learning to Explore via Meta-Reinforcement Learning,
Bradly C. Stadie, Ge Yang, Rein Houthooft, Xi Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1803.01118

[199] Meta-Reinforcement Learning of Structured Exploration Strategies,
Abhishek Gupta, Russell Mendonca, YuXuan Liu, Pieter Abbeel, Sergey Levine.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1802.07245

[198] Evolved Policy Gradients,
Rein Houthooft, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip Wolski, Jonathan Ho, Pieter Abbeel.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1802.04821

[197] An Algorithmic Perspective on Imitation Learning,
Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters.
In Foundations and Trends in Robotics, November 2018.
arXiv 1811.06711

[196] Modular Architecture for StarCraft II with Deep Reinforcement Learning,
Dennis Lee, Haoran Tang, Jeffrey O Zhang, Huazhe Xu, Trevor Darrell, Pieter Abbeel.
In the proceedings of the 14th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE'18), Edmonton, Canada, November 2018.
arXiv 1811.03555

[195] Model-Based Reinforcement Learning via Meta-Policy Optimization,
Ignasi Clavera*, Jonas Rothfuss*, John Schulman, Yasuhiro Fujita, Tamim Asfour, Pieter Abbeel.
In the proceedings of the Conference on Robot Learning (CoRL), Zurich, Switzerland, October 2018.
arXiv 1809.05214

[194] Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation,
Gregory Kahn, Adam Villaflor, Pieter Abbeel, Sergey Levine.
In the proceedings of the Conference on Robot Learning (CoRL), Zurich, Switzerland, October 2018.
arXiv 1810.07167, video

[193] Establishing Appropriate Trust via Critical States,
Sandy H. Huang, Kush Bhatia, Pieter Abbeel, Anca D. Dragan.
In the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, October 2018.
arXiv 1810.08174

[192] Domain Randomization and Generative Models for Robotic Grasping,
Joshua Tobin, Lukas Biewald, Rocky Duan, Marcin Andrychowicz, Ankur Handa, Vikash Kumar, Bob McGrew, Jonas Schneider, Peter Welinder, Wojciech Zaremba, Pieter Abbeel.
In the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, October 2018.
arXiv 1710.06425

[186] PixelSNAIL: An Improved Autoregressive Generative Model,
Xi (Peter) Chen, Nikhil Mishra, Mostafa Rohaninejad, Pieter Abbeel.
In the proceedings of the International Conference on Machine Learning (ICML), Stockholm, Sweden, July 2018.
arXiv 1712.09763

[143] Variational Lossy Autoencoder,
Xi (Peter) Chen, Diederik P. Kingma, Tim Salimans, Yan (Rocky) Duan, Prafulla Dhariwal, John Schulman, Ilya Sutskever, Pieter Abbeel.
In the proceedings of the International Conference on Learning Representations (ICLR), Toulon, France, April 2017.
arXiv 1611.02731

[134] InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets,
Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel.
In Neural Information Processing Systems (NIPS), Barcelona, Spain, December 2016.
arXiv 1606.03657

[114] Deep Spatial Autoencoders for Visuomotor Learning,
Chelsea Finn, Xin Yu Tan, Yan Duan, Trevor Darrell, Sergey Levine, Pieter Abbeel.
In the proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2016.
arXiv 1509.06113