Publications


Apprenticeship Learning and Reinforcement Learning with Application to Robotic Control,
Pieter Abbeel
Ph.D. Dissertation, Stanford University, Computer Science, August 2008
pdf



[ALL | Deep RL | Learning-to-Learn | Apprentice | Sim2Real | Unsupervised | Optimization-based Planning | Belief Space Planning | Hierarchical Planning | Perception | Deformable Objects | Medical Robotics | Helicopter | Connectomics ]


Pre-prints

Variational Option Discovery Algorithms,
Joshua Achiam, Harrison Edwards, Dario Amodei, Pieter Abbeel.
arXiv 1807.10299

Stochastic Adversarial Video Prediction,
Alex X. Lee, Richard Zhang, Frederik Ebert, Pieter Abbeel, Chelsea Finn, Sergey Levine.
arXiv 1804.01523, videos

Safer Classification by Synthesis,
William Wang, Angelina Wang, Aviv Tamar, Xi Chen, Pieter Abbeel.
arXiv 1711.08534


Publications

bibtex

[216] Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design,
Jonathan Ho, Xi (Peter) Chen, Aravind Srinivas, Yan (Rocky) Duan, Pieter Abbeel.
In the proceedings of the International Conference on Machine Learning (ICML), Long Beach, CA, USA, June 2019.
arXiv 1902.00275

[214] Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables,
Friso H. Kingma, Pieter Abbeel, Jonathan Ho.
In the proceedings of the International Conference on Machine Learning (ICML), Long Beach, CA, USA, June 2019.
arXiv 1905.06845

[204] Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow,
Xue Bin Peng, Angjoo Kanazawa, Sam Toyer, Pieter Abbeel, Sergey Levine.
In the proceedings of the 7th International Conference on Learning Representations (ICLR), New Orleans, USA, May 2019.
arXiv 1810.00821

[201] Learning Plannable Representations with Causal InfoGAN,
Thanard Kurutach, Aviv Tamar, Ge Yang, Stuart Russell, Pieter Abbeel.
In Neural Information Processing Systems (NeurIPS), Montreal, Canada, December 2018.
arXiv 1807.09341

[186] PixelSNAIL: An Improved Autoregressive Generative Model,
Xi (Peter) Chen, Nikhil Mishra, Mostafa Rohaninejad, Pieter Abbeel.
In the proceedings of the International Conference on Machine Learning (ICML), Stockholm, Sweden, July 2018.
arXiv 1712.09763

[143] Variational Lossy Autoencoder,
Xi (Peter) Chen, Diederik P. Kingma, Tim Salimans, Yan (Rocky) Duan, Prafulla Dhariwal, John Schulman, Ilya Sutskever, Pieter Abbeel.
In the proceedings of the International Conference on Learning Representations (ICLR), Toulon, France, April 2017. arXiv 1611.02731

[134] InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets,
Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, Pieter Abbeel.
In Neural Information Processing Systems (NIPS), Barcelona, Spain, December 2016. (arXiv 1606.03657)

[114] Deep Spatial Autoencoders for Visuomotor Learning
Chelsea Finn, Xin Yu Tan, Yan Duan, Trevor Darrell, Sergey Levine, Pieter Abbeel.
In the proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2016. (arXiv 1509.06113)