rllab is a framework for developing and evaluating reinforcement learning algorithms. It provides a wrapper for running rllab algorithms on environments from OpenAI Gym and for submitting the results to the OpenAI Gym scoreboard.
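As a rough illustration of the wrapper idea described above, the sketch below adapts a Gym-style environment (a `reset()` method plus a `step(action)` method returning an `(observation, reward, done, info)` tuple) to a framework-side interface. It is not rllab's actual code: `CountdownEnv`, `GymStyleAdapter`, `Step`, and `rollout` are hypothetical names introduced only for this example, and the toy environment stands in for a real Gym environment so the snippet runs without any dependencies.

```python
from collections import namedtuple

# Hypothetical record type used on the framework side of this sketch.
Step = namedtuple("Step", ["observation", "reward", "done", "info"])


class CountdownEnv:
    """Toy stand-in for a Gym-style environment (not a real Gym class).

    The state counts down from `start`; every step yields reward 1.0 and
    the episode ends when the counter reaches zero.
    """

    def __init__(self, start=3):
        self.start = start
        self.state = start

    def reset(self):
        self.state = self.start
        return self.state

    def step(self, action):
        self.state -= 1
        done = self.state <= 0
        return self.state, 1.0, done, {}


class GymStyleAdapter:
    """Hypothetical adapter exposing a Gym-style env through a framework interface."""

    def __init__(self, gym_env):
        self._env = gym_env

    def reset(self):
        return self._env.reset()

    def step(self, action):
        # Repackage the plain 4-tuple into a named record.
        obs, reward, done, info = self._env.step(action)
        return Step(obs, float(reward), bool(done), info)


def rollout(env, policy, max_steps=100):
    """Run one episode and return the undiscounted sum of rewards."""
    obs = env.reset()
    total = 0.0
    for _ in range(max_steps):
        result = env.step(policy(obs))
        total += result.reward
        obs = result.observation
        if result.done:
            break
    return total


# A constant policy is enough to exercise the adapter.
episode_return = rollout(GymStyleAdapter(CountdownEnv(start=3)), policy=lambda obs: 0)
```

Because the algorithm side only sees the adapter, any environment that follows the same `reset`/`step` convention could be substituted without changing `rollout`.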
[121] Benchmarking Deep Reinforcement Learning for Continuous Control,
Yan Duan, Xi Chen, Rein Houthooft, John Schulman, Pieter Abbeel.
In the proceedings of the International Conference on Machine Learning (ICML), 2016. (arXiv 1604.06778, rllab:code, rllab:docs)
End-to-End Training of Deep Visuomotor Policies,
Sergey Levine*, Chelsea Finn*, Trevor Darrell, Pieter Abbeel.
arXiv 1504.00702 (video)
[114] Deep Spatial Autoencoders for Visuomotor Learning,
Chelsea Finn, Xin Yu Tan, Yan Duan, Trevor Darrell, Sergey Levine, Pieter Abbeel.
In the proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2016. (arXiv 1509.06113)
[113] Learning Deep Neural Network Policies with Continuous Memory States,
Marvin Zhang, Zoe McCarthy, Chelsea Finn, Sergey Levine, Pieter Abbeel.
In the proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2016. (arXiv 1507.01273)
[80] Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics,
Sergey Levine, Pieter Abbeel.
In Neural Information Processing Systems (NIPS) 27, 2014. (pdf)
trajopt is a software framework for generating robot trajectories by local optimization.
Papers
[57] Finding Locally Optimal, Collision-Free Trajectories with Sequential Convex Optimization,
John D. Schulman, Jonathan Ho, Alex Lee, Ibrahim Awwal, Henry Bradlow and Pieter Abbeel.
In the proceedings of Robotics: Science and Systems (RSS), 2013. (pdf, videos, code)
[66] Planning Locally Optimal, Curvature-Constrained Trajectories in 3D using Sequential Convex Optimization,
Yan Duan, Sachin Patil, John Schulman, Ken Goldberg, Pieter Abbeel.
In the proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2014. (pdf)
[70] Motion Planning with Sequential Convex Optimization and Convex Collision Checking,
John Schulman, Yan Duan, Jonathan Ho, Alex Lee, Ibrahim Awwal, Henry Bradlow, Jia Pan, Sachin Patil, Ken Goldberg, Pieter Abbeel.
In the International Journal of Robotics Research (IJRR), 2014. (pdf)