Guiding Trajectory Optimization by Demonstrated Distributions

Takayuki Osa; Amir Masoud Ghalamzan Esfahani; Rustam Stolkin; Rudolf  Lioutikov; Jan  Peters; Gerhard Neumann

doi:10.1109/LRA.2017.2653850

Guiding Trajectory Optimization by Demonstrated Distributions

Takayuki Osa, Amir Masoud Ghalamzan Esfahani, Rustam Stolkin, Rudolf Lioutikov , Jan Peters, Gerhard Neumann

Metallurgy and Materials

Research output: Contribution to journal › Article › peer-review

22 Citations (Scopus)

Abstract

Trajectory optimization is an essential tool for motion planning under multiple constraints of robotic manipulators. Optimization-based methods can explicitly optimize a trajectory by leveraging prior knowledge of the system and have been used in various applications such as collision avoidance. However, these methods often require a hand-coded cost function in order to achieve the desired behavior. Specifying such cost function for a complex desired behavior, e.g., disentangling a rope, is a nontrivial task that is often even infeasible. Learning from demonstration (LfD) methods offer an alternative way to program robot motion. LfD methods are less dependent on analytical models and instead learn the behavior of experts implicitly from the demonstrated trajectories. However, the problem of adapting the demonstrations to new situations, e.g., avoiding newly introduced obstacles, has not been fully investigated in the literature. In this letter, we present a motion planning framework that combines the advantages of optimization-based and demonstration-based methods. We learn a distribution of trajectories demonstrated by human experts and use it to guide the trajectory optimization process. The resulting trajectory maintains the demonstrated behaviors, which are essential to performing the task successfully, while adapting the trajectory to avoid obstacles. In simulated experiments and with a real robotic system, we verify that our approach optimizes the trajectory to avoid obstacles and encodes the demonstrated behavior in the resulting trajectory.

Original language	English
Pages (from-to)	819-826
Journal	IEEE Robotics and Automation Letters
Volume	2
Issue number	2
DOIs	https://doi.org/10.1109/LRA.2017.2653850
Publication status	Published - 16 Jan 2017

Access to Document

10.1109/LRA.2017.2653850

Cite this

@article{f824c9d377644961a4966833db108b4f,

title = "Guiding Trajectory Optimization by Demonstrated Distributions",

abstract = "Trajectory optimization is an essential tool for motion planning under multiple constraints of robotic manipulators. Optimization-based methods can explicitly optimize a trajectory by leveraging prior knowledge of the system and have been used in various applications such as collision avoidance. However, these methods often require a hand-coded cost function in order to achieve the desired behavior. Specifying such cost function for a complex desired behavior, e.g., disentangling a rope, is a nontrivial task that is often even infeasible. Learning from demonstration (LfD) methods offer an alternative way to program robot motion. LfD methods are less dependent on analytical models and instead learn the behavior of experts implicitly from the demonstrated trajectories. However, the problem of adapting the demonstrations to new situations, e.g., avoiding newly introduced obstacles, has not been fully investigated in the literature. In this letter, we present a motion planning framework that combines the advantages of optimization-based and demonstration-based methods. We learn a distribution of trajectories demonstrated by human experts and use it to guide the trajectory optimization process. The resulting trajectory maintains the demonstrated behaviors, which are essential to performing the task successfully, while adapting the trajectory to avoid obstacles. In simulated experiments and with a real robotic system, we verify that our approach optimizes the trajectory to avoid obstacles and encodes the demonstrated behavior in the resulting trajectory.",

author = "Takayuki Osa and {Ghalamzan Esfahani}, {Amir Masoud} and Rustam Stolkin and Rudolf Lioutikov and Jan Peters and Gerhard Neumann",

year = "2017",

month = jan,

day = "16",

doi = "10.1109/LRA.2017.2653850",

language = "English",

volume = "2",

pages = "819--826",

journal = "IEEE Robotics and Automation Letters",

issn = "2377-3766",

publisher = "IEEE Computer Society Press",

number = "2",

}

TY - JOUR

T1 - Guiding Trajectory Optimization by Demonstrated Distributions

AU - Osa, Takayuki

AU - Ghalamzan Esfahani, Amir Masoud

AU - Stolkin, Rustam

AU - Lioutikov , Rudolf

AU - Peters, Jan

AU - Neumann, Gerhard

PY - 2017/1/16

Y1 - 2017/1/16

N2 - Trajectory optimization is an essential tool for motion planning under multiple constraints of robotic manipulators. Optimization-based methods can explicitly optimize a trajectory by leveraging prior knowledge of the system and have been used in various applications such as collision avoidance. However, these methods often require a hand-coded cost function in order to achieve the desired behavior. Specifying such cost function for a complex desired behavior, e.g., disentangling a rope, is a nontrivial task that is often even infeasible. Learning from demonstration (LfD) methods offer an alternative way to program robot motion. LfD methods are less dependent on analytical models and instead learn the behavior of experts implicitly from the demonstrated trajectories. However, the problem of adapting the demonstrations to new situations, e.g., avoiding newly introduced obstacles, has not been fully investigated in the literature. In this letter, we present a motion planning framework that combines the advantages of optimization-based and demonstration-based methods. We learn a distribution of trajectories demonstrated by human experts and use it to guide the trajectory optimization process. The resulting trajectory maintains the demonstrated behaviors, which are essential to performing the task successfully, while adapting the trajectory to avoid obstacles. In simulated experiments and with a real robotic system, we verify that our approach optimizes the trajectory to avoid obstacles and encodes the demonstrated behavior in the resulting trajectory.

AB - Trajectory optimization is an essential tool for motion planning under multiple constraints of robotic manipulators. Optimization-based methods can explicitly optimize a trajectory by leveraging prior knowledge of the system and have been used in various applications such as collision avoidance. However, these methods often require a hand-coded cost function in order to achieve the desired behavior. Specifying such cost function for a complex desired behavior, e.g., disentangling a rope, is a nontrivial task that is often even infeasible. Learning from demonstration (LfD) methods offer an alternative way to program robot motion. LfD methods are less dependent on analytical models and instead learn the behavior of experts implicitly from the demonstrated trajectories. However, the problem of adapting the demonstrations to new situations, e.g., avoiding newly introduced obstacles, has not been fully investigated in the literature. In this letter, we present a motion planning framework that combines the advantages of optimization-based and demonstration-based methods. We learn a distribution of trajectories demonstrated by human experts and use it to guide the trajectory optimization process. The resulting trajectory maintains the demonstrated behaviors, which are essential to performing the task successfully, while adapting the trajectory to avoid obstacles. In simulated experiments and with a real robotic system, we verify that our approach optimizes the trajectory to avoid obstacles and encodes the demonstrated behavior in the resulting trajectory.

U2 - 10.1109/LRA.2017.2653850

DO - 10.1109/LRA.2017.2653850

M3 - Article

SN - 2377-3766

VL - 2

SP - 819

EP - 826

JO - IEEE Robotics and Automation Letters

JF - IEEE Robotics and Automation Letters

IS - 2

ER -

Guiding Trajectory Optimization by Demonstrated Distributions

Abstract

Access to Document

Fingerprint

Cite this