End-to-end learning to grasp via sampling from object point clouds

Antonio Alliegro; Martin Rudorfer; Fabio Frattin; Ales Leonardis; Tatiana Tommasi

doi:10.1109/LRA.2022.3191183

Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

The ability to grasp objects is an essential skill that enables many robotic manipulation tasks. Recent works have studied point cloud-based methods for object grasping by starting from simulated datasets and have shown promising performance in real-world scenarios. Nevertheless, many of them still rely on ad-hoc geometric heuristics to generate grasp candidates, which fail to generalize to objects with significantly different shapes with respect to those observed during training. Several approaches exploit complex multi-stage learning strategies and local neighborhood feature extraction while ignoring semantic global information. Furthermore, they are inefficient in terms of number of training samples and time required for inference. In this letter, we propose an end-to-end learning solution to generate 6-DOF parallel-jaw grasps starting from the 3D partial view of the object. Our Learning to Grasp (L2G) method gathers information from the input point cloud through a new procedure that combines a differentiable sampling strategy to identify the visible contact points, with a feature encoder that leverages local and global cues. Overall, L2G is guided by a multi-task objective that generates a diverse set of grasps by optimizing contact point sampling, grasp regression, and grasp classification. With a thorough experimental analysis, we show the effectiveness of L2G as well as its robustness and generalization abilities.

Original language	English
Pages (from-to)	9865-9872
Number of pages	8
Journal	IEEE Robotics and Automation Letters
Volume	7
Issue number	4
Early online date	15 Jul 2022
DOIs	https://doi.org/10.1109/LRA.2022.3191183
Publication status	Published - Oct 2022

Access to Document

10.1109/LRA.2022.3191183
Licence: None: All rights reserved

End-to-end learning to grasp via sampling from object point clouds
This is the Accepted Author Manuscript (AAM) of an article, A. Alliegro, M. Rudorfer, F. Frattin, A. Leonardis and T. Tommasi, "End-to-End Learning to Grasp via Sampling From Object Point Clouds," in IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 9865-9872, Oct. 2022, doi: 10.1109/LRA.2022.3191183, published by IEEE. © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Accepted author manuscript, 2.85 MB
Licence: Other (please specify with Rights Statement)

Cite this

@article{5f0b6bb19b6b4ac48e51dc66789c1698,
  title = "End-to-end learning to grasp via sampling from object point clouds",
  abstract = "The ability to grasp objects is an essential skill that enables many robotic manipulation tasks. Recent works have studied point cloud-based methods for object grasping by starting from simulated datasets and have shown promising performance in real-world scenarios. Nevertheless, many of them still rely on ad-hoc geometric heuristics to generate grasp candidates, which fail to generalize to objects with significantly different shapes with respect to those observed during training. Several approaches exploit complex multi-stage learning strategies and local neighborhood feature extraction while ignoring semantic global information. Furthermore, they are inefficient in terms of number of training samples and time required for inference. In this letter, we propose an end-to-end learning solution to generate 6-DOF parallel-jaw grasps starting from the 3D partial view of the object. Our Learning to Grasp (L2G) method gathers information from the input point cloud through a new procedure that combines a differentiable sampling strategy to identify the visible contact points, with a feature encoder that leverages local and global cues. Overall, L2G is guided by a multi-task objective that generates a diverse set of grasps by optimizing contact point sampling, grasp regression, and grasp classification. With a thorough experimental analysis, we show the effectiveness of L2G as well as its robustness and generalization abilities.",
  author = "Antonio Alliegro and Martin Rudorfer and Fabio Frattin and Ales Leonardis and Tatiana Tommasi",
  year = "2022",
  month = oct,
  doi = "10.1109/LRA.2022.3191183",
  language = "English",
  volume = "7",
  pages = "9865--9872",
  journal = "IEEE Robotics and Automation Letters",
  issn = "2377-3766",
  publisher = "IEEE Computer Society Press",
  number = "4",
}

TY  - JOUR
T1  - End-to-end learning to grasp via sampling from object point clouds
AU  - Alliegro, Antonio
AU  - Rudorfer, Martin
AU  - Frattin, Fabio
AU  - Leonardis, Ales
AU  - Tommasi, Tatiana
PY  - 2022/10
Y1  - 2022/10
N2  - The ability to grasp objects is an essential skill that enables many robotic manipulation tasks. Recent works have studied point cloud-based methods for object grasping by starting from simulated datasets and have shown promising performance in real-world scenarios. Nevertheless, many of them still rely on ad-hoc geometric heuristics to generate grasp candidates, which fail to generalize to objects with significantly different shapes with respect to those observed during training. Several approaches exploit complex multi-stage learning strategies and local neighborhood feature extraction while ignoring semantic global information. Furthermore, they are inefficient in terms of number of training samples and time required for inference. In this letter, we propose an end-to-end learning solution to generate 6-DOF parallel-jaw grasps starting from the 3D partial view of the object. Our Learning to Grasp (L2G) method gathers information from the input point cloud through a new procedure that combines a differentiable sampling strategy to identify the visible contact points, with a feature encoder that leverages local and global cues. Overall, L2G is guided by a multi-task objective that generates a diverse set of grasps by optimizing contact point sampling, grasp regression, and grasp classification. With a thorough experimental analysis, we show the effectiveness of L2G as well as its robustness and generalization abilities.
AB  - The ability to grasp objects is an essential skill that enables many robotic manipulation tasks. Recent works have studied point cloud-based methods for object grasping by starting from simulated datasets and have shown promising performance in real-world scenarios. Nevertheless, many of them still rely on ad-hoc geometric heuristics to generate grasp candidates, which fail to generalize to objects with significantly different shapes with respect to those observed during training. Several approaches exploit complex multi-stage learning strategies and local neighborhood feature extraction while ignoring semantic global information. Furthermore, they are inefficient in terms of number of training samples and time required for inference. In this letter, we propose an end-to-end learning solution to generate 6-DOF parallel-jaw grasps starting from the 3D partial view of the object. Our Learning to Grasp (L2G) method gathers information from the input point cloud through a new procedure that combines a differentiable sampling strategy to identify the visible contact points, with a feature encoder that leverages local and global cues. Overall, L2G is guided by a multi-task objective that generates a diverse set of grasps by optimizing contact point sampling, grasp regression, and grasp classification. With a thorough experimental analysis, we show the effectiveness of L2G as well as its robustness and generalization abilities.
UR  - https://arxiv.org/abs/2203.05585
UR  - https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7083369
U2  - 10.1109/LRA.2022.3191183
DO  - 10.1109/LRA.2022.3191183
M3  - Article
SN  - 2377-3766
VL  - 7
SP  - 9865
EP  - 9872
JO  - IEEE Robotics and Automation Letters
JF  - IEEE Robotics and Automation Letters
IS  - 4
ER  - 
