TY - GEN
T1 - Cross-domain representation learning for clothes unfolding in robot-assisted dressing
AU - Qie, Jinge
AU - Gao, Yixing
AU - Feng, Runyang
AU - Wang, Xin
AU - Yang, Jielong
AU - Dasgupta, Esha
AU - Chang, Hyung Jin
AU - Chang, Yi
PY - 2023/2/19
Y1 - 2023/2/19
AB - Assistive robots can significantly reduce the burden of daily activities by providing services such as unfolding clothes and assistive dressing. For robotic clothes manipulation tasks, grasping point recognition is one of the core steps, and it is usually achieved by supervised deep learning methods that require large amounts of labeled training data. Because collecting real labeled data is extremely labor-intensive and time-consuming in this field, synthetic data generated by physics engines is typically adopted for data enrichment. However, there is an inherent discrepancy between the real and synthetic domains. Therefore, effectively leveraging synthetic data together with real data to jointly train models for grasping point recognition is desirable. In this paper, we propose a Cross-Domain Representation Learning (CDRL) framework that adaptively extracts domain-specific features from the synthetic and real domains, respectively, and then fuses these domain-specific features into more informative and robust cross-domain representations, thereby improving the prediction accuracy of the grasping points that an assistive robot relies on. Experimental results show that our CDRL framework recognizes grasping points more precisely than five baseline methods. Based on our CDRL framework, we enable a Baxter humanoid robot to unfold a hanging white coat with a 92% success rate and to successfully assist six users in dressing.
UR - https://link.springer.com/conference/eccv
UR - https://www.scopus.com/pages/publications/85150938163
DO - 10.1007/978-3-031-25075-0
M3 - Conference contribution
SN - 978-3-031-25074-3
T3 - Lecture Notes in Computer Science
SP - 658
EP - 671
BT - Computer Vision – ECCV 2022 Workshops
PB - Springer, Cham
T2 - Tenth International Workshop on Assistive Computer Vision and Robotics
Y2 - 24 October 2022
ER -
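
For readers of this record, the following is a minimal, hypothetical PyTorch sketch of the dual-branch fusion idea summarized in the abstract: two domain-specific encoders, one for synthetic and one for real images, whose features are blended by a learned gate before a small head regresses grasping-point heatmaps. All module names, layer shapes, and the gating scheme are illustrative assumptions, not the authors' CDRL implementation.

import torch
import torch.nn as nn

class DomainEncoder(nn.Module):
    """Small convolutional encoder for one domain (synthetic or real)."""
    def __init__(self, in_ch: int = 3, feat_ch: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, feat_ch, 3, stride=2, padding=1), nn.ReLU(),
        )

    def forward(self, x):
        return self.net(x)

class FusionHead(nn.Module):
    """Fuses domain-specific features and regresses grasping-point heatmaps."""
    def __init__(self, feat_ch: int = 64, n_points: int = 2):
        super().__init__()
        # Learned per-location gate deciding how to weight the two branches
        # (an assumed fusion mechanism, chosen for simplicity).
        self.gate = nn.Sequential(nn.Conv2d(2 * feat_ch, feat_ch, 1), nn.Sigmoid())
        self.head = nn.Sequential(
            nn.ConvTranspose2d(feat_ch, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, n_points, 4, stride=2, padding=1),
        )

    def forward(self, f_syn, f_real):
        g = self.gate(torch.cat([f_syn, f_real], dim=1))
        fused = g * f_syn + (1 - g) * f_real  # convex blend of the branches
        return self.head(fused)  # one heatmap channel per grasping point

if __name__ == "__main__":
    enc_syn, enc_real = DomainEncoder(), DomainEncoder()
    fuse = FusionHead()
    img = torch.randn(1, 3, 128, 128)  # stand-in for a clothes image
    # In training, synthetic images would go through enc_syn and real images
    # through enc_real; the same tensor is fed to both here only to show shapes.
    heatmaps = fuse(enc_syn(img), enc_real(img))
    print(heatmaps.shape)  # torch.Size([1, 2, 128, 128])

Under this sketch's assumptions, grasping points would be read off as the argmax locations of the predicted heatmaps; the paper itself should be consulted for the actual architecture and training procedure.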