TY - GEN
T1 - Towards Learning 3d Object Detection and 6d Pose Estimation from Synthetic Data
AU - Rudorfer, Martin
AU - Neumann, Lukas
AU - Krüger, Jörg
PY - 2019/9
Y1 - 2019/9
N2 - Deep Learning-based approaches for 3d object detection and 6d pose estimation typically require large amounts of labeled training data. Labeling image data is expensive and particularly the 6d pose information is difficult to obtain, as it requires a complex setup during image acquisition. Training with synthetic data is therefore very attractive. Large amounts of synthetic, labeled data can be generated, but it is not yet fully understood how certain aspects of data generation affect the detection and pose estimation performance. Our work therefore focuses on creating synthetic training data and investigating the effects on detection performance. We present two methods for data generation: rendering object views and pasting them on random background images, and simulating realistic scenes. The former is computationally simpler and achieved better results, but the detection performance is still very sensitive to small changes, e.g. the type of background images.
AB - Deep Learning-based approaches for 3d object detection and 6d pose estimation typically require large amounts of labeled training data. Labeling image data is expensive and particularly the 6d pose information is difficult to obtain, as it requires a complex setup during image acquisition. Training with synthetic data is therefore very attractive. Large amounts of synthetic, labeled data can be generated, but it is not yet fully understood how certain aspects of data generation affect the detection and pose estimation performance. Our work therefore focuses on creating synthetic training data and investigating the effects on detection performance. We present two methods for data generation: rendering object views and pasting them on random background images, and simulating realistic scenes. The former is computationally simpler and achieved better results, but the detection performance is still very sensitive to small changes, e.g. the type of background images.
KW - deep learning
KW - object detection
KW - synthetic data
UR - http://www.scopus.com/inward/record.url?scp=85074208942&partnerID=8YFLogxK
U2 - 10.1109/ETFA.2019.8869318
DO - 10.1109/ETFA.2019.8869318
M3 - Conference contribution
AN - SCOPUS:85074208942
T3 - IEEE International Conference on Emerging Technologies and Factory Automation, ETFA
SP - 1540
EP - 1543
BT - Proceedings - 2019 24th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2019
PB - Institute of Electrical and Electronics Engineers (IEEE)
T2 - 24th IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2019
Y2 - 10 September 2019 through 13 September 2019
ER -