Imagining hidden supporting objects using volumetric conditional GANs and differentiable stability scores

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Objects supporting the physical stability of an unstructured heap of items are often heavily or completely occluded by the objects that they are supporting. Identifying plausible supporting object candidates and their poses from visual information is challenging because there may be many candidates and it is not practical to exhaustively verify each one using physical simulation. We present a generative system which predicts the complete volumetric structure of a heap of objects from visible depth and semantic information. We leverage 3D conditional Wasserstein generative adversarial networks to perform this task and inject differentiable context about physical stability from a second network trained to score the physical stability of object heaps. We demonstrate that our system is capable of generating physically stable heaps from visual information, and that the use of both generative models and context about physical stability is crucial to replicating the true distribution of hidden objects. We train and evaluate our system using a novel simulation-based dataset which we also present in this work.
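
The abstract describes a generator trained against a Wasserstein critic while also receiving a differentiable signal from a stability-scoring network. The abstract does not give the exact objective, so the following is only a minimal sketch of how two such terms might be combined into one generator loss. The function name, the `stability_weight` parameter, and the assumption that stability scores lie in [0, 1] are all hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: a generator objective that adds a stability term
# to the usual Wasserstein generator term. All names here are hypothetical
# and are not the authors' implementation.
from statistics import fmean
from typing import Sequence


def generator_loss(
    critic_scores: Sequence[float],
    stability_scores: Sequence[float],
    stability_weight: float = 1.0,
) -> float:
    """Return a scalar loss for one batch of generated heaps.

    critic_scores: critic outputs for generated volumes (higher = more realistic).
    stability_scores: assumed stability-network outputs in [0, 1]
        (higher = more stable).
    stability_weight: assumed trade-off factor between the two terms.
    """
    if len(critic_scores) != len(stability_scores):
        raise ValueError("batch sizes must match")
    # Standard WGAN generator term: push critic scores on generated data up.
    wasserstein_term = -fmean(critic_scores)
    # Extra term: reward heaps that the stability network scores as stable.
    stability_term = -fmean(stability_scores)
    return wasserstein_term + stability_weight * stability_term
```

In a real training loop both score tensors would come from networks inside an automatic differentiation framework, so that gradients flow from the stability score back into the generator; plain floats are used here only to keep the arithmetic visible.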
Original language: English
Title of host publication: 33rd British Machine Vision Conference 2022, BMVC 2022, London, UK, November 21-24, 2022
Publisher: British Machine Vision Association
Number of pages: 14
Publication status: Published - 24 Nov 2022
Event: The 33rd British Machine Vision Conference - The Kia Oval, London, United Kingdom
Duration: 21 Nov 2022 to 24 Nov 2022
https://bmvc2022.org/

Conference

Conference: The 33rd British Machine Vision Conference
Abbreviated title: BMVC
Country/Territory: United Kingdom
City: London
Period: 21/11/22 to 24/11/22

Keywords

  • Physical stability
  • Generative models
  • 3D scene understanding
