Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion

Yutao Jiang, Yang Zhou, Yuan Liang, Wenxi Liu, Jianbo Jiao, Yuhui Quan, Shengfeng He

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper aims to resolve the challenging problem of wide-angle novel view synthesis from a single image, a.k.a. wide-angle 3D photography. Existing approaches rely on local context and treat them equally to inpaint occluded RGB and depth regions, which fail to deal with large-region occlusion (i.e., observing from an extreme angle) and foreground layers might blend into background inpainting. To address the above issues, we propose Diffuse3D which employs a pre-trained diffusion model for global synthesis, while amending the model to activate depth-aware inference. Our key insight is to alter the convolution mechanism in the denoising process. We inject depth information into the denoising convolution operation with bilateral kernels, i.e., a depth kernel and a spatial kernel, to consider layered correlations among pixels. In this way, foreground regions are overlooked in background inpainting and only pixels close in depth are leveraged. On the other hand, we propose a global-local balancing approach to maximize both contextual understandings. Extensive experiments demonstrate that our approach outperforms state-of-the-art methods in novel view synthesis, especially in wide-angle scenarios. More importantly, our method does not require any training and is a plug-and-play module that can be integrated with any diffusion model. Our code can be found at https://github.com/yutaojiang1/Diffuse3D.
Original languageEnglish
Title of host publication2023 IEEE/CVF International Conference on Computer Vision (ICCV)
PublisherIEEE
Pages8964-8974
Number of pages11
ISBN (Electronic)9798350307184
ISBN (Print)9798350307191
DOIs
Publication statusPublished - 15 Jan 2024
Event2023 IEEE/CVF International Conference on Computer Vision (ICCV) - Paris, France
Duration: 1 Oct 20236 Oct 2023

Publication series

NameInternational Conference on Computer Vision (ICCV)
PublisherIEEE
ISSN (Print)1550-5499
ISSN (Electronic)2380-7504

Conference

Conference2023 IEEE/CVF International Conference on Computer Vision (ICCV)
Period1/10/236/10/23

Keywords

  • Photography
  • Training
  • Computer vision
  • Three-dimensional displays
  • Image resolution
  • Correlation
  • Convolution

Fingerprint

Dive into the research topics of 'Diffuse3D: Wide-Angle 3D Photography via Bilateral Diffusion'. Together they form a unique fingerprint.

Cite this