Frequency Domain Diffusion Model with Scale-Dependent Noise Schedule

Amir Ziashahabi, Baturalp Buyukates, Artan Sheshmani, Yi-Zhuang You, Salman Avestimehr

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Diffusion models have played a crucial role in recent advances in generative image modeling. These models are characterized by a forward process that incrementally corrupts images. The modeling objective is to learn a reverse process capable of reconstructing the original image from degraded inputs, so that the trained model can then generate natural images from pure noise. In this work, we introduce a novel diffusion process that operates in the frequency domain. Typically, the frequency-domain representation of an image is sparse, with energy concentrated predominantly in low-frequency components. This inherent sparsity helps separate signal from noise during the reverse process. We exploit this property to introduce a scale-dependent noise schedule that offers precise control over the different image scales. Working in the frequency domain also allows us to modify the training protocol, yielding significant computational savings: a speedup of 2.7-8.5x without a significant drop in generated image quality compared to image-domain models, which operate with fixed noise schedules.
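
The forward process described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the exponential form of the schedule, the function names, and the parameters `base_rate` and `scale_gain` are all assumptions chosen for illustration. The sketch only shows the general idea of corrupting Fourier coefficients with a noise level that depends on spatial frequency, so that fine scales are destroyed earlier than coarse ones.

```python
import numpy as np

def radial_frequency(height, width):
    """Normalized radial frequency in [0, 1] for each 2-D FFT coefficient."""
    fy = np.fft.fftfreq(height)[:, None]  # cycles per pixel, in [-0.5, 0.5)
    fx = np.fft.fftfreq(width)[None, :]
    radius = np.sqrt(fx**2 + fy**2)
    return radius / radius.max()

def signal_retention(t, radius, base_rate=4.0, scale_gain=3.0):
    """Hypothetical schedule: fraction of signal variance kept at time t in [0, 1].

    Higher frequencies decay faster. The functional form is an assumption,
    not the schedule used in the paper.
    """
    return np.exp(-base_rate * t * (1.0 + scale_gain * radius))

def forward_sample(image, t, rng):
    """Corrupt a grayscale image at time t by mixing noise into its FFT coefficients."""
    coeffs = np.fft.fft2(image, norm="ortho")
    # FFT of white Gaussian noise; the orthonormal transform preserves its variance.
    noise = np.fft.fft2(rng.standard_normal(image.shape), norm="ortho")
    keep = signal_retention(t, radial_frequency(*image.shape))
    noisy = np.sqrt(keep) * coeffs + np.sqrt(1.0 - keep) * noise
    # keep is symmetric under frequency negation, so the inverse transform
    # is real up to floating-point rounding.
    return np.fft.ifft2(noisy, norm="ortho").real
```

For example, `forward_sample(image, 0.5, np.random.default_rng(0))` returns a corrupted image of the same shape in which high-frequency detail has been replaced by noise to a larger degree than low-frequency structure. At `t = 0` the input is returned unchanged.
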
Original language: English
Title of host publication: 2024 IEEE International Symposium on Information Theory (ISIT)
Publisher: IEEE
Pages: 19-24
Number of pages: 6
ISBN (Electronic): 9798350382846
ISBN (Print): 9798350382853
Publication status: Published - 19 Aug 2024
Externally published: Yes
Event: 2024 IEEE International Symposium on Information Theory (ISIT) - Athens, Greece
Duration: 7 Jul 2024 to 12 Jul 2024
https://2024.ieee-isit.org/home

Publication series

Name: IEEE International Symposium on Information Theory
Publisher: IEEE
ISSN (Print): 2157-8095
ISSN (Electronic): 2157-8117

Conference

Conference: 2024 IEEE International Symposium on Information Theory (ISIT)
Abbreviated title: IEEE ISIT2024
Country/Territory: Greece
City: Athens
Period: 7/07/24 to 12/07/24

Keywords

  • Image quality
  • Training
  • Schedules
  • Protocols
  • Image synthesis
  • Frequency-domain analysis
  • Noise
