Abstract
Diffusion models have played a crucial role in the recent advancements in generative image modeling. These models are characterized by a forward process that incrementally corrupts images. The modeling objective is to develop a reverse process capable of reconstructing the original image from degraded inputs so that the trained model can then be leveraged to generate natural images from pure noise. In this work, we introduce a novel diffusion process that operates in the frequency domain. Typically, the frequency domain representation of an image exhibits a sparse structure, with energy predominantly concentrated in low frequency components. This inherent sparsity aids us in the effective separation of signal and noise during the reverse process. We utilize this property to introduce a scale-dependent noise schedule, offering precise control over various image scales. Working in the frequency domain allows us to modify the training protocol, resulting in significant computation enhancements, achieving a speedup of 2.7-8.5 x without a significant drop in generated image quality, compared to the image domain models, which operate with fixed noise schedules.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Symposium on Information Theory (ISIT) |
Publisher | IEEE |
Pages | 19-24 |
Number of pages | 6 |
ISBN (Electronic) | 9798350382846 |
ISBN (Print) | 9798350382853 |
DOIs | |
Publication status | Published - 19 Aug 2024 |
Externally published | Yes |
Event | 2024 IEEE International Symposium on Information Theory (ISIT) - Athens, Greece Duration: 7 Jul 2024 → 12 Jul 2024 https://2024.ieee-isit.org/home |
Publication series
Name | IEEE International Symposium on Information Theory |
---|---|
Publisher | IEEE |
ISSN (Print) | 2157-8095 |
ISSN (Electronic) | 2157-8117 |
Conference
Conference | 2024 IEEE International Symposium on Information Theory (ISIT) |
---|---|
Abbreviated title | IEEE ISIT2024 |
Country/Territory | Greece |
City | Athens |
Period | 7/07/24 → 12/07/24 |
Internet address |
Keywords
- Image quality
- Training
- Schedules
- Protocols
- Image synthesis
- Frequency-domain analysis
- Noise