Nested Diffusion Processes for Anytime Image Generation

Elata, Noam; Kawar, Bahjat; Michaeli, Tomer; Elad, Michael

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.19066 (cs)

[Submitted on 30 May 2023 (v1), last revised 30 Oct 2023 (this version, v3)]

Title:Nested Diffusion Processes for Anytime Image Generation

Authors:Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad

View PDF

Abstract:Diffusion models are the current state-of-the-art in image generation, synthesizing high-quality images by breaking down the generation process into many fine-grained denoising steps. Despite their good performance, diffusion models are computationally expensive, requiring many neural function evaluations (NFEs). In this work, we propose an anytime diffusion-based method that can generate viable images when stopped at arbitrary times before completion. Using existing pretrained diffusion models, we show that the generation scheme can be recomposed as two nested diffusion processes, enabling fast iterative refinement of a generated image. In experiments on ImageNet and Stable Diffusion-based text-to-image generation, we show, both qualitatively and quantitatively, that our method's intermediate generation quality greatly exceeds that of the original diffusion model, while the final generation result remains comparable. We illustrate the applicability of Nested Diffusion in several settings, including for solving inverse problems, and for rapid text-based content creation by allowing user intervention throughout the sampling process.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.19066 [cs.CV]
	(or arXiv:2305.19066v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.19066

Submission history

From: Noam Elata Mr [view email]
[v1] Tue, 30 May 2023 14:28:43 UTC (49,068 KB)
[v2] Fri, 7 Jul 2023 13:25:39 UTC (23,373 KB)
[v3] Mon, 30 Oct 2023 10:58:43 UTC (27,407 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Nested Diffusion Processes for Anytime Image Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Nested Diffusion Processes for Anytime Image Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators