Consistency Models Improve Diffusion Inverse Solvers

Xu, Tongda; Zhu, Ziran; He, Dailan; Wang, Yuanyuan; Sun, Ming; Li, Ning; Qin, Hongwei; Wang, Yan; Liu, Jingjing; Zhang, Ya-Qin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.12063 (cs)

[Submitted on 9 Feb 2024]

Title:Consistency Models Improve Diffusion Inverse Solvers

Authors:Tongda Xu, Ziran Zhu, Dailan He, Yuanyuan Wang, Ming Sun, Ning Li, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang

View PDF HTML (experimental)

Abstract:Diffusion inverse solvers (DIS) aim to find an image $x$ that lives on the diffusion prior while satisfying the constraint $f(x) = y$, given an operator $f(.)$ and measurement $y$. Most non-linear DIS use posterior mean $\hat{x}_{0|t}=\mathbb{E}[x_0|x_t]$ to evaluate $f(.)$ and minimize the distance $||f(\hat{x}_{0|t})-y||^2$. Previous works show that posterior mean-based distance is biased; instead, posterior sample $x_{0|t}\sim p_{\theta}(x_0|x_t)$ promises a better candidate. In this paper, we first clarify when is posterior sample better: $1)$ When $f(.)$ is linear, the distance with posterior mean is as good as single posterior sample, thus preferable as it does not require Monte Carlo; $2)$ When $f(.)$ is non-linear, the distance using posterior sample is better. As previous approximations to posterior sample do not look like a real image, we propose to use consistency model (CM) as a high quality approximation. In addition, we propose a new family of DIS using pure CM. Empirically, we show that replacing posterior mean by CM improves DIS performance on non-linear $f(.)$ (e.g. semantic segmentation, image captioning). Further, our pure CM inversion works well for both linear and non-linear $f(.)$.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2403.12063 [cs.CV]
	(or arXiv:2403.12063v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.12063

Submission history

From: Tongda Xu [view email]
[v1] Fri, 9 Feb 2024 02:23:47 UTC (33,852 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Consistency Models Improve Diffusion Inverse Solvers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Consistency Models Improve Diffusion Inverse Solvers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators