CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction

Di, Yan; Zhang, Chenyangguang; Wang, Pengyuan; Zhai, Guangyao; Zhang, Ruida; Manhardt, Fabian; Busam, Benjamin; Ji, Xiangyang; Tombari, Federico

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.07837 (cs)

[Submitted on 15 Aug 2023]

Title:CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction

Authors:Yan Di, Chenyangguang Zhang, Pengyuan Wang, Guangyao Zhai, Ruida Zhang, Fabian Manhardt, Benjamin Busam, Xiangyang Ji, Federico Tombari

View PDF

Abstract:In this paper, we present a novel shape reconstruction method leveraging diffusion model to generate 3D sparse point cloud for the object captured in a single RGB image. Recent methods typically leverage global embedding or local projection-based features as the condition to guide the diffusion model. However, such strategies fail to consistently align the denoised point cloud with the given image, leading to unstable conditioning and inferior performance. In this paper, we present CCD-3DR, which exploits a novel centered diffusion probabilistic model for consistent local feature conditioning. We constrain the noise and sampled point cloud from the diffusion model into a subspace where the point cloud center remains unchanged during the forward diffusion process and reverse process. The stable point cloud center further serves as an anchor to align each point with its corresponding local projection-based features. Extensive experiments on synthetic benchmark ShapeNet-R2N2 demonstrate that CCD-3DR outperforms all competitors by a large margin, with over 40% improvement. We also provide results on real-world dataset Pix3D to thoroughly demonstrate the potential of CCD-3DR in real-world applications. Codes will be released soon

Comments:	11 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.07837 [cs.CV]
	(or arXiv:2308.07837v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.07837

Submission history

From: Yan Di [view email]
[v1] Tue, 15 Aug 2023 15:27:42 UTC (1,776 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators