Conference item icon

Conference item

PC2: projection-conditioned point cloud diffusion for single-image 3D reconstruction

Abstract:
Reconstructing the 3D shape of an object from a single RGB image is a long-standing problem in computer vision. In this paper, we propose a novel method for single-image 3D reconstruction which generates a sparse point cloud via a conditional denoising diffusion process. Our method takes as input a single RGB image along with its camera pose and gradually denoises a set of 3D points, whose positions are initially sampled randomly from a three-dimensional Gaussian distribution, into the shape of an object. The key to our method is a geometrically-consistent conditioning process which we call projection conditioning: at each step in the diffusion process, we project local image features onto the partially-denoised point cloud from the given camera pose. This projection conditioning process enables us to generate high-resolution sparse geometries that are well-aligned with the input image and can additionally be used to predict point colors after shape reconstruction. Moreover, due to the probabilistic nature of the diffusion process, our method is naturally capable of generating multiple different shapes consistent with a single input image. In contrast to prior work, our approach not only performs well on synthetic benchmarks but also gives large qualitative improvements on complex real-world data. Data and code are available at https://lukemelas.github.io/projectionconditioned-point-cloud-diffusion/.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Publisher copy:
10.1109/CVPR52729.2023.01242

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Oxford college:
New College
Role:
Author
ORCID:
0000-0003-1374-2858


Publisher:
IEEE
Host title:
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Journal:
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) More from this journal
Pages:
12923-12932
Publication date:
2023-08-22
Acceptance date:
2023-02-27
Event title:
Conference on Computer Vision and Pattern Recognition (CVPR 2023)
Event location:
Vancouver, Canada
Event website:
https://cvpr2023.thecvf.com/Conferences/2023
Event start date:
2023-06-18
Event end date:
2023-06-22
DOI:
EISSN:
2575-7075
ISSN:
1063-6919
EISBN:
9798350301298
ISBN:
9798350301304


Language:
English
Keywords:
Pubs id:
1335234
Local pid:
pubs:1335234
Deposit date:
2023-03-31
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP