Conference item icon

Conference item

VGGSfM: visual geometry grounded deep structure from motion

Abstract:
Structure-from-motion (SfM) is a longstanding problem in the computer vision community, which aims to reconstruct the camera poses and 3D structure of a scene from a set of unconstrained 2D images. Classical frameworks solve this problem in an incremental manner by detecting and matching keypoints, registering images, triangulating 3D points, and conducting bundle adjustment. Recent research efforts have predominantly revolved around harnessing the power of deep learning techniques to enhance specific elements (e.g., keypoint matching), but are still based on the original, non-differentiable pipeline. Instead, we propose a new deep pipeline VGGSfM, where each component is fully differentiable and thus can be trained in an end-to-end manner. To this end, we introduce new mechanisms and simplifications. First, we build on recent advances in deep 2D point tracking to extract reliable pixel-accurate tracks, which eliminates the need for chaining pairwise matches. Furthermore, we recover all cameras simultaneously based on the image and track features instead of gradually registering cameras. Finally, we optimise the cameras and triangulate 3D points via a differentiable bundle adjustment layer. We attain state-of-the-art performance on three popular datasets, CO3D, IMC Phototourism, and ETH3D.
Publication status:
Published
Peer review status:
Peer reviewed

Actions

Access Document

Files:
Publisher copy:
10.1109/CVPR52733.2024.02049

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Oxford college:
Magdalen College
Role:
Author
ORCID:
0000-0003-3994-8045


Publisher:
IEEE
Host title:
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Pages:
21686-21697
Publication date:
2024-09-16
Acceptance date:
2024-05-02
Event title:
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
Event location:
Seattle, Washington, USA
Event website:
https://cvpr.thecvf.com/Conferences/2024
Event start date:
2024-06-17
Event end date:
2024-06-21
DOI:
EISSN:
2575-7075
ISSN:
1063-6919
EISBN:
9798350353006
ISBN:
9798350353013


Language:
English
Keywords:
Pubs id:
2053761
Local pid:
pubs:2053761
Deposit date:
2024-11-20
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP