Conference item
MinkOcc: Towards real-time label-efficient semantic occupancy prediction
- Abstract:
- Developing 3D semantic occupancy prediction models often relies on dense 3D annotations for supervised learning, a process that is both labor and resource-intensive, underscoring the need for label-efficient or even label-free approaches. To address this, we introduce MinkOcc, a multimodal 3D semantic occupancy prediction framework for cameras and LiDARs that proposes a two-step semi-supervised training procedure. Here, a small dataset of explicitly 3D annotations warm-starts the training process; then, the supervision is continued by simpler-to-annotate accumulated LiDAR sweeps and images – semantically labelled through vision foundational models. MinkOcc effectively utilizes these sensor-rich supervisory cues and reduces reliance on manual labeling by 90% while maintaining competitive accuracy. In addition, the proposed model incorporates information from LiDAR and camera data through early fusion and leverages sparse convolution networks for real-time prediction. With its efficiency in both supervision and computation, we aim to extend MinkOcc beyond curated datasets, enabling broader real-world deployment of 3D semantic occupancy prediction in autonomous driving.
- Publication status:
- Accepted
- Peer review status:
- Peer reviewed
Actions
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/W011344/1
- Publisher:
- IEEE
- Acceptance date:
- 2025-06-30
- Event title:
- IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025)
- Event location:
- Hangzhao, China
- Event website:
- https://iros25.org/
- Event start date:
- 2025-10-19
- Event end date:
- 2025-10-25
- Language:
-
English
- Pubs id:
-
2267966
- Local pid:
-
pubs:2267966
- Deposit date:
-
2025-08-04
Terms of use
- Notes:
- This paper was presented at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2025), 19th-25th October 2025, Hangzhao, China. The author accepted manuscript (AAM) of this paper has been made available under the University of Oxford's Open Access Publications Policy, and a CC BY public copyright licence has been applied.
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record