Conference item
APEX: Unsupervised, object-centric scene segmentation and tracking for robot manipulation
- Abstract:
- Recent advances in unsupervised learning for object detection, segmentation, and tracking hold significant promise for applications in robotics. A common approach is to frame these tasks as inference in probabilistic latent-variable models. In this paper, however, we show that the current state-of-the-art struggles with visually complex scenes such as typically encountered in robot manipulation tasks. We propose APEX, a new latent-variable model which is able to segment and track objects in more realistic scenes featuring objects that vary widely in size and texture, including the robot arm itself. This is achieved by a principled mask normalisation algorithm and a high-resolution scene encoder. To evaluate our approach, we present results on the real-world Sketchy dataset. This dataset, however, does not contain ground truth masks and object IDs for a quantitative evaluation. We thus introduce the Panda Pushing Dataset (P2D) which shows a Panda arm interacting with objects on a table in simulation and which includes ground-truth segmentation masks and object IDs for tracking. In both cases, APEX comprehensively outperforms the current state-of-the-art in unsupervised object segmentation and tracking. We demonstrate the efficacy of our segmentations for robot skill execution on an object arrangement task, where we also achieve the best or comparable performance among all the baselines.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 1.2MB, Terms of use)
-
- Publisher copy:
- 10.1109/IROS51168.2021.9636711
Authors
- Publisher:
- Institute of Electrical and Electronics Engineers
- Host title:
- Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
- Pages:
- 3375-3382
- Publication date:
- 2021-12-16
- Event title:
- IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
- Event location:
- Prague, Czech Republic
- Event start date:
- 2021-09-27
- Event end date:
- 2021-10-01
- DOI:
- EISSN:
-
2153-0866
- ISSN:
-
2153-0858
- ISBN:
- 9781665417143
- Language:
-
English
- Keywords:
- Pubs id:
-
1240804
- Local pid:
-
pubs:1240804
- Deposit date:
-
2023-03-10
Terms of use
- Copyright holder:
- IEEE
- Copyright date:
- 2021
- Rights statement:
- Copyright 2021 IEEE.
- Notes:
-
This is the accepted manuscript version of the article. The final version is available from IEEE at https://doi.org/10.1109/IROS51168.2021.9636711
If you are the owner of this record, you can report an update to it here: Report update to this record