Conference item
Semantically grounded object matching for robust robotic scene rearrangement
- Abstract:
- Object rearrangement has recently emerged as a key competency in robot manipulation, with practical solutions generally involving object detection, recognition, grasping and high-level planning. Goal-images describing a desired scene configuration are a promising and increasingly used mode of instruction. A key outstanding challenge is the accurate inference of matches between objects in front of a robot, and those seen in a provided goal image, where recent works have struggled in the absence of object-specific training data. In this work, we explore the deterioration of existing methods' ability to infer matches between objects as the visual shift between observed and goal scenes increases. We find that a fundamental limitation of the current setting is that source and target images must contain the same instance of every object, which restricts practical deployment. We present a novel approach to object matching that uses a large pre-trained vision-language model to match objects in a cross-instance setting by leveraging semantics together with visual features as a more robust, and much more general, measure of similarity. We demonstrate that this provides considerably improved matching performance in cross-instance settings, and can be used to guide multi-object rearrangement with a robot manipulator from an image that shares no object instances with the robot's scene. Our code is available at https://github.com/applied-ai-lab/object_matching.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 5.4MB, Terms of use)
-
- Publisher copy:
- 10.1109/ICRA46639.2022.9811817
Authors
- Publisher:
- IEEE
- Host title:
- Proceedings of the International Conference on Robotics and Automation (ICRA 2022)
- Pages:
- 11138-11144
- Publication date:
- 2022-07-12
- Acceptance date:
- 2021-02-28
- Event title:
- International Conference on Robotics and Automation (ICRA 2022)
- Event location:
- Philadelphia, PA, USA
- Event website:
- https://www.icra2022.org/
- Event start date:
- 2022-05-23
- Event end date:
- 2022-05-27
- DOI:
- EISBN:
- 978-1-7281-9681-7
- ISBN:
- 978-1-7281-9682-4
- Language:
-
English
- Keywords:
- Pubs id:
-
1242865
- Local pid:
-
pubs:1242865
- Deposit date:
-
2022-03-09
Terms of use
- Copyright holder:
- IEEE
- Copyright date:
- 2022
- Rights statement:
- © IEEE 2022
- Notes:
- This paper will be presented at the International Conference on Robotics and Automation (ICRA 2022), 23rd-27th May 2022, Philadelphia, PA, USA. This is the accepted manuscript version of the article. The final version is available online from IEEE at: https://doi.org/10.1109/ICRA46639.2022.9811817
If you are the owner of this record, you can report an update to it here: Report update to this record