Journal article
A Bayesian exploration−exploitation approach for optimal online sensing and planning with a visually guided mobile robot
- Abstract:
- We address the problem of online path planning for optimal sensing with a mobile robot. The objective of the robot is to learn the most about its pose and the environment given time constraints. We use a POMDP with a utility function that depends on the belief state to model the finite horizon planning problem. We replan as the robot progresses throughout the environment. The POMDP is high-dimensional, continuous, non-differentiable, nonlinear, non-Gaussian and must be solved in real-time. Most existing techniques for stochastic planning and reinforcement learning are therefore inapplicable. To solve this extremely complex problem, we propose a Bayesian optimization method that dynamically trades off exploration (minimizing uncertainty in unknown parts of the policy space) and exploitation (capitalizing on the current best solution). We demonstrate our approach with a visually-guide mobile robot. The solution proposed here is also applicable to other closely-related domains, including active vision, sequential experimental design, dynamic sensing and calibration with mobile sensors.
Actions
Access Document
- Publisher copy:
- 10.1007/s10514-009-9130-2
Authors
- Journal:
- Autonomous Robots More from this journal
- Volume:
- 27
- Issue:
- 2
- Pages:
- 93-103
- Publication date:
- 2009-01-01
- DOI:
- ISSN:
-
0929-5593
- UUID:
-
uuid:e861c1ae-2830-4813-820b-77620ad06e0d
- Local pid:
-
cs:7473
- Deposit date:
-
2015-03-31
- ARK identifier:
Terms of use
- Copyright date:
- 2009
If you are the owner of this record, you can report an update to it here: Report update to this record