Conference item
Planning with hidden parameter polynomial MDPs
- Abstract:
- For many applications of Markov Decision Processes (MDPs), the transition function cannot be specified exactly. Bayes-Adaptive MDPs (BAMDPs) extend MDPs to consider transition probabilities governed by latent parameters. To act optimally in BAMDPs, one must maintain a belief distribution over the latent parameters. Typically, this distribution is described by a set of sample (particle) MDPs, and associated weights which represent the likelihood of a sample MDP being the true underlying MDP. However, as the number of dimensions of the latent parameter space increases, the number of sample MDPs required to sufficiently represent the belief distribution grows exponentially. Thus, maintaining an accurate belief in the form of a set of sample MDPs over complex latent spaces is computationally intensive, which in turn affects the performance of planning for these models. In this paper, we propose an alternative approach for maintaining the belief over the latent parameters. We consider a class of BAMDPs where the transition probabilities can be expressed in closed form as a polynomial of the latent parameters, and outline a method to maintain a closed-form belief distribution for the latent parameters which results in an accurate belief representation. Furthermore, the closed-form representation does away with the need to tune the number of sample MDPs required to represent the belief. We evaluate two domains and empirically show that the polynomial, closed-form, belief representation results in better plans than a sampling-based belief representation.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
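As an illustration of the two belief representations contrasted in the abstract, the minimal Python sketch below applies the same Bayes update to a weighted set of particles and to a closed-form polynomial density. It is not the authors' implementation. The single latent parameter `theta` on [0, 1], the two-outcome transition model, the uniform prior, and all function names are assumptions made for this example only.

```python
import numpy as np
from numpy.polynomial import Polynomial

# Hypothetical 1-D example: one latent parameter theta in [0, 1].
# Assumed transition model (not taken from the paper): "success" occurs
# with probability theta and "failure" with probability 1 - theta,
# so both transition probabilities are polynomials in theta.
LIKELIHOOD = {
    "success": Polynomial([0.0, 1.0]),   # theta
    "failure": Polynomial([1.0, -1.0]),  # 1 - theta
}


def normalise(density):
    """Rescale a polynomial so that it integrates to 1 over [0, 1]."""
    antiderivative = density.integ()
    mass = antiderivative(1.0) - antiderivative(0.0)
    return density / mass


def polynomial_update(belief, outcome):
    """Closed-form Bayes rule: posterior is prior times likelihood, renormalised."""
    return normalise(belief * LIKELIHOOD[outcome])


def particle_update(particles, weights, outcome):
    """The same Bayes rule applied to the weights of sampled theta values."""
    new_weights = weights * LIKELIHOOD[outcome](particles)
    return new_weights / new_weights.sum()


def polynomial_mean(belief):
    """Posterior mean of theta under a polynomial density on [0, 1]."""
    moment = (belief * Polynomial([0.0, 1.0])).integ()
    return moment(1.0) - moment(0.0)


observations = ["success", "success", "failure"]

belief = Polynomial([1.0])                  # uniform prior density on [0, 1]
rng = np.random.default_rng(0)
particles = rng.uniform(0.0, 1.0, size=50)  # 50 sampled values of theta
weights = np.full(particles.shape, 1.0 / particles.size)

for outcome in observations:
    belief = polynomial_update(belief, outcome)
    weights = particle_update(particles, weights, outcome)

exact_mean = polynomial_mean(belief)               # approximately 0.6, the mean of Beta(3, 2)
particle_mean = float(np.dot(weights, particles))  # depends on the sampled particles

print(f"closed-form posterior mean: {exact_mean:.4f}")
print(f"particle posterior mean:    {particle_mean:.4f}")
```

In this toy setting the polynomial belief is exact after every observation, whereas the quality of the particle estimate depends on how many values of `theta` were sampled and where they fell. With several latent parameters the particle set would have to cover a higher-dimensional space, which is the scaling problem the abstract describes.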
Access Document
- Publisher copy:
- 10.1609/aaai.v37i10.26411
Authors
- Publisher:
- Association for the Advancement of Artificial Intelligence
- Journal:
- Proceedings of the AAAI Conference on Artificial Intelligence
- Volume:
- 37
- Issue:
- 10
- Pages:
- 11963-11971
- Publication date:
- 2023-06-26
- Acceptance date:
- 2022-12-06
- Event title:
- 37th AAAI Conference on Artificial Intelligence (AAAI-23)
- Event location:
- Washington, DC, USA
- Event website:
- https://aaai.org/Conferences/AAAI-23/
- Event start date:
- 2023-02-07
- Event end date:
- 2023-02-14
- DOI:
- 10.1609/aaai.v37i10.26411
- EISSN:
- 2374-3468
- ISSN:
- 2159-5399
- Language:
- English
- Pubs id:
- 1317622
- Local pid:
- pubs:1317622
- Deposit date:
- 2023-01-03
Terms of use
- Copyright holder:
- Association for the Advancement of Artificial Intelligence
- Copyright date:
- 2023
- Rights statement:
- © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
- Notes:
- This is the accepted manuscript version of the paper. The final version is available online from the Association for the Advancement of Artificial Intelligence at: https://doi.org/10.1609/aaai.v37i10.26411