Conference item
Communicating via Markov decision processes
- Abstract:
- We consider the problem of communicating exogenous information by means of Markov decision process trajectories. This setting, which we call a Markov coding game (MCG), generalizes both source coding and a large class of referential games. MCGs also isolate a problem that is important in decentralized control settings in which cheap-talk is not available-namely, they require balancing communication with the associated cost of communicating. We contribute a theoretically grounded approach to MCGs based on maximum entropy reinforcement learning and minimum entropy coupling that we call MEME. Due to recent breakthroughs in approximation algorithms for minimum entropy coupling, MEME is not merely a theoretical algorithm, but can be applied to practical settings. Empirically, we show both that MEME is able to outperform a strong baseline on small MCGs and that MEME is able to achieve strong performance on extremely large MCGs. To the latter point, we demonstrate that MEME is able to losslessly communicate binary images via trajectories of Cartpole and Pong, while simultaneously achieving the maximal or near maximal expected returns, and that it is even capable of performing well in the presence of actuator noise.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 1.6MB, Terms of use)
-
Authors
- Publisher:
- Journal of Machine Learning Research
- Host title:
- Proceedings of the 39th International Conference on Machine Learning (ICML 2022)
- Volume:
- 162
- Pages:
- 20314-20328
- Series:
- Proceedings of Machine Learning Research
- Publication date:
- 2022-01-01
- Event title:
- 39th International Conference on Machine Learning (ICML 2022)
- Event location:
- Baltimore, MD, USA
- Event website:
- https://icml.cc/Conferences/2022
- Event start date:
- 2022-07-17
- Event end date:
- 2022-07-23
- EISSN:
-
2640-3498
- ISSN:
-
2640-3498
- Language:
-
English
- Keywords:
- Pubs id:
-
1336644
- Local pid:
-
pubs:1336644
- Deposit date:
-
2023-07-24
- ARK identifier:
Terms of use
- Copyright holder:
- Sokota et al
- Copyright date:
- 2022
- Rights statement:
- © 2022 by the author(s).
- Notes:
- This paper was presented at the 39th International Conference on Machine Learning (ICML 2022), 17th-23rd July 2022, Baltimore, MD, USA.
If you are the owner of this record, you can report an update to it here: Report update to this record