Conference item
Value propagation networks
- Abstract:
- We present Value Propagation (VProp), a parameter-efficient differentiable planning module built on Value Iteration which can successfully be trained in a reinforcement learning fashion to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. We evaluate on configurations of MazeBase grid-worlds, with randomly generated environments of several different sizes. Furthermore, we show that the module enables to learn to plan when the environment also includes stochastic elements, providing a cost-efficient learning system to build low-level size-invariant planners for a variety of interactive navigation problems.
- Publication status:
- Published
- Peer review status:
- Reviewed (other)
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 791.9KB, Terms of use)
-
- Publication website:
- https://openreview.net/forum?id=SJG6G2RqtX
Authors
+ Leverhulme Trust
More from this funder
- Funder identifier:
- https://ror.org/012mzw131
- Grant:
- RPG-2012-544
+ European Commission
More from this funder
- Funder identifier:
- https://ror.org/00k4n6c32
- Grant:
- 321162
+ Engineering & Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/N019474/1
- Publisher:
- OpenReview
- Host title:
- Proceedings of the International Conference on Learning Representations (ICLR 2019)
- Article number:
- 1308
- Publication date:
- 2019-03-29
- Acceptance date:
- 2019-01-01
- Event title:
- 7th International Conference on Learning Representations (ICLR 2019)
- Event location:
- New Orleans, Louisiana, USA
- Event website:
- https://iclr.cc/Conferences/2019
- Event start date:
- 2019-05-06
- Event end date:
- 2019-05-09
- Language:
-
English
- Keywords:
- Pubs id:
-
pubs:995206
- UUID:
-
uuid:a757857e-be6f-45b0-b37f-80343f5be611
- Local pid:
-
pubs:995206
- Source identifiers:
-
995206
- Deposit date:
-
2019-05-01
- ARK identifier:
Terms of use
- Copyright holder:
- Nardelli et al.
- Copyright date:
- 2019
- Rights statement:
- © The Authors 2019.
- Notes:
- This paper was presented at the 7th International Conference on Learning Representations (ICLR 2019), 6-9 May 2019, Virtual event. The final version is available online from OpenReview at: https://openreview.net/forum?id=SJG6G2RqtX
If you are the owner of this record, you can report an update to it here: Report update to this record