Conference item icon

Conference item

Value propagation networks

Abstract:
We present Value Propagation (VProp), a parameter-efficient differentiable planning module built on Value Iteration which can successfully be trained in a reinforcement learning fashion to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. We evaluate on configurations of MazeBase grid-worlds, with randomly generated environments of several different sizes. Furthermore, we show that the module enables to learn to plan when the environment also includes stochastic elements, providing a cost-efficient learning system to build low-level size-invariant planners for a variety of interactive navigation problems.
Publication status:
Published
Peer review status:
Reviewed (other)

Actions

Access Document

Files:
Publication website:
https://openreview.net/forum?id=SJG6G2RqtX

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0001-8491-8166
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author


More from this funder
Funder identifier:
https://ror.org/012mzw131
Grant:
RPG-2012-544
More from this funder
Funder identifier:
https://ror.org/00k4n6c32
Grant:
321162
More from this funder
Funder identifier:
https://ror.org/0439y7842
Grant:
EP/N019474/1


Publisher:
OpenReview
Host title:
Proceedings of the International Conference on Learning Representations (ICLR 2019)
Article number:
1308
Publication date:
2019-03-29
Acceptance date:
2019-01-01
Event title:
7th International Conference on Learning Representations (ICLR 2019)
Event location:
New Orleans, Louisiana, USA
Event website:
https://iclr.cc/Conferences/2019
Event start date:
2019-05-06
Event end date:
2019-05-09


Language:
English
Keywords:
Pubs id:
pubs:995206
UUID:
uuid:a757857e-be6f-45b0-b37f-80343f5be611
Local pid:
pubs:995206
Source identifiers:
995206
Deposit date:
2019-05-01
ARK identifier:

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP