Conference item
DeepSynth: Automata synthesis for automatic task segmentation in deep reinforcement learning
- Abstract:
- This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) agents when the reward is sparse and non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact automata to uncover this sequential structure automatically. We synthesise a humaninterpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton so that the generation of a control policy by deep RL is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge. Compared to existing approaches, we obtain a reduction of two orders of magnitude in the number of iterations required for policy synthesis, and also a significant improvement in scalability.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, 1.0MB, Terms of use)
-
- Publication website:
- https://ojs.aaai.org/index.php/AAAI/article/view/16935
Authors
- Publisher:
- Association for the Advancement of Artificial Intelligence
- Journal:
- Proceedings of the AAAI Conference on Artificial Intelligence More from this journal
- Volume:
- 35
- Issue:
- 9
- Pages:
- 7647-7656
- Publication date:
- 2021-05-18
- Acceptance date:
- 2020-12-02
- Event title:
- 35th AAAI Conference on Artificial Intelligence: AAAI 2021
- Event location:
- Virtual event
- Event website:
- https://aaai.org/Conferences/AAAI-21/
- Event start date:
- 2021-02-02
- Event end date:
- 2021-02-09
- Language:
-
English
- Keywords:
- Pubs id:
-
1166170
- Local pid:
-
pubs:1166170
- Deposit date:
-
2021-03-06
Terms of use
- Copyright holder:
- Association for the Advancement of Artificial Intelligence
- Copyright date:
- 2021
- Rights statement:
- © 2021, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
If you are the owner of this record, you can report an update to it here: Report update to this record