Conference item
Towards verifiable and safe model-free reinforcement learning
- Abstract:
- Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied to a variety of decision-making problems, from resource management to robot locomotion, from recommendation systems to systems biology, and from traffic control to superhuman-level gaming. However, RL has experienced limited success beyond rigidly controlled or constrained applications, and successful employment of RL in safety-critical scenarios is yet to be achieved. A principal reason for this limitation is the lack of formal approaches to specify requirements as tasks and learning constraints, and to provide guarantees with respect to these requirements and constraints, during and after learning. This line of work addresses these issues by proposing a general framework that leverages the success of RL in learning high-performance controllers, while guaranteeing the satisfaction of given requirements and guiding the learning process within safe configurations.
- Publication status:
- Published
- Peer review status:
- Reviewed (other)
Actions
Authors
- Publisher:
- CEUR Workshop Proceedings
- Host title:
- Proceedings of the 1st Workshop on Artificial Intelligence and Formal Verification, Logic, Automata, and Synthesis (OVERLAY 2019)
- Volume:
- 2509
- Publication date:
- 2020-03-03
- Acceptance date:
- 2019-11-18
- Event title:
- 1st Workshop on Artificial Intelligence and Formal Verification, Logic, Automata, and Synthesis
- Event location:
- Rende, Italy, November 19–20, 2019
- Event website:
- http://ceur-ws.org/Vol-2509/
- Event start date:
- 2019-11-19
- Event end date:
- 2019-11-20
- Language:
-
English
- Keywords:
- Pubs id:
-
1090824
- Local pid:
-
pubs:1090824
- Deposit date:
-
2020-03-03
Terms of use
- Copyright holder:
- Hasanbeig, M et al.
- Copyright date:
- 2020
- Rights statement:
- © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
- Notes:
- This conference paper was presented at the 1st Workshop on Artificial Intelligence and Formal Verification, Logic, Automata, and Synthesis (OVERLAY 2019), Rende, Italy, November 19‐20, 2019. This is the publisher's version of the article. The final version is available online from CEUR Workshop Proceedings at: http://ceur-ws.org/Vol-2509/
- Licence:
- CC Attribution (CC BY)
If you are the owner of this record, you can report an update to it here: Report update to this record