Journal article

NLBAC: a neural ODE-based algorithm for state-wise stable and safe reinforcement learning

Abstract:: Ensuring safety and stability is critical when using reinforcement learning (RL) to control safety-critical systems. However, model-free RL algorithms usually suffer from low sample efficiency, and employing widely used methods like dual ascent to solve constrained RL problems may be challenging due to their sensitivity to hyperparameters. To address these difficulties, in this work, we first propose an augmented Lagrangian-based method to maintain safety and stability through state-wise control Lyapunov function (CLF) and pre-defined control barrier function (CBF) constraints in non-constrained Markov decision process (non-CMDP) settings. To handle tasks without pre-defined CBFs, we extend this method by training a barrier certificate jointly with the control policy, supported by theoretical guarantees to ensure monotonically improved control performance. Moreover, we investigate the issue of infeasibility arising from the presence of multiple state-wise constraints. A practical algorithm, Neural ordinary differential equations-based Lyapunov-Barrier Actor-Critic (NLBAC), is further designed by integrating the proposed method with Soft Actor-Critic (SAC) and leveraging neural ordinary differential equations (NODEs) for system modeling. Comparisons with baselines and ablation experiments demonstrate that our algorithm achieves superior performance in terms of safety and of driving the system towards the desired state, with higher sample efficiency.
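
Illustrative note: the abstract contrasts dual ascent with an augmented Lagrangian approach. The sketch below is not the authors' NLBAC implementation. It applies a textbook augmented Lagrangian update to a toy one-dimensional problem (minimise (x - 2)^2 subject to x <= 1), only to show the general shape of the multiplier update. The objective, the constraint, the function names, and all hyperparameters are assumptions chosen for illustration.

```python
# Toy augmented Lagrangian method for one inequality constraint g(x) <= 0.
# Hypothetical example: minimise f(x) = (x - 2)^2 subject to g(x) = x - 1 <= 0.
# Known solution for this toy problem: x* = 1 with multiplier lambda* = 2.


def f_grad(x: float) -> float:
    """Gradient of the objective f(x) = (x - 2)^2."""
    return 2.0 * (x - 2.0)


def g(x: float) -> float:
    """Constraint function; the feasible set is g(x) <= 0."""
    return x - 1.0


def g_grad(x: float) -> float:
    """Gradient of the constraint function."""
    return 1.0


def augmented_lagrangian(
    x: float = 0.0,
    lam: float = 0.0,
    rho: float = 10.0,
    outer_steps: int = 20,
    inner_steps: int = 200,
    lr: float = 0.01,
) -> tuple[float, float]:
    """Alternate primal gradient descent with a projected multiplier update."""
    for _ in range(outer_steps):
        # Inner loop: approximately minimise the augmented Lagrangian in x.
        for _ in range(inner_steps):
            # The penalty term is only active when lam + rho * g(x) > 0.
            active = max(0.0, lam + rho * g(x))
            x -= lr * (f_grad(x) + active * g_grad(x))
        # Outer step: multiplier update, projected onto lam >= 0.
        lam = max(0.0, lam + rho * g(x))
    return x, lam


if __name__ == "__main__":
    x_opt, lam_opt = augmented_lagrangian()
    print(f"x = {x_opt:.4f}, lambda = {lam_opt:.4f}")
```

In this toy setting the quadratic penalty keeps each inner problem well conditioned for a fixed rho, whereas plain dual ascent relies on a carefully tuned multiplier step size; whether that carries over to a given RL task depends on the problem.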

Publication status:: Published

Peer review status:: Peer reviewed

Cite this record

APA Style

Zhao, L., Miao, K., Cao, H., Gatsis, K., & Papachristodoulou, A. (2025). NLBAC: A neural ODE-based algorithm for state-wise stable and safe reinforcement learning. Neurocomputing, 638, Article 130041. https://doi.org/10.1016/j.neucom.2025.130041

MLA Style

Zhao, L., et al. “NLBAC: A Neural ODE-Based Algorithm for State-Wise Stable and Safe Reinforcement Learning.” Neurocomputing, vol. 638, 2025, article 130041.

Chicago Style

Zhao, L., K. Miao, H. Cao, K. Gatsis, and A. Papachristodoulou. 2025. “NLBAC: A Neural ODE-Based Algorithm for State-Wise Stable and Safe Reinforcement Learning.” Neurocomputing 638: 130041.

Access Document

Files:: Zhao_et_al_2025_NLBAC_A_neural.pdf

(Version of record, pdf, 5.3MB)

Publisher copy:: 10.1016/j.neucom.2025.130041

Authors

Zhao, L

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

Miao, K

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Role:: Author

Cao, H

Role:: Author

Gatsis, K

Role:: Author

Papachristodoulou, A

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Oxford college:: Kellogg College
Role:: Author
ORCID:: 0000-0002-3565-8967

Funder:: Engineering and Physical Sciences Research Council

Funder identifier:: https://ror.org/0439y7842

Publisher:: Elsevier
Journal:: Neurocomputing
Volume:: 638
Article number:: 130041
Publication date:: 2025-03-26
Acceptance date:: 2025-03-15
DOI:: 10.1016/j.neucom.2025.130041
EISSN:: 1872-8286
ISSN:: 0925-2312

Language:: English
Keywords:: augmented Lagrangian method, reinforcement learning, state constraints, control barrier function, neural certificates, policy iteration, neural ordinary differential equations
Pubs id:: 2101117
Local pid:: pubs:2101117
Deposit date:: 2025-05-27
ARK identifier:: ark:/29072/ora_d60d4885e8b44f40900572e2fcee1e7f

Terms of use

Copyright holder:: Zhao et al
Copyright date:: 2025
Rights statement:: © 2025 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

Licence:: CC Attribution (CC BY)
