Conference item
Infinite-horizon differentiable Model Predictive Control
- Abstract:
- This paper proposes a differentiable linear quadratic Model Predictive Control (MPC) framework for safe imitation learning. The infinite-horizon cost is enforced using a terminal cost function obtained from the discrete-time algebraic Riccati equation (DARE), so that the learned controller can be proven to be stabilizing in closed-loop. A central contribution is the derivation of the analytical derivative of the solution of the DARE, thereby allowing the use of differentiation-based learning methods. A further contribution is the structure of the MPC optimization problem: an augmented Lagrangian method ensures that the MPC optimization is feasible throughout training whilst enforcing hard constraints on state and input, and a pre-stabilizing controller ensures that the MPC solution and derivatives are accurate at each iteration. The learning capabilities of the framework are demonstrated in a set of numerical studies.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, 563.9KB, Terms of use)
-
- Publication website:
- https://openreview.net/forum?id=ryxC6kSYPr
Authors
- Publisher:
- ICLR
- Journal:
- Proceedings of ICLR 2020 More from this journal
- Publication date:
- 2020-04-06
- Acceptance date:
- 2020-04-06
- Event title:
- International Conference on Learning Representations (ICLR 2020)
- Event location:
- Millennium Hall, Addis Ababa ETHIOPIA
- Event website:
- https://iclr.cc/
- Event start date:
- 2020-04-26
- Event end date:
- 2020-05-01
- Language:
-
English
- Keywords:
- Pubs id:
-
1098738
- Local pid:
-
pubs:1098738
- Deposit date:
-
2020-04-06
Terms of use
- Copyright holder:
- East, S et al.
- Copyright date:
- 2020
- Rights statement:
- © 2020 East, S et al.
- Notes:
- This conference paper was presented at the ICLR 2020 Eighth International Conference on Learning Representations, 26 April - 1 May 2020, Addis Ababa, Ethiopia. This is the final version of the paper. The final version is available online from openreview.net at: https://openreview.net/forum?id=ryxC6kSYPr
If you are the owner of this record, you can report an update to it here: Report update to this record