Data-driven initialization of deep learning solvers for Hamilton-Jacobi-Bellman PDEs
File(s)1-s2.0-S2405896322026805-main.pdf (959.9 KB)
Published version
Author(s)
Borovykh, A
Kalise, D
Laignelet, A
Parpas, P
Type
Journal Article
Abstract
A deep learning approach for the approximation of the Hamilton-Jacobi-Bellman partial differential equation (HJB PDE) associated to the Nonlinear Quadratic Regulator (NLQR) problem. A state-dependent Riccati equation control law is first used to generate a gradient-augmented synthetic dataset for supervised learning. The resulting model becomes a warm start for the minimization of a loss function based on the residual of the HJB PDE. The combination of supervised learning and residual minimization avoids spurious solutions and mitigate the data inefficiency of a supervised learning-only approach. Numerical tests validate the different advantages of the proposed methodology.
Date Issued
2022-12-01
Date Acceptance
2022-06-06
Citation
IFAC-PapersOnLine, 2022, 55 (30), pp.168-173
ISSN
2405-8963
Publisher
Elsevier
Start Page
168
End Page
173
Journal / Book Title
IFAC-PapersOnLine
Volume
55
Issue
30
Copyright Statement
Copyright © 2022 The Authors. This is an open access article under the CC BY-NC-ND license
(https://creativecommons.org/licenses/by-nc-nd/4.0/)
(https://creativecommons.org/licenses/by-nc-nd/4.0/)
Identifier
https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000889050900029&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=a2bf6146997ec60c407a63945d4e92bb
Subjects
Science & Technology
Technology
Automation & Control Systems
Hamilton-Jacobi-Bellman PDE
NLQR
supervised learning
residual minimization
ALGORITHM
Publication Status
Published
Coverage Spatial
Bayreuth, GERMANY
Date Publish Online
2022-11-23