A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations

Farzanegan, Behzad; Zamani, Mohsen; Suratgar, Amir Abolfazl; Menhaj, Mohammad Bagher

doi:10.1007/s11768-021-00045-z

A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations

Research Article
Published: 20 April 2021

Volume 19, pages 283–294, (2021)
Cite this article

Control Theory and Technology Aims and scope Submit manuscript

Behzad Farzanegan²,
Mohsen Zamani^3,4,
Amir Abolfazl Suratgar¹ &
…
Mohammad Bagher Menhaj²

435 Accesses
3 Citations
Explore all metrics

Abstract

In this study, an adaptive neuro-observer-based optimal control (ANOPC) policy is introduced for unknown nonaffine nonlinear systems with control input constraints. Hamilton–Jacobi–Bellman (HJB) framework is employed to minimize a non-quadratic cost function corresponding to the constrained control input. ANOPC consists of both analytical and algebraic parts. In the analytical part, first, an observer-based neural network (NN) approximates uncertain system dynamics, and then another NN structure solves the HJB equation. In the algebraic part, the optimal control input that does not exceed the saturation bounds is generated. The weights of two NNs associated with observer and controller are simultaneously updated in an online manner. The ultimately uniformly boundedness (UUB) of all signals of the whole closed-loop system is ensured through Lyapunov’s direct method. Finally, two numerical examples are provided to confirm the effectiveness of the proposed control strategy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Predictive active control of building structures using LQR and artificial intelligence

Article 19 April 2024

Robust control scheme for electrohydraulic servo systems using extended sliding mode observer

Article 12 April 2024

Design of a resilient finite-time based unknown input observer for Lipschitz switched systems

Article 15 April 2024

References

Abu-Khalaf, M., & Lewis, F. L. (2005). Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica, 41(5), 779–791.
Article MathSciNet Google Scholar
Modares, H., Lewis, F. L., & Naghibi-Sistani, M.-B. (2013). Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks. IEEE Transactions on Neural Networks and Learning Systems, 24(10), 1513–1525.
Article Google Scholar
Huang, Y., & Jiang, H. (2015). Neural network observer-based optimal control for unknown nonlinear systems with control constraints. In International Joint Conference on Neural Networks (IJCNN). Killarney. https://doi.org/10.1109/IJCNN.2015.7280596.
Esfandiari, K., Abdollahi, F., & Talebi, H. A. (2017). Adaptive near-optimal neuro controller for continuous-time nonaffine nonlinear systems with constrained input. Neural Networks, 93(195–204), 2017.
MATH Google Scholar
Esmailian, E., Farzanegan, B., Bagher Menhaj, M., & Ghassemi, H. (2018). A robust neuro-based adaptive control system design for a surface effect ship with uncertain dynamics and input saturation to cargo transfer at sea. Applied Ocean Research, 74, 59–68.
Article Google Scholar
Liu, D., Huang, Y., Wang, D., & Wei, Q. (2013). Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming. International Journal of Control, 86(9), 1554–1566.
Article MathSciNet Google Scholar
Bian, T., Jiang, Yu., & Jiang, Z.-P. (2014). Adaptive dynamic programming and optimal control of nonlinear nonaffine systems. Automatica, 50(10), 2624–2632.
Article MathSciNet Google Scholar
Vrabie, D., & Lewis, F. L. (2009). Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems. Neural Networks, 22(3), 237–246.
Article Google Scholar
Werbos, P.J. (1992). Approximate dynamic programming for real-time control and neural modeling. In Handbook of Intelligent Control (pp. 493–526). New York: Van Nostrand Reinhold.
Vamvoudakis, K. G., & Lewis, F. L. (2010). Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica, 46(5), 878–888.
Article MathSciNet Google Scholar
Dierks, T., & Jagannathan, S. (2012). Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update. IEEE Transactions on Neural Networks and Learning Systems, 23(7), 1118–1129.
Article Google Scholar
Vrabie, D., Pastravanu, O., Abu-Khalaf, M., & Lewis, F. L. (2009). Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica, 45(2), 477–484.
Article MathSciNet Google Scholar
Dierks, T., & Jagannathan, S. (2011). Online optimal control of nonlinear discrete-time systems using approximate dynamic programming. Journal of Control Theory and Applications, 9(3), 361–369.
Article MathSciNet Google Scholar
Liu, D., Yang, X., & Li, H. (2013). Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics. Neural Computing and Applications, 23(7–8), 1843–1850.
Article Google Scholar
Yang, X., Liu, D., & Wei, Q. (2014). Online approximate optimal control for affine non-linear systems with unknown internal dynamics using adaptive dynamic programming. IET Control Theory & Applications, 8(16), 1676–1688.
Article MathSciNet Google Scholar
Yang, X., & He, H. (2018). Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances. Neural Networks, 99, 19–30.
Article Google Scholar
Bhasin, S., Kamalapurkar, R., Johnson, M., Vamvoudakis, K. G., Lewis, F. L., & Dixon, W. E. (2013). A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems. Automatica, 49(1), 82–92.
Article MathSciNet Google Scholar
Na, J., & Herrmann, G. (2014). Online adaptive approximate optimal tracking control with simplified dual approximation structure for continuous-time unknown nonlinear systems. IEEE/CAA Journal of Automatica Sinica, 1(4), 412–422.
Article Google Scholar
Liu, D., Wang, D., Wang, F.-Y., Li, H., & Yang, X. (2014). Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems. IEEE Transactions on Cybernetics, 44(12), 2834–2847.
Article Google Scholar
Liu, D., Wei, Q., Wang, Di., Yang, X., & Li, H. (2017). Adaptive Dynamic Programming with Applications in Optimal Control. Springer.
Modares, H., Lewis, F. L., & Sistani, M.-B. N. (2014). Online solution of nonquadratic two-player zero-sum games arising in the \({\rm H}_\infty\) control of constrained input systems. International Journal of Adaptive Control and Signal Processing, 28(3–5), 232–254.
Article MathSciNet Google Scholar
Yang, X., Liu, D., & Huang, Y. (2013). Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints. IET Control Theory & Applications, 7(17), 2037–2047.
Article MathSciNet Google Scholar
Yang, X., Liu, D., Wang, D., & Wei, Q. (2014). Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning. Neural Networks, 55, 30–41.
Article Google Scholar
Cybenko, G. (1989). Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2(4), 303–314.
Article MathSciNet Google Scholar
Hecht-Nielsen, R. (1992). Theory of the backpropagation neural network. In Neural Networks for Perception (pp. 65–93). Academic Press, Inc.
Khalil, H.K. (2002). Nonlinear Systems (pp. 111–174). 3rd edn. Prentice Hall.
Abdollahi, F., Talebi, H. A., & Patel, R. V. (2006). A stable neural network-based observer with application to flexible-joint manipulators. IEEE Transactions on Neural Networks, 17(1), 118–129.
Article Google Scholar
Tamura, S., & Tateishi, M. (1997). Capabilities of a four-layered feedforward neural network: four layers versus three. IEEE Transactions on Neural Networks, 8(2), 251–255.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Distributed Intelligent Optimization Research Lab, Department of Electrical Engineering, Amirkabir University of Technology, Tehran, Iran
Amir Abolfazl Suratgar
Computational Intelligence Lab, Department of Electrical Engineering, Amirkabir University of Technology, Tehran, Iran
Behzad Farzanegan & Mohammad Bagher Menhaj
The School of Electrical Engineering and Computer Science, The University of Newcastle, Newcastle, Australia
Mohsen Zamani
Department of Medical Physics and Engineering, Shiraz University of Medical Sciences, Shiraz, Iran
Mohsen Zamani

Authors

Behzad Farzanegan
View author publications
You can also search for this author in PubMed Google Scholar
Mohsen Zamani
View author publications
You can also search for this author in PubMed Google Scholar
Amir Abolfazl Suratgar
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Bagher Menhaj
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amir Abolfazl Suratgar.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Farzanegan, B., Zamani, M., Suratgar, A.A. et al. A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations. Control Theory Technol. 19, 283–294 (2021). https://doi.org/10.1007/s11768-021-00045-z

Download citation

Received: 29 February 2020
Revised: 07 December 2020
Accepted: 09 December 2020
Published: 20 April 2021
Issue Date: May 2021
DOI: https://doi.org/10.1007/s11768-021-00045-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations

Abstract

Access this article

Similar content being viewed by others

Predictive active control of building structures using LQR and artificial intelligence

Robust control scheme for electrohydraulic servo systems using extended sliding mode observer

Design of a resilient finite-time based unknown input observer for Lipschitz switched systems

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A neuro-observer-based optimal control for nonaffine nonlinear systems with control input saturations

Abstract

Access this article

Similar content being viewed by others

Predictive active control of building structures using LQR and artificial intelligence

Robust control scheme for electrohydraulic servo systems using extended sliding mode observer

Design of a resilient finite-time based unknown input observer for Lipschitz switched systems

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation