Abstract
A distributed optimal control algorithm based on adaptive neural network is proposed for the synchronized control problem of a class of second-order nonlinear coupled harmonic oscillators. Firstly, the graph theory is used to establish the coupling relationship between the harmonic oscillator models; secondly, the neural network is used to fit the unknown nonlinearity in the harmonic oscillator model, and the virtual controller and the actual controller are designed based on the backstepping method; then, according to the state error and the controller, the cost function and the HJB function are designed. Since the HJB function cannot be solved directly, the critic neural network approximates its solution. The above two neural networks constitute a simplified reinforcement learning to achieve optimal consistent control of nonlinear coupled harmonic oscillators. Finally, the stability and effectiveness of the scheme are verified by the Lyapunov stability theorem and numerical simulation, respectively.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Yu, D., Yang, M., Liu, Y.-J., Wang, Z., Chen, C.L.P.: Adaptive fuzzy tracking control for uncertain nonlinear systems with multiple actuators and sensors faults. IEEE Trans. Fuzzy Syst. 31, 104–116 (2022)
Yu, D., Long, J., Chen, C.P., Wang, Z.: Bionic tracking-containment control based on smooth transition in communication. Inf. Sci. 587, 393–407 (2022)
Feng, J., Yu, F., Zhao, Y.: Exponential synchronization of nonlinearly coupled complex networks with hybrid time-varying delays via impulsive control. Nonlinear Dyn. 85(1), 621–632 (2016)
Song, Q., Wen, G., Meng, D., Liu, Z.-W., Liu, F.: Distributed impulsive control for signed networks of coupled harmonic oscillators with sampled positions. IEEE Trans. Control Netw. Syst. 8(1), 111–122 (2021)
Yang, Z., Li, S., Yu, D., Chen, C.L.P.: BLS-based formation control for nonlinear multi-agent systems with actuator fault and input saturation. Nonlinear Dyn. 109(4), 2657–2673 (2022)
Jin, X., Wang, Z., Zhao, J., Yu, D.: Swarm control for large-scale omnidirectional mobile robots within incremental behavior. Inf. Sci. 614, 35–50 (2022)
Tuna, S.E.: Synchronization of harmonic oscillators under restorative coupling with applications in electrical networks. Automatica 75, 236–243 (2017)
Ballard, L., Cao, Y., Ren, W.: Distributed discrete-time coupled harmonic oscillators with application to synchronised motion coordination. Control Theory Appl. IET 4, 806–816 (2010)
Ma, G., Ren, J., Liu, Y., Lu, G.: Distributed event-triggered impulsive control for synchronization of coupled harmonic oscillators. IEEE Access 9, 126-231–126-240 (2021)
Zhang, H., Wu, Q., Ji, J.: Synchronization of discretely coupled harmonic oscillators using sampled position states only. IEEE Trans. Autom. Control 63(11), 3994–3999 (2018)
Wang, J., Xia, J., Shen, H., Xing, M., Park, J.H.: \(h_\infty \) synchronization for fuzzy markov jump chaotic systems with piecewise-constant transition probabilities subject to PDT switching rule. IEEE Trans. Fuzzy Syst. 29(10), 3082–3092 (2021)
Shen, H., Hu, X., Wang, J., Cao, J., Qian, W.: Non-fragile \(h_\infty \) synchronization for Markov jump singularly perturbed coupled neural networks subject to double-layer switching regulation. IEEE Trans. Neural Netw. Learn. Syst. 34(5), 2682–2692 (2023)
Marcheggiani, L., Chacón, R., Lenci, S.: On the synchronization of chains of nonlinear pendula connected by linear springs. Eur. Phys. J. Spec. Top. 223(4), 729–756 (2014)
Ren, W.: Synchronization of coupled harmonic oscillators with local interaction. Automatica 44(12), 3195–3200 (2008)
Zhou, J., Zhang, H., Xiang, L., Wu, Q.: Synchronization of coupled harmonic oscillators with local instantaneous interaction. Automatica 48(8), 1715–1721 (2012)
Su, H., Wang, X., Lin, Z.: Synchronization of coupled harmonic oscillators in a dynamic proximity network. Automatica 45(10), 2286–2291 (2009)
Qin, J., Li, M., Shi, Y., Ma, Q., Zheng, W.X.: Optimal synchronization control of multiagent systems with input saturation via off-policy reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. 30(1), 85–96 (2019)
Liu, Y., Wang, Z.: Optimal output synchronization of heterogeneous multi-agent systems using measured input-output data. Inf. Sci. 582, 462–479 (2022)
Li, H., Wei, Q.: Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game. Front. Inf. Technol. Electron. Eng. 23(7), 1010–1019 (2022)
Zhang, D., Yao, Y., Wu, Z.: Reinforcement learning based optimal synchronization control for multi-agent systems with input constraints using vanishing viscosity method. Inf. Sci. 637, 118949 (2023)
Tong, S., Sun, K., Sui, S.: Observer-based adaptive fuzzy decentralized optimal control design for strict-feedback nonlinear large-scale systems. IEEE Trans. Fuzzy Syst. 26(2), 569–584 (2018)
Cui, R., Yang, C., Li, Y., Sharma, S.: Adaptive neural network control of AUVs with control input nonlinearities using reinforcement learning. IEEE Trans. Syst. Man Cybern. Syst. 47(6), 1019–1029 (2017)
Xu, H., Yu, D., Sui, S., Zhao, Y.-P., Chen, C.L.P., Wang, Z.: Nonsingular practical fixed-time adaptive output feedback control of MIMO nonlinear systems. IEEE Trans. Neural Netw. Learn. Syst. 34(10), 7222–7234 (2023). https://doi.org/10.1109/TNNLS.2021.3139230
Guo, X., Yan, W., Cui, R.: Neural network-based nonlinear sliding-mode control for an AUV without velocity measurements. Int. J. Control 92(3), 677–692 (2017)
Yu, D., Chen, C.L.P., Xu, H.: Fuzzy swarm control based on sliding-mode strategy with self-organized omnidirectional mobile robots system. IEEE Trans. Syst. Man Cybern. Syst. 52(4), 2262–2274 (2022)
Xu, H., Yu, D., Sui, S., Chen, C.L.P.: An event-triggered predefined time decentralized output feedback fuzzy adaptive control method for interconnected systems. IEEE Trans. Fuzzy Syst. 31(2), 631–644 (2023). https://doi.org/10.1109/TFUZZ.2022.3184834
Wen, G., Chen, C.L.P., Feng, J., Zhou, N.: Optimized multi-agent formation control based on an identifier–actor–critic reinforcement learning algorithm. IEEE Trans. Fuzzy Syst. 26(5), 2719–2731 (2018)
Yang, Y., Ding, Z., Wang, R., Modares, H., Wunsch, D.C.: Data-driven human–robot interaction without velocity measurement using off-policy reinforcement learning. IEEE/CAA J. Autom. Sin. 9(1), 47–63 (2022)
Qin, J., Li, M., Wang, J., Shi, L., Kang, Y., Zheng, W.X.: Optimal denial-of-service attack energy management against state estimation over an SINR-based network. Automatica 119, 109090 (2020)
Li, Y., Tong, S.: Observer-based neuro-adaptive optimized control of strict-feedback nonlinear systems with state constraints. IEEE Trans. Neural Netw. Learning Syst 33(7), 3131–3145. (2021). https://doi.org/10.1109/TNNLS.2021.3051030
Kennedy, J.O.S.: ‘Introduction to dynamic programming,’’ in Dynamic Programming, pp. 27–49. Springer, Berlin (1986)
Pontryagin, L.S.: Mathematical Theory of Optimal Processes. CRC Press, Boca Raton (1987)
Luo, B., Liu, D., Wu, H.-N., Wang, D., Lewis, F.L.: Policy gradient adaptive dynamic programming for data-based optimal control. IEEE Trans. Cybern. 47(10), 3341–3354 (2017)
Luo, B., Liu, D., Huang, T., Wang, D.: Model-free optimal tracking control via critic-only q-learning. IEEE Trans. Neural Netw. Learn. Syst 27(10), 2134–2144 (2016). https://doi.org/10.1109/TNNLS.2016.2585520
Lan, J., Liu, Y.-J., Yu, D., Wen, G., Tong, S., Liu, L.: Time-varying optimal formation control for second-order multiagent systems based on neural network observer and reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. pp. 1–12 (2022)
Wen, G., Xu, L., Li, B.: Optimized backstepping for tracking control of strict-feedback systems. IEEE Trans. Neural Netw. Learn. Syst. 29(8), 3850–3862 (2018)
Yang, Y., Gao, W., Modares, H., Xu, C.-Z.: Robust actor-critic learning for continuous-time nonlinear systems with unmodeled dynamics. IEEE Trans. Fuzzy Syst. 30(6), 2101–2112 (2022)
Wu, C., Yao, W., Luo, W., Pan, W., Sun, G., Xie, H., Wu, L.: A secure robot learning framework for cyber attack scheduling and countermeasure. IEEE Trans. Robot., 39(5), 3722–3738 (2023). https://doi.org/10.1109/TRO.2023.3275875
Wu, C., Pan, W., Staa, R., Liu, J., Sun, G., Wu, L.: Deep reinforcement learning control approach to mitigating actuator attacks. Automatica 152, 110999 (2023)
Wen, G., Chen, C.L.P., Liu, Y.-J., Liu, Z.: Neural network-based adaptive leader-following consensus control for a class of nonlinear multiagent state-delay systems. IEEE Trans. Cybern. 47(8), 2151–2160 (2017)
Funding
This work was funded in part by the National Natural Science Foundation of China Grant under numbers 62006192, 62373302, 62333009, 72001214, and 62106283 and in part by the China Postdoctoral Science Foundation under number 2021TQ0269.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
No potential conflict of interest was reported by the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gu, Z., Fan, C., Yu, D. et al. Optimal synchronized control of nonlinear coupled harmonic oscillators based on actor–critic reinforcement learning. Nonlinear Dyn 111, 21051–21064 (2023). https://doi.org/10.1007/s11071-023-08957-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11071-023-08957-y