Sampled Characteristic Modeling and Forgetting Gradient Learning Algorithm for Robot Servo Systems

Bi, Hongbo; Chen, Dong; Li, Yanjuan; You, Ting

doi:https://doi.org/10.1155/2022/7259504

Mobile Information Systems


Special Issue: Intelligent Control in the Industrial Internet

Research Article | Open Access

Volume 2022 | Article ID 7259504 | https://doi.org/10.1155/2022/7259504


Academic Editor: Sai Zou

Received 12 Jun 2022

Revised 29 Jul 2022

Accepted 03 Aug 2022

Published 04 Oct 2022

Abstract

Robot servo systems exhibit nonlinear coupling with multidimensional characteristics, which poses a challenge to existing modeling and identification techniques. For a class of robot servo systems that operate repetitively over a prespecified finite time interval, a low-order sampled characteristic modeling method is derived in this work. The characteristic parameters are allowed to vary along both the time axis and the iteration axis, and the forgetting gradient learning algorithm is used to estimate them. The effectiveness of the proposed algorithm is supported by theoretical analysis and numerical simulations.

1. Introduction

As an important part of the industrial Internet system, robots are attracting more and more attention. Existing robot control methods mostly depend on models. Because of the dynamic characteristics of the controlled objects and the complexity of the control tasks and operating environment, it is often difficult to establish accurate mathematical models. Nonnegligible unmodeled dynamics place higher demands on system models and on the robustness of the closed-loop system. To reduce the impact of unmodeled dynamics on control performance, most existing methods build high-order models of the system. Modeling and control of high-order systems, however, is extremely complex and difficult to implement. The characteristic modeling method offers a feasible way to model complex systems. It does not need a precise dynamic model of the system, nor does it rely on model reduction or linearization. Instead, it expresses the model as a low-order time-varying difference equation that takes into account factors such as the system state, the external environment, and the control variables, and it compresses the dynamic information into the characteristic parameters. The Euler approximation is often used to discretize the continuous system, which gives the equivalent characteristic model [1–6]. The characteristic model can also be obtained by exact discretization, which gives a sampled characteristic model of the system. Without restrictions on the sampling interval and control tasks, a characteristic model with rapidly changing (even abrupt) characteristic parameters may be obtained [7, 8].

The stochastic gradient algorithm and its variants are popular for training network weights. Compared with the recursive least squares algorithm, they avoid matrix operations and require little online computation, which has attracted much attention [9–17]. Consistent convergence of the parameter estimates can still be obtained for the stochastic gradient algorithm under weaker excitation conditions. Martingale theory and stochastic process theory are powerful tools for analyzing the convergence of recursive algorithms. In the published results, most methods target stationary systems, in which the parameters to be estimated are constant.

When dealing with time-varying systems, stochastic gradient algorithms are found to be unable to track time-varying parameters. In time-varying system identification, more attention is therefore given to how recursive algorithms can be corrected so that they track time-varying parameters effectively and converge faster. It is well known that if the law governing the parameter variation is unknown, consistent convergence of the parameter estimates cannot be obtained [18–21]. Based on this conclusion, one abandons the attempt to track time-varying parameters exactly and instead constructs and analyzes corrected recursive algorithms, with the aim of obtaining smaller upper bounds on the estimation error of the time-varying parameters. Common corrections are weighted recursive algorithms, including rectangular windows and exponential windows (forgetting factors). For irreducible nonstationary processes, these results evaluate the parameter estimation accuracy more precisely, which is important in practical engineering applications.
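The two windowing schemes mentioned above differ only in how they weight past data. The following is a minimal illustrative sketch; the function names, the window length, and the forgetting factor are hypothetical choices and are not values taken from this paper.

```python
import numpy as np


def rectangular_weights(k: int, window: int) -> np.ndarray:
    """Weight 1 on the most recent `window` samples out of k, 0 on older ones."""
    idx = np.arange(1, k + 1)
    return (idx > k - window).astype(float)


def exponential_weights(k: int, lam: float) -> np.ndarray:
    """Weight lam**(k - i) on sample i, so older data is discounted geometrically."""
    idx = np.arange(1, k + 1)
    return lam ** (k - idx)


k = 200                                        # number of samples seen so far (assumed)
w_rect = rectangular_weights(k, window=20)
w_exp = exponential_weights(k, lam=0.95)

# The sum of the weights is the effective number of samples being averaged.
# For the exponential window it approaches 1 / (1 - lam) as k grows.
effective_rect = w_rect.sum()
effective_exp = w_exp.sum()
print(effective_rect, effective_exp)
```

With these choices both windows average over roughly the same amount of data, but the exponential window needs no buffer of past samples, which is one reason forgetting factors are convenient in recursive algorithms.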

When the system parameters are iteration-independent and the time-varying system runs repeatedly over a finite interval, the parameters change over time within each run but repeat the same change in the next run, so the time-varying parameters are invariant along the repetition axis. "Recursive" algorithms are then constructed along the repetition axis, and the learning algorithm gives complete estimates regardless of whether the parameters change slowly, change quickly, or even jump [22, 23]. In practical situations, however, system parameters often change not only along the time axis but also with the number of iterations, that is, they are iteration-dependent. If the way the time-varying parameters change with the number of iterations is unknown, then, as with recursive algorithms, an iterative learning identification algorithm based on the repetition-invariance principle cannot track the system parameters effectively. We therefore introduce a forgetting factor into the iterative learning algorithm and propose the forgetting gradient learning algorithm to estimate iteration-dependent, time-varying system parameters. Then, under repetitive persistent excitation conditions, the performance of the proposed algorithm is analyzed, and simulation examples illustrate the effectiveness of the forgetting gradient learning algorithm.

2. Description of the Problem

Actual robot servo systems are mostly continuous nonlinear time-varying coupled processes. Consider the following system: where u stands for the input and y for the output, and represents external disturbances.

With the development of computer control technology, the analysis and synthesis of robot servo systems require a sampled model. The following discrete nonlinear time-varying coupled process is also considered:

Here, the initial condition is relaxed and may take arbitrary values.

The goal of this paper is to build a sampled characteristic model for a robot servo system. The iteration-dependent, time-varying characteristic parameters in the model are estimated with the forgetting gradient learning algorithm. To validate the proposed learning algorithm, stochastic process theory is used to analyze its convergence, and numerical examples are given to verify its effectiveness.

The rest of this paper is arranged as follows: Section 3 establishes the sampled characteristic model of the servo system, Section 3.1 presents the forgetting gradient learning algorithm, Section 3.2 gives the convergence analysis of the learning algorithm, Section 4 verifies the results with numerical examples, and Section 5 concludes the paper.

3. Characteristic Modeling for the Servo System

We consider the following discrete nonlinear system: where ; in the same way, we can obtain the following: which denotes , where then

The characteristic modeling method establishes a characteristic model for a nonlinear higher-order system by compressing the characteristic information into the characteristic parameters. Obviously, the lower the order of the characteristic model, the faster the characteristic parameters change. We consider building a first-order sampled characteristic model for the controlled system [7], and we can see that

In general, we can also set up second-order and third-order sampled characteristic models of the controlled system, that is, formula (5) can be written in the form of formula (3).
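To illustrate how a low-order model can absorb nonlinearity into time-varying parameters, the sketch below rewrites a scalar nonlinear plant exactly as a first-order difference equation y(t+1) = a(t) y(t) + b(t) u(t). The plant, the input signal, and the names `plant` and `characteristic_params` are hypothetical and chosen only for illustration; this is not the derivation used in the paper.

```python
import numpy as np


def plant(y: float, u: float) -> float:
    """Hypothetical scalar nonlinear plant, chosen only for illustration."""
    return 0.8 * np.sin(y) + (1.0 + 0.5 * np.cos(y)) * u


def characteristic_params(y: float) -> tuple[float, float]:
    """Return a(t), b(t) with plant(y, u) == a * y + b * u for this plant."""
    # np.sinc(x) = sin(pi x) / (pi x), so np.sinc(y / pi) = sin(y) / y, and it is 1 at y = 0.
    a = 0.8 * np.sinc(y / np.pi)
    b = 1.0 + 0.5 * np.cos(y)
    return a, b


T = 200
y = 0.3                      # arbitrary initial condition
a_hist, max_gap = [], 0.0
for t in range(T):
    u = np.sin(0.05 * t)     # arbitrary input signal
    a, b = characteristic_params(y)
    y_model = a * y + b * u  # first-order time-varying difference equation
    y_next = plant(y, u)
    max_gap = max(max_gap, abs(y_next - y_model))
    a_hist.append(a)
    y = y_next

print(max_gap, min(a_hist), max(a_hist))
```

The first-order form reproduces the plant output up to rounding error, while the parameter a(t) moves with the operating point. This is the trade-off stated above: the model order stays low, and the variation is carried by the parameters.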

3.1. Forgetting Gradient Learning Algorithm

We consider the following single-input single-output (SISO) discrete time-varying system that operates repetitively over a prespecified finite time interval: where denotes the time domain and denotes the iteration domain. and represent the input and output of the system, respectively. is the interference variable. and are time-varying polynomials in the shift operator of the SISO discrete system, where and . and are the unknown parameters. We denote ,, and .

Equation (6) can be rewritten into the following regression model:

Similar to the stochastic gradient algorithm, a forgetting gradient learning algorithm for identifying iteration-dependent time-varying systems is presented:
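As a hedged sketch of an algorithm of this kind, the code below applies a forgetting-factor gradient update along the iteration axis, independently at each time instant t. It assumes a linear regression form y_k(t) = φ_k(t)ᵀ θ_k(t) + v_k(t) and the commonly used normalized update with gain r_k(t) = λ r_{k-1}(t) + ‖φ_k(t)‖². The exact form of the paper's equations may differ, and the function name, dimensions, noise levels, drift model, and random regressors are all assumptions made for illustration.

```python
import numpy as np


def forgetting_gradient_step(theta_hat, r, phi, y, lam):
    """One update along the iteration axis, applied independently at every time t.

    theta_hat: (N, n) estimates from the previous iteration
    r:         (N,)   normalizing gains from the previous iteration
    phi:       (N, n) regressors of the current iteration
    y:         (N,)   outputs of the current iteration
    lam:       forgetting factor in (0, 1]; lam = 1 gives the stochastic gradient form
    """
    r_new = lam * r + np.sum(phi * phi, axis=1)
    innovation = y - np.sum(phi * theta_hat, axis=1)
    theta_new = theta_hat + phi * (innovation / r_new)[:, None]
    return theta_new, r_new


rng = np.random.default_rng(0)
N, n, K, lam = 50, 2, 300, 0.9          # interval length, parameter count, iterations (assumed)
t = np.arange(N)

# True parameters vary along time t and drift slowly along iteration k (assumed random walk).
theta = np.stack([np.sin(2 * np.pi * t / N), 0.5 + 0.3 * np.cos(2 * np.pi * t / N)], axis=1)
theta_hat = np.zeros((N, n))
r = np.ones(N)
err_initial = np.mean(np.linalg.norm(theta - theta_hat, axis=1))

for k in range(K):
    theta = theta + 0.005 * rng.standard_normal((N, n))
    phi = rng.standard_normal((N, n))            # stand-in for measured input/output data
    y = np.sum(phi * theta, axis=1) + 0.05 * rng.standard_normal(N)
    theta_hat, r = forgetting_gradient_step(theta_hat, r, phi, y, lam)

err_final = np.mean(np.linalg.norm(theta - theta_hat, axis=1))
print(err_initial, err_final)
```

Note that the recursion runs over the iteration index k, not over time: each time instant keeps its own estimate and gain, so fast variation along t does not need to be tracked, only the drift from one iteration to the next.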

3.2. Convergence Analysis of Forgetting Gradient Algorithm

The learning algorithm obtains parameter estimates from the input and output data collected when the system runs repeatedly over the operating interval. Because the operating interval is finite, convergence results in the conventional sense cannot be obtained; only repetitive convergence results are available. That is, for , the corresponding notions of convergence are as follows:

Repetitive consistency:

Repetitive boundedness:

When the system parameters are iteration-independent, the learning algorithm yields repetitive consistency of the parameter estimates. When the system parameters are iteration-dependent, we analyze the convergence performance of the forgetting gradient learning algorithm represented by equations (9) and (10).

For fixed time , is denoted as the σ-algebra generated by the input and output data obtained from k repeated operations. To analyze the convergence of the proposed learning algorithm, the following assumptions are made.

Hypothesis 1. satisfies

Hypothesis 2. There exists uniformly bounded with respect to t such that

Hypothesis 3. The following repetitive persistent excitation condition holds: where both and >0, N > n.
From formula (10), we obtain the following:
Then,
where Hypothesis 3 is satisfied, , then
i.e.,
Taking the limit of both sides of this inequality,
For system (6), we define the transition matrix as follows:
The upper bound of is then solved, and this bound is denoted as A(t). Let be the unit eigenvector corresponding to the maximum eigenvalue of the matrix , and we construct the difference equation
then
Taking norms on both sides of the above equation, it follows that , according to (22) and ,
We transpose both sides of this inequality and obtain that
then
for any ,
Taking the trace of the repetitive excitation condition (A3), we obtain the following:
In condition (A3), we multiply by on the left and by on the right and use formulas (15) and (19)–(22) to obtain the following:
It is derived as follows:

Lemma 1. Let the nonnegative sequence satisfy the following relation: where , then where the limit on the right-hand side is assumed to exist.
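A lemma of this type is commonly stated for a recursion of the form x(k+1) ≤ (1 − a(k)) x(k) + b(k) with 0 ≤ a(k) < 1 and a divergent sum of a(k), and it bounds the limit of x(k) by the limit of b(k)/a(k). That form is an assumption here, since the relation itself is not reproduced above. The sketch below checks the equality case numerically with constant coefficients; the values and the function name are hypothetical.

```python
def iterate_recursion(x0: float, a: float, b: float, steps: int) -> float:
    """Iterate x_{k+1} = (1 - a) * x_k + b, the equality case of the assumed relation."""
    x = x0
    for _ in range(steps):
        x = (1.0 - a) * x + b
    return x


a, b = 0.1, 0.02                 # constant coefficients chosen for illustration
x_final = iterate_recursion(x0=5.0, a=a, b=b, steps=500)
limit_bound = b / a              # the bound such a lemma predicts for the limit
print(x_final, limit_bound)
```

In the convergence analysis, x(k) plays the role of the expected squared estimation error, so a bound of this kind translates the noise level and the parameter change rate into a steady-state error bound.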

Suppose that the observation noise and the iteration-dependent rate of change of the system parameters are zero-mean random sequences uncorrelated with the input, and that the following relationships are satisfied:

If the PE condition (A3) is satisfied, the parameter estimation error given by the forgetting gradient learning algorithm is repetitively bounded, as shown below.

Proof. Define the parameter estimation error vector

Assuming that is independent of , and , the following can be obtained by using formulas (7) and (9).

Taking norms on both sides of formula (34)

Taking expectation on both sides of formula (35),

From Lemma 1 and , it is derived as follows:

Here, we give the convergence analysis results of the forgetting gradient learning algorithm in the case of iteration-dependent parameters. According to formula (37), the algorithm converges in the bounded sense, that is, the parameter estimates converge to a bounded neighborhood of the true values, and the bound on the estimation error is given. From formula (37), when = 0, which corresponds to iteration-independent system parameters, we take , and a stochastic gradient learning algorithm is obtained.

In this case, consistent convergence of the parameters can be obtained according to formula (37), that is, the parameter estimates converge to the true parameter values.

4. Numerical Results

This section presents numerical examples demonstrating that the learning identification algorithm can estimate time-varying parameters in dynamic systems, as shown in Figures 1 and 2.

Example 1. Consider the following nonlinear system: The expected trajectory is . Using the sampled characteristic modeling method provided in this paper and the adaptive learning control method provided in reference [7], a first-order sampled characteristic model is established. With the sampling time T = 0.005, the initial value r = 1, and the forgetting factor , we obtain the following:
Tracking performance is shown in Figure 3.
The simulation results show that, although the sampled characteristic modeling method provided in this paper yields fast time-varying characteristic parameters, the system output tracks the desired trajectory efficiently.

Example 2. Consider the following finite-interval time-varying system: where In the simulation, we set the finite interval length N = 1000. For as uniformly distributed random variables on [-0.5, 0.5], the forgetting factor is , . Here, is a function generating random variables that follow the (0, 1) normal distribution. In the learning algorithm given by (8) and (9), we set the initial values , . To examine the convergence performance, we define .
The simulation results are shown in Figures 4–6: the prediction error in Figure 4, the parameter estimation error in Figure 5, and the parameter estimates in Figure 6. Figure 4 shows that the prediction error decreases rapidly as the number of iterations increases. In Figure 5, the parameter estimation error converges to a small neighborhood of zero, and the simulation results show the uniform convergence of the parameter estimates, which come close to the true values.
To demonstrate the effectiveness of the proposed algorithm, the simulation results are compared with those of the stochastic gradient method in reference [17], where the finite interval length N = 1000 4000 and the other conditions are the same as in the above simulation. The errors with respect to every interval are shown in Figure 7. The parameter estimation errors and the parameter estimates after the last recursive process are shown in Figures 8 and 9, respectively.
From the simulation results, bounded convergence is guaranteed, but the identification result is weaker than that of the method presented in this paper.
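The qualitative difference between the two methods can be reproduced in a toy setting. The sketch below is not the simulation reported in this section: it tracks a single scalar parameter at one fixed time instant, with an assumed random-walk drift across iterations, and compares a forgetting factor of 1 (the plain stochastic gradient form) with a forgetting factor of 0.9. The update form, the function name `tracking_mse`, and all numerical values are assumptions.

```python
import numpy as np


def tracking_mse(lam: float, K: int = 2000, seed: int = 0) -> float:
    """Mean squared estimation error over the second half of K iterations at one fixed time t."""
    rng = np.random.default_rng(seed)
    theta, theta_hat, r = 1.0, 0.0, 1.0
    sq_err = []
    for _ in range(K):
        theta += 0.02 * rng.standard_normal()        # assumed random-walk drift across iterations
        phi = rng.standard_normal()                  # scalar regressor (stand-in for real data)
        y = phi * theta + 0.05 * rng.standard_normal()
        r = lam * r + phi * phi                      # lam = 1: gain keeps shrinking; lam < 1: it does not
        theta_hat += phi / r * (y - phi * theta_hat)
        sq_err.append((theta - theta_hat) ** 2)
    return float(np.mean(sq_err[K // 2:]))


mse_sg = tracking_mse(lam=1.0)    # plain stochastic gradient
mse_fg = tracking_mse(lam=0.9)    # forgetting gradient
print(mse_sg, mse_fg)
```

With a forgetting factor of 1 the normalizing gain grows without bound, so the update step shrinks toward zero and the estimate behaves like a running average that lags behind the drifting parameter. A forgetting factor below 1 keeps the gain bounded and the step size away from zero, at the cost of some sensitivity to noise.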

5. Conclusions

Modeling technology is of great importance to robots in industrial Internet systems. The proposed sampled characteristic model and forgetting gradient learning identification method can be used to solve the parameter estimation problem of time-varying systems that operate repetitively over finite intervals. The forgetting gradient learning algorithm is derived for time-varying systems under finite-interval repetitive operation. We prove the repetitive boundedness of the learning algorithm under repetitive persistent excitation conditions and give the estimation error bounds of the proposed algorithm. The simulations also verify the effectiveness of the learning algorithm. Further, we present the convergence analysis results of the stochastic gradient learning algorithm, which yields consistent convergence of the parameter estimates when the parameters are iteration-independent. The main purpose of this paper is to propose this learning identification method and to clarify the connections and differences between learning identification and existing recursive identification algorithms. For completeness of the theory and simplicity of expression, the consistency analysis of the learning algorithm draws on the mature results for recursive algorithms. However, there are still differences between the two kinds of algorithms: the recursive algorithm requires the PE condition along the time domain, whereas the learning algorithm requires the repetitive PE condition, and the assumptions for the convergence consistency of the learning algorithm and the resulting convergence rate estimates are allowed to depend on time. Systematic results for recursive identification are presented in [17], which can inform follow-up studies, including the case of system interference such as colored noise, persistent excitation, improvements of SPR conditions, and convergence rate estimation.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported in part by the Project for Public Interest Research Projects of Science and Technology Program of Zhejiang Province, China (LGF21F010002, LGN21C130001, LGG21F030002, LGN22C140007).

References

[1] H. X. Wu and J. Hu, Theory, Methods and Applications of Characteristic Modelling, National Defense Industry Press, Arlington, VA, USA, 2019.
[2] J. F. Huang, Y. Kang, B. Meng, Y. Zhao, and H. Ji, “Characteristic model based adaptive controller design and analysis for a class of SISO systems,” Science China Information Sciences, vol. 59, no. 5, Article ID 52202, 2016.
[3] S. G. Gao, H. R. Dong, and B. Ning, “Characteristic model-based all-coefficient adaptive control for automatic train control systems,” Science China Information Sciences, vol. 57, no. 9, pp. 1–12, 2014.
[4] T. T. Jiang and H. X. Wu, “Sampled-data feedback and stability for a class of uncertain nonlinear systems based on characteristic modeling method,” Science China Information Sciences, vol. 59, no. 9, Article ID 92205, 2016.
[5] X. Wang, Y. F. Wu, J. Guo, and Q. Chen, “Adaptive terminal sliding-mode controller based on characteristic model for gear transmission servo systems,” Transactions of the Institute of Measurement and Control, vol. 41, no. 1, pp. 219–234, 2019.
[6] T. T. Jiang, “Adaptive control and stability for characteristic model with unmodeled dynamics,” in Proceedings of the World Congress on Intelligent Control and Automation, pp. 216–221, Guilin, China, June 2016.
[7] M. X. Sun, H. B. Bi, and J. Zhang, “Characteristic modeling and adaptive iterative learning control for nonlinear time-varying systems,” Journal of Systems Science and Mathematical Sciences, vol. 36, no. 4, pp. 461–475, 2016.
[8] M. X. Sun, Z. L. Li, and L. J. Yu, “The first-order characteristic models of dynamic systems and adaptive iterative learning control of linear servo systems,” Journal of Systems Science and Mathematical Sciences, vol. 32, no. 2, pp. 666–682, 2012.
[9] X. Y. Peng, L. Li, and F. Y. Wang, “Accelerating minibatch stochastic gradient descent using typicality sampling,” IEEE Transactions on Neural Networks and Learning Systems, vol. 31, no. 11, pp. 4649–4659, 2020.
[10] Y. W. Lei, T. Hu, G. Y. Li, and K. Tang, “Stochastic gradient descent for nonconvex learning without bounded gradient assumptions,” IEEE Transactions on Neural Networks and Learning Systems, vol. 31, no. 10, pp. 4394–4400, 2020.
[11] N. Costilla-Enriquez, Y. Weng, and B. Zhang, “Combining Newton-Raphson and stochastic gradient descent for power flow analysis,” IEEE Transactions on Power Systems, vol. 36, no. 1, pp. 514–517, 2021.
[12] S. A. M. Bin Al Islam, H. M. Abdul Aziz, and A. Hajbabaie, “Stochastic gradient-based optimal signal control with energy consumption bounds,” IEEE Transactions on Intelligent Transportation Systems, vol. 22, no. 5, pp. 3054–3067, 2021.
[13] R. Bitar, M. Wootters, and S. El Rouayheb, “Stochastic gradient coding for straggler mitigation in distributed learning,” IEEE Journal on Selected Areas in Information Theory, vol. 1, no. 1, pp. 277–291, 2020.
[14] Z. Wang and H. Q. Li, “Edge-based stochastic gradient algorithm for distributed optimization,” IEEE Transactions on Network Science and Engineering, vol. 7, no. 3, pp. 1421–1430, 2020.
[15] Z. X. Wu, Q. Ling, T. Y. Chen, and G. B. Giannakis, “Federated variance-reduced stochastic gradient descent with robustness to byzantine attacks,” IEEE Transactions on Signal Processing, vol. 68, no. 7, pp. 4583–4596, 2020.
[16] D. M. Yuan, D. W. C. Ho, and S. Y. Xu, “Stochastic strongly convex optimization via distributed epoch stochastic gradient algorithm,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 6, pp. 2344–2357, 2021.
[17] G. C. Goodwin and K. S. Sin, Adaptive Filtering, Prediction, and Control, Prentice-Hall, Englewood Cliffs, NJ, USA, 1984.
[18] L. Guo, Time-varying Stochastic Systems Stability and Adaptive Theory, Science Press, Beijing, China, 2020.
[19] F. Ding, T. Ding, J. B. Yang, and Y. M. Xu, “Convergence of forgetting gradient estimation algorithm for time-varying parameters,” Acta Automatica Sinica, vol. 28, no. 6, pp. 962–968, 2002.
[20] J. Chen, Y. J. Liu, F. Ding, and Q. Zhu, “Gradient-based particle filter algorithm for an ARX model with nonlinear communication output,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 50, no. 6, pp. 2198–2207, 2020.
[21] K. Y. You, “Recursive algorithms for parameter estimation with adaptive quantizer,” Automatica, vol. 52, pp. 192–201, 2015.
[22] M. X. Sun and H. B. Bi, “Learning identification: least squares algorithms and their repetitive consistency,” Acta Automatica Sinica, vol. 38, no. 5, pp. 698–706, 2012.
[23] M. X. Sun, H. B. Bi, and B. X. Chen, “Learning identification of a class of stochastic time-varing systems with colored noise,” Control Theory & Applications, vol. 29, no. 8, pp. 974–984, 2012.

Copyright

Copyright © 2022 Hongbo Bi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
