Journal of Applied Mathematics

Volume 2014, Article ID 307809, 9 pages

http://dx.doi.org/10.1155/2014/307809

## A Novel Data-Driven Terminal Iterative Learning Control with Iteration Prediction Algorithm for a Class of Discrete-Time Nonlinear Systems

^{1}Advanced Control Systems Lab, School of Electronic & Information Engineering, Beijing Jiaotong University, Beijing 100044, China^{2}School of Automation & Electronic Engineering, Qingdao University of Science & Technology, Qingdao 266042, China

Received 15 May 2014; Accepted 23 July 2014; Published 12 August 2014

Academic Editor: Claudio H. Morales

Copyright © 2014 Shangtai Jin et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

A data-driven predictive terminal iterative learning control (DDPTILC) approach is proposed for discrete-time nonlinear systems with terminal tracking tasks, where only the terminal output tracking error instead of entire output trajectory tracking error is available. The proposed DDPTILC scheme consists of an iterative learning control law, an iterative parameter estimation law, and an iterative parameter prediction law. If the partial derivative of the controlled system with respect to control input is bounded, then the proposed control approach guarantees the terminal tracking error convergence. Furthermore, the control performance is improved by using more information of predictive terminal outputs, which are predicted along the iteration axis and used to update the control law and estimation law. Rigorous analysis shows the monotonic convergence and bounded input and bounded output (BIBO) stability of the DDPTILC. In addition, extensive simulations are provided to show the applicability and effectiveness of the proposed approach.

#### 1. Introduction

Iterative learning control (ILC) is able to refine the control signals at current iteration by utilizing the input and output (I/O) data of previous iterative operations. As a direct result, the tracking error accuracy is improved as the number of repetitions increases. ILC has attracted much attention in the past three decades due to its simplicity and efficiency [1–3].

In practice, the ultimate control objective for many practical plants, such as rapid thermal processing systems for chemical vapor deposition (RTPCVD) [4], thermoforming ovens [5], and station stop control of a train [6], is the terminal state or terminal output instead of the entire trajectory of the system output. And the only measurement available for some plants [4, 5] is the terminal state or terminal output. For such a control task, the conventional ILC methods, which need to handle the desired output trajectory in a given time interval, are not applicable because the exact measurements of all the system states or outputs over the entire finite time interval are impossible [4] or unnecessary [6].

To overcome this problem, terminal iterative learning control (TILC) [4–6] and point-to-point iterative learning control (PTP-ILC) [7–9] are derived from ILC theory to use intermediate point or terminal point of every run. Recently, PTP-ILC and TILC are becoming a new research direction of ILC both in theory and in practical applications. The existing research results of PTP-ILC and TILC mainly focus on contraction-mapping-based learning law [4, 6] and optimization-based learning law [7–11].

Although the contraction-mapping-based TILC methods in [4, 6] are applicable for nonlinear systems, the proper selection of the learning gain is not a trivial thing in practical applications when there is little prior knowledge about the controlled system. Another limitation is that the TILC schemes proposed in [6] fix the learning gain through all iterations without any tuning and thus lack flexibility and adaptability regarding the expansions of the controlled plant and the exogenous uncertainties.

The optimal TILC [10, 11], where the explicit optimal cost function is given and minimized to design optimization-based TILC algorithm, can guarantee monotonic convergence along the iteration axis. However, it depends on the knowledge of a perfect model. If there is a lack of an accurate model, the monotonic convergence is no longer guaranteed.

As we know, the scale of many industrial processes, such as chemical industry, metallurgy, machinery, and transportation, becomes increasingly large, and the production technology and processes also become more and more complex. As a direct result, modeling these processes by using the first principles or identification methods becomes more and more difficult. Apparently, it will meet many limitations in practice when applying the conventional model-based control approaches. On the other hand, however, many industrial processes generate and store a huge amount of process data containing some valuable state information of the process operations and the equipment [12, 13], which motivates us to study the data-driven control methods.

More recently, a general data-driven optimal terminal iterative learning control approach, which is available for both linear and nonlinear systems, is proposed in [14]. Only the updated control input and the measured system output at the terminal point are utilized for the controller design and analysis. However, it is noted that the proposed approach in [14] is only of one-order with respect to the control input and the terminal tracking error. That is, only the I/O data of the previous one iteration is used in that control approach [14], which may reduce its robustness to iteration-dependent disturbances and uncertainties in practical applications.

Model predictive control (MPC) has been introduced to enhance the robustness of the ILC design [15, 16] by using the predictive I/O data within a prespecified time horizon. It is obvious that the more information is exploited, the more flexible the controller design and the better control performance may become. However, similar to the optimal ILC methods, the predictive ILC [15, 16] also requires that the controlled plant is an exact known linear system, or, at least, an approximate linear model of the controlled plant is known a priori.

In this work, a data-driven predictive TILC (DDPTILC) scheme is proposed by combining the advantages of data-driven optimal TILC [14] and predictive control [17–19]. An equivalent linear iteration-varying data model is developed first for the repeatable nonlinear system with a terminal tracking task. And then, the DDPTILC scheme is designed based on the equivalent data model. The control scheme consists of an iterative learning control law, a parameter iterative estimation law, and a parameter iterative prediction law and is updated iteratively by using the I/O data only. In more detail, the DDPTILC first estimates and predicts the partial derivatives of the controlled plant with respect to control inputs in the iteration domain and then uses the equivalent data model with estimated partial derivatives to predict the terminal output within a prespecified prediction iteration horizon; finally, it calculates the optimal control sequence by minimizing a given objective function.

The proposed approach is a kind of data-driven control method, since the controller design requires only the measured I/O data. Only the information about the lower and upper bounds of the partial derivative of the nonlinear discrete-time system with respect to control input is needed to analyze the bounded input and bounded output (BIBO) stability and terminal tracking error convergence. Numerical simulations show that the proposed approach contributes a better control performance and robustness by using the predictive information within a prespecified iteration horizon.

The rest of this paper is organized as follows. Section 2 is the problem formulation. The DDPTILC scheme is designed and analyzed in Section 3. Section 4 shows the monotonic convergence of the DDPTILC scheme. Section 5 provides numerical simulations to show the effectiveness of the DDPTILC scheme. Finally, some conclusions are given in Section 6.

#### 2. Problem Formulation

Consider the following SISO discrete-time nonlinear system: where is the sampling time index; is the finite time interval of the run-to-run system; denotes the system repetition number; is the system output, where only is measurable at the end of every run; denotes the control input, which is time-invariant at all sampling time in the same run, and is an unknown scalar nonlinear function and continuously differentiable.

The relationship between the input and output sequences can be expressed by the following equations: where is the initial value of system (1), and , are the corresponding nonlinear functions and differentiable to all the arguments.

Define the terminal output difference between consecutive two iterations as

Using (2) and mean value theorem, (3) is rewritten as where and .

Two assumptions are exposed on system (1) to restrict our discussion.

*Assumption 1. *The initial value is identical for every iteration; that is, ., .

*Assumption 2. * has lower bound and upper bound , and or . Without loss of generality, we only discuss the case of in this paper.

In terms of Assumption 1, (4) becomes where , and .

Based on (5), the terminal output prediction equation is given as Let where and denote the prediction vector of the terminal output and predictive control input increment vector at iteration , respectively. is the prediction horizon.

Equation (6) can be rewritten in a compact form:

*Remark 3. *It is noted that the prediction equation (8) is obtained from (5), which constructs an iteration-related linear relationship of control input and terminal output of the original nonlinear system (1). It is a completely equivalent transformation without any omitting, such as higher-order terms.

The control objective is to track a given desired output signal at the single terminal point by generating an optimal control signal .

#### 3. Data-Driven Predictive Terminal ILC Design

Consider the following cost function of control input: where is a weighting factor, and is the desired terminal output.

*Remark 4. *Note that is an important parameter. The proper selection of can guarantee the stability and improve the tracking performance.

Let ; the cost function (9) becomes

Substituting (8) into (10) and using the optimality condition yield the control law
where denotes the unit matrix.

According to the receding horizon principle, the control input at current iteration is constructed as
where .

When , (12) becomes
which is same as the control law in [20].

Since in (12) contains unknown parameters , , and , the parameter estimation algorithm and prediction algorithm should be developed. Here, the cost function of parameter is proposed as follows:
where is a weighting factor.

Minimizing (14) with respect to gives the following projection estimation algorithm:
where is a weighting factor, and is a step size factor.

The other parameters cannot be directly calculated from I/O data till iteration and thus need to be predicted by certain prediction algorithm. There exist many prediction methods, such as the Aström prediction method [21], the self-tuning method [22], and the multilevel hierarchical forecasting method [23, 24]. According to the simulation results in [23, 24], the multilevel hierarchical forecasting method possesses the best predictive error. Thus, the multilevel hierarchical forecasting method [23, 24] is applied here to predict the unknown parameters.

Assume that the estimated values have been calculated by (15) till iteration . Using these estimated values, an autoregressive (AR) model for prediction is constructed as
where are coefficients, and is the model order, which is usually set to be 2~7 [23, 24].

Using (16), prediction equation becomes
where .

Define and . are determined by following equation:
where is a positive constant.

By integrating the control algorithm (12), the parameter estimation algorithm (15), and the prediction algorithm (17)-(18), the data-driven predictive terminal iterative learning control scheme is constructed as follows:
where and are positive constants; and are the estimated values of and , , respectively; , , , .

*Remark 5. *The initial value of partial derivative estimation is generally set to be , since holds for many practical industrial systems, such as temperature control system, pressure control system. The proposed DDPTILC scheme has parameters to be estimated or predicted by merely using the I/O data of the controlled system.

*Remark 6. *The DDPTILC approach is proposed for unknown discrete-time nonlinear systems. It requires merely the measured I/O data of the controlled plant for controller design, and thus it is suitable for many practical industrial processes. In contrast, the norm-optimal ILC [10, 11] and norm-optimal predictive ILC [17] are limited to exactly known linear systems, and the controller should be redesigned by resolving a new complex Riccati equation if there is any little modification or expansion of the controlled plant.

*Remark 7. *Compared with the optimal terminal iterative learning control in [14], the DDPTILC approach utilizes the predictive terminal output information within a prespecified iteration horizon and thus has better robustness to the iteration-varying desired terminal output signal and iteration-dependent uncertainties.

#### 4. Convergence Analysis

In this section, we will discuss the stability and convergence for the DDPTILC scheme (19)–(26).

Theorem 8. *If the discrete-time nonlinear system (1), satisfying Assumptions 1–2, is controlled by DDPTILC scheme (19)–(26) for , then there exists a constant , such that the following properties hold for any .*(a)*The tracking error of the system converges; that is, .*(b)*The system output and the control input are bounded for all iterations.*

*Proof. *There are three parts for the theorem proof, as shown in the following details.*Firstly, We Will Prove the Boundedness of **. *If or or , then the boundedness of is obvious from (20). In the other case, define parameter estimation error as . Subtracting from both sides of the parameter estimation algorithm (19) and using (5) yield

From Assumption 2, we have . Taking absolute value on both sides of (27) yields
Since and , there exists a positive constant such that the following inequality holds:

Noting that (28) and (29), we have

This means is bounded. Since is bounded, the boundedness of can be guaranteed. The boundedness of the prediction values , , is the direct results of algorithms (21)–(24).*Secondly, We Will Prove Convergence of the Tracking Error*. Define terminal tracking error as . Substituting (5) into tracking error equation and using (25)-(26), we have

Taking absolute value on both sides of (31) yields

Let . Since is a semipositive definite matrix, thus and are positive definite matrix for any .

Since , where is adjoint matrix of and is the algebraic cofactor of , the following equation holds:

Equation (33) is bounded as is bounded for all iterations, and its upper bound is a constant independent of .

Since is a positive definite matrix, is a monic polynomial in of degree , is a monic polynomial in of degree , and () is a monic polynomial in of degree . Thus there exists a constant , such that (33) has the same positive sign as for any . In the sequel, there exists a positive constant such that

Combining (32) and (34) gives
Therefore .*Finally, We Will Prove the Boundedness of the System Output ** and the Control Input *. Since is a constant, is bounded.

In following, we prove the boundedness of control input sequence. From (25) and (26), we have
where is a bounded constant since is bounded.

Using (36) recursively, it gives
This equation implies that is bounded.

#### 5. Simulations

In this section, the effectiveness of the proposed DDPTILC approach is illustrated through numerical simulations. The mathematical model is assumed to be unavailable for controller design and just serves as the I/O data generator for the train stop system to be controlled.

Consider an ethanol fermentation process [25], whose mechanistic model in the form of differential algebraic equations (DAE) is described as follows [26]: where is the cell mass concentration; is the substrate concentration; is the product concentration; is the liquid volume of the reactor; and and are two parameters. is limited by the 200 L vessel size. The initial condition is specified as . The batch length is fixed to be 63 hours and divided into equal stages; that is, sampling time is 6.3 hours. The feed rate into the reactor is used for control and constrained by . There is no outflow, so the feed rate must be chosen so that the batch volume does not exceed the physical volume of the reactor.

In order to assess the control performance more extensively, three cases are considered in the following simulations. Case 1 is an idea one; the desired output and the system parameters are iteration-invariant. Case 2 is used to verify the good tracking performance of the proposed DDPTILC where the desired output is iteration-varying. Case 3 is used to verify the good tracking performance of the proposed DDPTILC where the system parameters are iteration-varying.

For the purpose of comparison, the data-driven TILC approach proposed in [14] is also simulated on the ethanol fermentation process. The TILC approach in [14] is given as follows:

In general, the tracking error convergence can be guaranteed using the same controller forms and the same controller parameters to different plants provided that the controller parameters are selected in the proper scopes. This is shown in the simulation by using the same controller parameters in the three cases. The parameters of the proposed DDPTILC approach are , , , , , , , , in the following three cases. And the parameters of the TILC approach (39) are , , , , , in the following three cases.

*Case 1. *The system parameters and the desired output are iteration-invariant; that is, , , and were selected from the literature [25]. The simulation results are shown in Figure 1. Figure 1(a) shows the convergence of the terminal tracking error. The horizon is the iteration number and the vertical axis is the absolute values of terminal tracking errors. Figures 1(b) and 1(c) show the profile of control inputs with respect to the iterations and the entire outputs in the first three iterations.

The simulation results show that two TILC approaches guarantee convergence of the terminal tracking error and BIBO stability. It is obvious that the proposed DDPTILC has a faster convergence rate than that of the TILC approach.

*Case 2. *System parameters are same as that in Case 1. The desired output is iteration-varying; that is, , where denotes the iteration number. The simulation results are shown in Figure 2. Figure 2(a) shows the convergence of the terminal tracking error. Figures 2(b) and 2(c) show the profile of control inputs with respect to the iterations and the entire outputs in the first three iterations.

It is shown that the terminal tracking errors by using two TILC approach converge to a small region when the desired output is iteration-varying. And the DDPTILC approach gives the better terminal tracking performance as shown in Figure 2(a).

*Case 3. *The desired output is same as that of Case 1. System parameters are iteration-varying; that is, , , where denotes the iteration number. The simulation results are shown in Figure 3. Figure 3(a) shows the convergence of the terminal tracking error. Figures 3(b) and 3(c) show the profile of control inputs with respect to the iterations and the entire outputs in the first three iterations.

It is shown that the terminal tracking errors by using two TILC approach converge to a small region when the system parameters are iteration-varying. And the DDPTILC approach gives the better terminal tracking performance as shown in Figure 3(a).

#### 6. Conclusions

This paper presents a new data-driven predictive terminal ILC for a class of discrete-time nonlinear systems only using the terminal output tracking error instead of the whole output trajectory tracking error. The controller design merely depends on the measured I/O data of the plant without requiring plant model information. Rigorous mathematical analysis is developed to illustrate the effectiveness of the proposed approach. Extensive simulation results show the data-driven nature, as well as effectiveness and the applicability, of the proposed predictive terminal ILC further.

#### Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

#### Acknowledgments

This work is supported by National Natural Science Foundation of China (61120106009, 61374102) and the Fundamental Research Funds for the Central Universities (2014JBM005).

#### References

- H. Ahn, Y. Q. Chen, and K. L. Moore, “Iterative learning control: Brief survey and categorization,”
*IEEE Transactions on Systems, Man and Cybernetics C: Applications and Reviews*, vol. 37, no. 6, pp. 1099–1121, 2007. View at Publisher · View at Google Scholar · View at Scopus - S. Arimoto, S. Kawamura, and F. Miyazaki, “Bettering operation of robots by learning,”
*Journal of Robotic Systems*, vol. 1, pp. 123–140, 1984. View at Google Scholar - J. Xu, “A survey on iterative learning control for nonlinear systems,”
*International Journal of Control*, vol. 84, no. 7, pp. 1275–1294, 2011. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet · View at Scopus - J. Xu, Y. Chen, and T. H. Lee, “Terminal iterative learning control with an application to RTPCVD thickness control,”
*Automatica. A Journal of IFAC, the International Federation of Automatic Control*, vol. 35, no. 9, pp. 1535–1542, 1999. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus - G. Gauthier and B. Boulet, “Terminal iterative learning control design with singular value decomposition decoupling for thermoforming ovens,” in
*Proceedings of the American Control Conference (ACC '09)*, pp. 1640–1645, St. Louis, Mo, USA, June 2009. View at Publisher · View at Google Scholar · View at Scopus - Z. Hou, Y. Wang, C. Yin, and T. Tang, “Terminal iterative learning control based station stop control of a train,”
*International Journal of Control*, vol. 84, no. 7, pp. 1263–1274, 2011. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus - T. D. Son, H. Ahn, and K. L. Moore, “Iterative learning control in optimal tracking problems with specified data points,”
*Automatica*, vol. 49, no. 5, pp. 1465–1472, 2013. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus - C. T. Freeman, “Constrained point-to-point iterative learning control with experimental verification,”
*Control Engineering Practice*, vol. 20, no. 5, pp. 489–498, 2012. View at Publisher · View at Google Scholar · View at Scopus - D. H. Owens, C. T. Freeman, and B. Chu, “Multivariable norm optimal iterative learning control with auxiliary optimisation,”
*International Journal of Control*, vol. 86, no. 6, pp. 1026–1045, 2013. View at Publisher · View at Google Scholar · View at Scopus - C. T. Freeman and Y. Tan, “Iterative learning control with mixed constraints for point-to-point tracking,”
*IEEE Transactions on Control Systems Technology*, vol. 21, no. 3, pp. 604–616, 2013. View at Publisher · View at Google Scholar · View at Scopus - T. D. Son and H. Ahn, “Terminal iterative learning control with multiple intermediate pass points,” in
*Proceedings of the American Control Conference (ACC '11)*, pp. 3651–3656, July 2011. View at Scopus - Z. S. Hou and Z. Wang, “From model-based control to data-driven control: survey, classification and perspective,”
*Information Sciences*, vol. 235, pp. 3–35, 2013. View at Publisher · View at Google Scholar · View at MathSciNet - Z. S. Hou and J. X. Xu, “On data-driven control theory: the state of the art and perspective,”
*Acta Automatica Sinica*, vol. 35, no. 6, pp. 650–667, 2009. View at Publisher · View at Google Scholar · View at Scopus - R. Chi, D. Wang, Z. Hou, and S. Jin, “Data-driven optimal terminal iterative learning control,”
*Journal of Process Control*, vol. 22, no. 10, pp. 2026–2037, 2012. View at Publisher · View at Google Scholar · View at Scopus - J. Shi, F. Gao, and T. Wu, “Single-cycle and multi-cycle generalized 2D model predictive iterative learning control (2D-GPILC) schemes for batch processes,”
*Journal of Process Control*, vol. 17, no. 9, pp. 715–727, 2007. View at Publisher · View at Google Scholar · View at Scopus - Y. Wang, D. Zhou, and F. Gao, “Iterative learning model predictive control for multi-phase batch processes,”
*Journal of Process Control*, vol. 18, no. 6, pp. 543–557, 2008. View at Publisher · View at Google Scholar · View at Scopus - B. Kouvaritakis and M. Cannon,
*Nonlinear Predictive Control: Theory and Practice*, IEE, 2001. - L. Magni, D. M. Raimondo, and F. Allgöwer,
*Nonlinear Model Predictive Control: Towards New Challenging Applications*, Springer, New York, NY, USA, 2009. View at Publisher · View at Google Scholar · View at MathSciNet - Y. Xi,
*Predictive Control*, National Defense Industry, 1993. - R. Chi, Z. Hou, D. Wang, and S. Jin, “An optimal terminal iterative learning control approach for nonlinear discrete-time systems,”
*Control Theory and Applications*, vol. 29, no. 8, pp. 1025–1030, 2012. View at Google Scholar · View at Scopus - K. J. Åström and B. Wittenmark,
*Adaptive Control*, New York, NY, USA, Addison-Wesley Longman, 1994. - D. W. Clarke and P. J. Gawthrop, “Self-tuning control,”
*IEE Proceedings*, vol. 126, no. 6, pp. 633–640, 1979. View at Publisher · View at Google Scholar · View at Scopus - Z. G. Han, “The identification of time-varying parameters in dynamic systems,”
*Acta Automatica Sinica*, vol. 10, no. 4, pp. 330–337, 1984. View at Google Scholar · View at MathSciNet - Z. G. Han,
*Multi-Level Recursive Method and Its Applications*, Science Press, 1989. - Z. Xiong, Y. Xu, J. Zhang, and J. Dong, “Batch-to-batch control of fed-batch processes using control-affine feedforward neural network,”
*Neural Computing and Applications*, vol. 17, no. 4, pp. 425–432, 2008. View at Publisher · View at Google Scholar · View at Scopus - J. Hong, “Optimal substrate feeding policy for fed batch fermentation with substrate and product inhibition kinetics,”
*Biotechnology and Bioengineering*, vol. 28, no. 9, pp. 1421–1431, 1986. View at Publisher · View at Google Scholar · View at Scopus