Abstract

Iterative learning control (ILC) is applied to remote control systems in which communication channels from the plant to the controller are subject to random data dropout and communication delay. Through analysis, it is shown that ILC can achieve asymptotical convergence along the iteration axis, as far as the probabilities of the data dropout and communication delay are known a priori. Owing to the essence of feedforward-based control ILC can perform trajectory-tracking tasks while both the data-dropout and the one-step delay phenomena are taken into consideration. Theoretical analysis and simulations validate the effectiveness of the ILC algorithm for network-based control tasks.

1. Introduction

Iterative leaning control (ILC) is a control method that achieves perfect trajectory tracking when the system operates repeatedly. ILC has made significant progresses over the past two decades [13] and covered a wide scope of research issues such as continuous-time nonlinear system control [4], discrete-time nonlinear system [5], the initial reset problem [6, 7], stochastic process control [8], state delays [9], and data dropout [10].

On the other hand, the research on networked control systems has attracted much attention [11, 12] over the past few years. In network control, two frequently encountered issues are data dropout and communication delays, which are causes of poor performance of remote control systems. A central research area in remote control systems is to evaluate and compensate data dropout and time-delay factors [1316]. Since data dropout and delay are random and time varying by nature, the existing control methods for deterministic data dropout and communication delays cannot be directly applied. Significant research efforts have been made on the control problems for networked systems with random data dropout and communication delays that are modeled in various ways in terms of the probability and characteristics of sources and destinations, for instance [10, 17].

It is in general still an open research area in ILC when remote control systems problems are concerned, except for certain pioneer works that address linear systems associated with either random data dropout [10, 18] or random communication delays [17, 1921]. This paper investigates the implementation of ILC in a remote control systems setting, specifically focusing on compensation when both random data dropout and delays occur at the communication channels between the plant output and the controller.

Since ILC is in principle a feedforward technique, it is possible to send the controller signal before the task is executed. This would not be possible for feedback-based control systems. Hence, the data dropout can be circumvented to certain extent by using network protocols that assure the delivery of data packets. Likewise, the large delay due to large data package can also be avoided when the package is used for repeated task executions, namely, in future executions. ILC task is carried out in a finite-time interval, hence the time-domain stability is not a concern. Thus, unlike most network control works that focus on the stability issue, ILC can be applied to address trajectory-tracking tasks and the learning convergence is achieved in the iteration domain.

On the other hand, the use of data in the feedforward fashion would require the temporal analysis and management of data packages as well as resending the missing data package, which may not be available in certain remote control systems tasks. In this work, we adopt an ILC scheme that uses pastcontrol signals, as well as the error signals that are perturbed by the data dropout and communication delay. The ILC law adopts classical D-type algorithm and a revised learning gain that takes into consideration the probabilities of both data-dropout and communication-delay factors. As a result, the output tracking errors can be made to converge along the iteration axis. The ILC scheme can be applied to linear discrete-time plants with trajectory-tracking tasks.

The paper is organized as below. Section 2 formulates the remote control systems problem. Sections 3 and 4 prove the convergence property of ILC for linear discrete-time plants. Section 5 presents a numerical examples.

Throughout the paper, the following notations are used. Let be the expected value of a random variable, the probability of an event, the Euclidean norm of a vector, and the maximal singular value of a matrix. Let is a discrete time signal with . For any and any , define where .

2. Problem Formulation

Consider a deterministic discrete-time linear time-invariant dynamics system: where “” and “” denote the iteration index and discrete time, respectively. , , and for all are system states, inputs, and outputs, respectively, at the th iteration. , , and are constant matrices with appropriate dimensions.

The schematic diagram of the remote control systems under consideration is shown in Figure 1.

It should be noted that the open-loop system from the ILC input to the plant output is deterministic. The randomness occurs during the data transmission from the plant output to the ILC input. There are two approaches in analyzing the closed-loop system. The first approach is to treat the entire closed-loop system as a random or stochastic process. In such circumstances, the topology of the overall system keeps changing and the control process is either a Markovian jump process or a switching process. Another approach, which is adopted in this work, is to retain the essentially deterministic structure of the original open-loop system, meanwhile model the random data dropout and communication delay into two random factors with known probability distributions. As a consequence, the signals used in ILC, are the modulated plant output with the two random factors.

When the control process is deterministic, an effective ILC law for the linear system (2.1) is where and are control inputs at the th and th iterations, namely, the present trial and the previous trial, respectively. is the output tracking error at the time th time instance of the th iteration. is a learning gain matrix.

Remark 2.1. Note that in the ILC law (2.2), the control signal of the present iteration, , consists of both the pastcontrol input, and the past error with one-step temporal advance, . The current-cycle feedback errors, such as , are not used. Since ILC does not require the current-cycle feedback nor the temporal stability, it is an effective control method for remote control systems problems with random data dropout and communication delay.

To facilitate the ILC design and convergence analysis, data dropout and one-step communication delay are formulated. First formulate the data-dropout problem. Denote a stochastic variable with Bernoulli distribution taking binary values 0 and 1, where denotes an occurrence of data dropout and denotes a normal data communication. The probabilities of are where is a known constant. Here, we assume that is a stationery stochastic process, thus the data dropout rate is independent of the time . In subsequent derivations, we treat as time invariant.

When the data dropout occurs in multiple communication channels, we can similarly define for the th communication channel. Thus, denote the corresponding mathematical expectation is where is known a priori.

Due to the data dropout, the plant output received by the controller at the th iteration is Generally speaking, the occurrences of data dropouts at two iterations are uncorrelated, thus independent. On the other hand, ILC law at the current iteration, the th iteration, uses only signals of the previous iteration, namely, th iteration, as shown in (2.2). Thus with the control input contains data dropouts upto the th iteration. Therefore, and are independent, that is, Without the loss of generality, we assume , namely, the data dropout rate is invariant at different iterations.

Next formulate the one-step communication delay problem. Denote is a random delay factor with Bernoulli distribution, which takes binary values 0 and 1 that indicate, respectively, the presence and absence of an one-step communication delay. Here we assume that is a stationery stochastic process, thus the occurrence of the communication delay is independent of the time . In subsequent derivations we treat as time invariant. With multiple communication channels, we define matrix , where denotes the occurrence of communication delay at the th communication channel. Denote and . The plant output received by ILC with possible communication delay is formulated by where is the communication delay at the th iteration. Without the loss of generality, we assume , namely, the probability of the communication delay is invariant at different iterations. Analogous to data dropout, assume that communication delay at any two iterations are independent, then and are independent, so are and because contains communication delays upto the th iteration through the ILC law (2.2).

It is worthwhile noting that stochastic variables and are not completely independent. A delayed or nondelayed communication occurs only when , that is, no data dropout. Hence, we should have the condition probability for data transmission without delay and the condition probability for data transmission with one-step delay As a consequence, we have The relationship between data drop out and communication delay, (2.11), can be extended to multiple channels at the th iteration

At the th iteration, the output signals perturbed by data dropout and one-step communication delay can be expressed as where is a unity matrix of appropriate dimensions. The mathematical expectation of can be derived using the independence property between , , and , as well as the relationship  (2.12)

The objective of control design is to seek an appropriate ILC law that can take into consideration data dropout and communication delay concurrently. The following ILC law is adopted where where .

3. Convergence Analysis for Left-Invertible Systems

In this section, we derive the convergence property of the ILC (2.15) in the presence of data dropout and communication delays.

In ILC, the learning convergence can be derived in terms of either the output tracking error, , or the input tracking error, . In this section, we prove the learning convergence property of .

Assumption 3.1. For a given output reference trajectory , which is realizable, there exists a unique desired control input such that where is uniformly bounded for all . It is assumed that for all is a random variable with .

Define the input and state errors then from (2.1) and (3.1), we have

From (2.15), using the relationship (2.12), we have

Theorem 3.2. Suppose that the update law (2.15) is applied to the networked control system and satisfied the Assumption 3.1. If there exist satisfying then the input error along the iteration axis, , converges to a bound that is proportional to the factor .

Proof. First, subtracting from both sides of the ILC law (2.15) yields Applying the ensemble operator to both sides of (3.6) and substituting the relationship (3.4) with , we obtain Substituting the state error dynamics (3.3) into (3.7) leads to the following relationship: Define .
Now let us handle the second term on the right hand side of (3.8), which is related to . Applying the ensemble operation to the following relationship:
Substituting the relation (3.9) into (3.8), taking the norm on both sides, the following relationship is derived: where and in this work we choose if .
In order to handle the exponential term with in (3.11), we introduce the norm. From Assumption 3.1, multiplying both sides of (3.10) by and taking the supermum over yield where , , and that is independent of . Since is bounded, so is
Substituting the properties of Lemma A.1 into (3.11) yields
Since , it is possible to choose sufficiently large such that Therefore we can rewrite (3.13) as which implies

Note that is proportional to , namely, the maximum difference between in , which is bounded and small when the reference trajectory is smooth or the sampling interval is sufficiently small. When the probability associated with the data communication delay, , is known a priori, we can further revise the reference trajectory to an augmented one, such that the resulting .

Corollary 3.3. Revising the original reference into an augmented one , then and the ILC (2.15) ensures a zero-tracking error.

Proof. Note that . Suppose that , then the delay-perturbed output should be . In other words, the augmented reference trajectory for should be . As a result, implies . Now replacing in (2.16) with , we can derive Comparing the above expression with (2.16), we conclude that , subsequently , which implies a zero-tracking error according to (3.16).

4. Convergence Analysis for Right-Invertible Systems

In this section, we prove the learning convergence property of .

Assumption 4.1. always exists.

Theorem 4.2. Suppose that the update law (2.15) is applied to the networked control system and satisfied the Assumption 4.1. If then the tracking error along the iteration axis, , converges to a bound that is proportional to the factor .

Proof. First note the relationship: Substituting ILC law (2.15), (2.16), and (4.3) into (4.2) yields Assumption 4.3. Assume .Applying the ensemble operator to both sides of (4.4) and substituting the relationship (4.2), we obtain Taking the norm on both sides of (4.5), the following relationship is derived where and in this work we choose if .
In order to handle the exponential term with in (4.5), we introduce the norm. Multiplying both sides of (4.5) by and taking the supermum over yield Substituting the properties of Lemma A.2 into (4.7) yields where and .
Since , it is possible to choose sufficiently large such that Therefore, we can rewrite (4.8) as which implies

5. Numerical Examples

Consider the following linear discrete-time system: with the initial condition . The desired trajectory is . The tracking period is . The control profile of the first iteration is for . Two sets of probabilities for the data dropout rate and communication delay are considered, which are , , , and , respectively. The learning gain is , which yields , and with respect to the two sets of probabilities. The tracking performance of two ILC algorithms is given in Figure 2, where Max Error denotes the maximum absolute error of each iteration.

6. Conclusion

In this work, we address a class of networked control system problems with random data dropout and communication delay. D-type ILC is applied to handle this remote control systems problem with repeated tracking tasks. Through analysis, we illustrate the desired convergence property of the ILC. Although we focus on one-step communication delay in this work, the results could be extended to multiple delays, which is one of our ongoing research topics. In our future work, we will also explore the extension to more generic nonlinear dynamic processes.

Appendix

Lemma A.1. For all  , for all , for all , the inequality: holds.

Proof. Consequently

Lemma A.2. For all  , for all , for all , the inequalities hold.

Proof. Consequently

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Grants no. 60736021 and no. 60721062), The 973 Program of China (Grant no. 2009CB320603).