Abstract

This paper considers the preview control problem for discrete-time systems. It investigates the problem via game theory and dynamic programming. Different from the existing results, on one hand, the proposed approach is suitable for dealing with the corresponding multiple preview channels problem; on the other hand, the approach provides a possibility in explaining how the preview controller improves the performance and why the performance will be saturated.

1. Introduction

Preview control is a means of using the future information of the reference or disturbance input for control. In the context of tracking, it amounts to tracking a delayed reference. As for the preview disturbance, it is a signal which can be obtained before it disturbs the system. It thus can be redefined as a delayed disturbance. Accordingly, the preview information often appears in the form of the delayed exogenous disturbance or reference signal. The preview control problem thus falls into the category of the delayed control problem. The delayed magnitude of the reference or disturbance is usually referred to as the width of the preview window and it reflects the ability of the system to obtain information in advance.

Numerous facts have shown that the preview control can improve the system performance effectively. For instance, semiactive and active suspension systems can always provide the better closed-loop performance including ride comfort and the handling performance than passive suspension because the former exploit the preview information for control [13]. Hence, preview control has extensive applications such as path following [4], bottom following for autonomous underwater vehicles [5], navigation [6], and robot trajectory control [7], which thus attracts lots of attention with respect to how to utilize the preview information [811]. So far, the optimal preview control theory has been relatively mature [2] and the preview control theory has also been developed greatly. The result of Tadmor and Mirkin [10, 11] is the most beautiful one to resolve the preview control problems. Unfortunately, it seems not to attack the multiple-preview cases in that there does not exist a pair of state and costate of the Hamilton Jacobi system that the former typically captures the contribution of past inputs/initial data, whereas the costate captures the contributions of future inputs. Or rather, there is not a suitable instant to separate the past inputs/initial data and future inputs completely. An alternative comprehensive result for the continuous-time system is proposed by Kojima and Ishijima [9]. It can settle a generalized class of delay system covering the multiple input delay and preview control problem, yet the infinite-dimensional system theory and operator manipulation employed there are actually very abstract and complicated to understand. Besides the pure state-space approaches mentioned previously, Meinsma and Mirkin [12] also give another complete solution to the multiple I/O delays control problem, which is converted into a series connection of Adobe problem to solve. The solution can provide an insight into the effect of the lag on the controller and has the remarkable physical sense. Adobe input problem, as its special case, covers the preview problem. Yet, given that the transfer function/matrix is a critical intermediate to solve the problem, we know that the approach is actually confined to settle the problem for the time-invariant system. There seem few studies dealing with the finite horizon preview control problem [8]. Note that [8] provides no necessity of the problem. In addition, there seem few works mentioning how the preview control performance is improved and the explicit evidence that the performance will be saturated as the width of the preview window increases.

Compared with the existed literatures [812], the property of the proposed solution procedure is still very attractive. Our approach borrows the idea of dynamic programming. It can attack the multiple-preview case, where the preview may be from a channel or multichannel, so it can resolve the varying preview problem. What is more interesting, the structure of our solution is more similar to that of the standard controller. In detail, the state gain of the proposed preview controller is in the form of , where the matrix admits an -like Riccati equation with the same dimension as the state.

We will organize the paper as follows. In Section 2, we propose the preview control problem. Sections 3 and 4 are devoted to the analysis and solution of the problem, respectively. A numerical example is provided in Section 5. Some conclusions are achieved in Section 6.

Notation. In the paper, the superscript means matrix transpose. is the 2-norm in . A quadratic form will be denoted as and with , and is the identity matrix. represents the column vector stacked by . and only imply the linear and quadratic function over , respectively.

2. Discrete-Time Preview Control Problem

Consider the system where , , , , are matrices with compatible dimensions, is the width of the preview window, is the state, is the control input, is the previewable information, and is the regulated signal.

The finite-horizon preview control problem for the system (1) can be stated as follows.

For a prescribed , find a control-law such that is the control terminal time, and is the positive semidefinite weight matrix over the terminal state.

Equation (3) shows a remarkable difference from a general performance index that the involved and are in the different time horizon, which originals from the fact that actually does not affect the system. The difference will be actually shared with the infinite-horizon preview control problem. However, due to , the infinite-horizon preview control problem for the system (1) is as follows.

For a prescribed , find a control-law such that Equation (2) means that the control-law we pursued is full-information. The full-information controller achieves (3) or (4) implies that the controller regulates the state so as to minimize the energy of the regulated signal when the system encounters the worst-case preview information.

In order to assure that the underlying problem is well defined, we have to make the standard assumptions.(i) is stabilizable.(ii) has no invariant zeros on the unit circle and is left invertible.

In addition, to simplify the derivation, we make an orthogonal assumption without loss of generality as(i) .

3. Analyze the Problem via Zero-Sum Game Approach

The preview control problem is so involved that the game theory will be introduced. It will provide a preliminary analysis for solving the problem.

It should be noted that the standard control problem has been connected with a game problem, where the cost functional of two players is subject to (1) with , is the minimizing player, and is the maximizing player [13, 14].

Analysis shows that the finite preview control problem reserves the essence of the game above except that the information pattern is restricted. We thus associate the problem with the game which has cost functional of two players subject to (1), where the minimizing player is and the maximizing player is . Different from the standard control problem, is previewable in finite time interval . In other words, the action of the player at each time always takes effect after units of time. Strategies of the players are confined to have the information patterns as which means that, at each time , the two players have access to as well as and the strategy of is available for the player while the player knows nothing about the strategy of . For ease of use, the aforementioned information pattern will be also called the state feedback hereafter in the paper.

It is a fact that whatever the strategies before stage are, decisions at stage can never influence the states, the strategies, and the part before stage in the game. We thus borrow the idea of dynamic programming to investigate the game problem (6) related to the preview problem.

At stage, collect terms influenced by two players , in cost function (6) and write them together as that is not involved and is quadratic over follows from (1) and (9). Rewrite as where is a positive matrix from . means the linear homogeneous term over a variable , denotes a quadratic homogeneous term over , and is a linear function with respect to . They are explicit via completion of square over . In view of , it is clear that the optimal solution with the state feedback structure as (7) to is with any since never disturbs the system before .

At stage, combine with (10) together the terms in impacted by , as With a similar argument to , there will hold where and are used to represent a linear function and a quadratic one in variables , , respectively, and is a homogeneous linear with respect to , . , and can be also explicit. is determined by terms and . Likewise, if , then where any is still the optimal solution to (11).

Repeat the above process until , and we will find that the terms in (6) involving , are a quadratic form as With reminiscence that is available for , we have where stand for “linear” and “quadratic,” respectively. Note that is a homogeneous linear with respect to , is a homogeneous linear with respect to . , and can be still explicit. Different from instant before , from instant, there needs to be assured that the player has an adverse impact on so that the noncooperative game is kept. Hence, should hold simultaneously beside .

The case at , , bears the analogous analysis to the stage .

Based on the preceding analysis, we can achieve such a conclusion which will play an important role in solving the game problem as well as the original preview control problem.

Proposition 1. Consider the game problem where , subject to (1). If the information pattern of strategies , is restricted to be full-information, then can be expressed as where is a quadratic form over as

In Proposition 1, it is not important what exactly are. Here, we just want to say that can be expressed by such a quadratic form. In the sequel, we will characterize those parameters as well as the saddle point.

From (15), it is obvious that Applying Proposition 1 to leads to Plugging (19) into (18) and completing square over , , we will have the following relations: In the above, Observing (20)–(24), it is not hard to find that the left-hand side of (20)–(24) will be calculated step by step as we have initial values , , , .

As are nonsingular, it is straightforward to get that in is related to and updated as where Moreover, we have as Note that we can take any in the derivation above, so a similar conclusion holds for any . Of course, the stage is only a special case including one decision.

For all , , are nonsingular, we can achieve where is as shown in (17) and is a quadratic form totally determined by the initial data.

Theorem 2. Consider the game problem (6) subject to (1). If (29) has a solution such that , , then , as in (32) are saddle point of in (6).

Proof. As (29) has a solution such that , , it is easy to get that where means that for all , is replaced by in the cost function , so does . The proof is complete.

4. Finite Horizon Preview Controller

Now we start with giving the solution to the finite-horizon preview control problem for the discrete-time system (1) and the sufficient and necessary condition thereof.

Theorem 3. Consider the preview control problem with (3) for the state-space model (1) with zero initial data. The problem is solvable if and only if (C1)recursive equation (29) has a positive semidefinite solution,(C2) .
Moreover, if the problem is solvable, then (31) provides the preview controller satisfying (3).

Proof. For any . As (29) has a positive semidefinite solution , there holds . As , that , are the controllers such that (3) holds for the zero-initial data which follows from Theorem 2. The sufficient part is complete.
As for the necessity of Theorem 3, we can apply the existing full-information control results [15] for the augmented system corresponding to (1).
We have the following algorithm to realize the preview controller provided by Theorem 3 in finite horizon.(1)Set ;(2)set ;(3)use (29) to compute and check its positive semidefiniteness. If it holds, go on; otherwise, go to step 1 and reset a larger value than for ;(4)use (21) and (20) to compute and ;(5)use (24) to compute and check its negativeness. If it holds, go on; otherwise, go to step 1 and reset a larger value than for ;(6)use (31) to compute the controller ;(7) , if , go to step 2; otherwise, end.
Why the preview control can improve the performance of the close-loop system? To answer the question, a similar idea will be utilized to solve the finite-horizon preview control problem for the system (1) with , namely, the standard full-information control problem in finite horizon. We will omit the detail of derivation and provide the control law as well as the worst-case disturbance.

Corollary 4. Consider the preview control problem with (3) for the state-space model (1) with and zero-initial data. For a given , the full-information control law achieving (3) with exists if and only if (i)recursive equation has a positive semidefinite solution;(ii) where Moreover, there are the controller and the worst-case disturbance as follows:

Corollary 4 is a standard result for the full-information control [15]. It is the special case of Theorem 3 with .

As for comparing the optimal control laws (37) with (31), the worst-case disturbances (38) with (32), and the Riccati equations (35) with (29), we will find that, for the preview case, the -step delay enables the current control law to take not only the effect before units of time but also additional instant disturbance information into account.

Denote and then has the form as Comparing (40) with (36), we will find that (29) is more complex than (35). In detail, the right side of (35) only involves information at instant while the right side of (29) involves information from instant to instant. Observing (40), we will see that includes additional two terms, one is the term directly relating to , the other is the term directly connecting with .

Remark 5. The formula (40) together with the formula (36) shows that using the previewable disturbance makes the performance improvement possible in finite horizon. However, it should be noted that the performance will achieve saturation for some preview length , which stems from the structure of the control gain .

Given that the proceeding analysis works on the case with any preview length , yet it is not hard to find that the larger the preview length, the more complicated the problem. A fact should be noted that too much preview information may not help to achieve better performance. As a matter of fact, for the large enough preview window, the further the information away from the current position, the weaker the impact. In view of (26), is less than , so , where denotes a bounded entry whose exact form is not relevant at the moment) has weaker weight as increases. This analysis tells us that some tradeoffs may be reasonable and beneficial for the large preview problems, and it may also explain the saturation phenomenon mentioned by Tadmor and Mirkin [10] and by Mirkin and Meinsma [16] to a certain extent.

5. Numerical Example

To illustrate our results and their advantage based on Theorem 3, consider the preview problem for the plant (1) with We are interested in the effect of the preview length on the achievable performance. Applying Theorem 3 and the standard -iteration to this system leads to the optimal achievable for each . The results are provided in Figure 1.

It shows that the achievable performance is improved (from in the preview-free problem to with ) but any further increase of has no effect on the achievable performance after some finite ; that is, the achievable performance will be saturated as a function of the preview width .

6. Conclusion

The paper investigates the preview control problem for the discrete-time systems. It settles the problem via completion of square and dynamic programming. The idea is suitable for the multiple preview channels and time-variant preview problems. It makes us realize another interesting problem: how to achieve the better performance via utilizing the suitable amount of preview information.

Disclosure

Parts of this work were presented at the 7th Asian Control Conference.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

This work is supported by the National Natural Science Foundation (NNSF) of China under Grant 61203045.