Abstract

This paper focuses on the suboptimization of a class of multivariable discrete-time bilinear systems consisting of interconnected bilinear subsystems with respect to a linear quadratic optimal regulation criterion which involves the use of state weighting terms only. Conditions which ensure the controllability of the overall system are given as a previous requirement for optimization. Three transformations of variables are made on the system equations in order to implement the scheme on an equivalent linear system. This leads to an equivalent representation of the used quadratic performance index that involves the appearance of quadratic weighting terms related to both transformed input and state variables. In this way, a Riccati-matrix sequence, allowing the synthesis of a standard feedback control law, is obtained. Finally, the proposed control scheme is tested on realistic examples.

1. Introduction

Dynamic bilinear systems have received great attention by researchers in the last decades, from the classical works of Anderson and Moore [1], Feldbaum [2], Tarn [3], or Tarn et al. [4], to the recent works of Al-Baiyat [5], Kotta et al. [6] or Garrido et al. [7]. They form a transitional class between the linear and the general nonlinear systems. The importance of such systems is well-known in diverse areas in which bilinear systems appear naturally for a variety of dynamic processes such as those taking place in electrical systems [8], industrial processes [9], power plants [10], chemical processes [11], nuclear fusion [12], biomedical applications [13], information management [14], mechanical systems [15], and aerospace and avionics [16]. which can not be satisfactorily modeled under the assumption of linearity. On the other hand, the design of controllers for bilinear systems has been an area of major research during the recent years. This growing interest in practical applications requires the development of suitable algorithms for control problems associated with bilinear systems. Many of the results obtained rely on the optimization theory with an appropriate performance index, as it is the case of the proposed control scheme.

Well-known works as Goka et al. [17] and Tarn et al. [18] stated the controllability conditions for standard classes of discrete bilinear systems. In the first paper, an equivalent system description was derived with the equivalent input being dependent on the products of the state and the original input. Since the equivalent feed-forward loop is of a linear nature, the analysis becomes greatly simplified. In the second paper, the uniform controllability of such systems using bounded input was studied. On the other hand, it is well known that the systems are often interconnected and, in many cases, several dynamic subsystems can be distinguished for analysis purposes. This sometimes takes place in computer communication, transportation networks, control and power stations, and so on. There exists important literature dealing with both the associate multivariable and the large-scale problems [19–25]. For instance, standard adaptive control techniques were applied in [26] to compensate the undesirable deviations of the process parameters from their nominal values, being the overall process modeled as a bilinear system prior to the application of the adaptive scheme.

This paper reports suboptimal optimization techniques which are applied to bilinear models. Such models can be considered as direct extensions to the linear continuous interconnected systems stated by the work of Ramakrishna and Viswanadham [27]. A class of invariant discrete-time multivariable bilinear systems with interconnection subsystems is studied. First, the system is shown to be equivalently described by the linear feed-forward multivariable structure with multiplicative inputs including a deterministic disturbance vector. Also, other interpretations are stated. As a previous requirement for optimization, controllability results for the overall systems performance are given by extending those ones given in [17, 18, 27–31]. This is achieved through the above equivalent linear system with the use of centralized control methods. A central coordinator is supposed to be available for the local controllers to supply information to each control station from the remaining ones. That control technique is applied to the optimization of the aforementioned system class. The used performance index involves state-weighting terms only. An explicit solution of Riccati type (which is unusual for such an optimization criterion) is found out as the suboptimal solution by using manipulations on the input/state variables of the problem statement [1, 32]. The importance of this strategy arises from problems where constrains on the input rather than input weights are introduced in the optimization criterion, what leads to a feedback-type control law.

The paper is organized as follows. In Section 2, the problem statement for the class of the interconnected invariant discrete-time bilinear systems is given. Also, controllability necessary conditions obtained by the decomposition techniques are studied. Section 3 is devoted to the development of suboptimal strategies for the regulation criterion, which involves quadratic terms of the state variables. Three transformations of variables are used to turn the loss function into another one, which involves state and input quadratic terms leading to a Riccati-type suboptimal solution. The control strategies may be considered as extensions of those ones used in [1, 29] or [25], for the linear continuous problem. The formulation is accomplished from a centralized viewpoint. In Section 4, two simulated examples are presented. The first one consists of a simulated numerical example; and the second one consists of a real regulation problem of two amplidynes acting on a DC-motor. In the second example, it is not necessary to consider constant current or voltage as it is usual in DC-motor control, with their productβ€”namely, the electric parβ€”a bilinear term describing the resulting dynamical system. Finally, conclusions end the paper. Appendix A is related to the meaning explanation of certain equivalent interpretations of the overall system structure. Appendix B concerns with the development of the transformations of variables involved in the redefinition of the performance index. By convenience, equations and results from the appendices are sometimes invoked in the main body of the paper order not to repeat mathematical material.

2. On the System Structure and Its Controllability Properties

2.1. Controllability of a Class of Bilinear Systems

In [17], the following typical class of bilinear systems was considered: π‘₯(π‘˜+1)=𝐴+𝑒(π‘˜)𝐡π‘₯(π‘˜)+𝐢𝑒(π‘˜);π‘˜=0,1,2,…,π‘₯(β‹…)βˆˆπ‘…π‘›;𝑒(β‹…)βˆˆπ‘….(2.1) Equation (2.1) can equivalently be described, if rank (𝐡,𝐢)=1 and then rank (𝐡)=1 if 𝐢 and β„Ž are both nonzero, with 𝐡=πΆβ„Žπ‘‡ (𝐢,β„Ž being unique 𝑛-vectors within a scalar factor), by the linear feed-forward system with multiplicative input: π‘₯(π‘˜+1)=𝐴π‘₯(π‘˜)+𝐢𝜈(π‘˜);𝜈(π‘˜)β‰‘βŽ§βŽͺ⎨βŽͺβŽ©π‘’(π‘˜)β„Žπ‘‡π‘₯(π‘˜);if𝐢=0,𝑒(π‘˜)[β„Žπ‘‡π‘₯(π‘˜)+1];if𝐢≠0(2.2) with π‘˜=0,1,2,…; and where π‘₯(π‘˜) and 𝑒(π‘˜) are the state and the control at time π‘˜, respectively; and 𝐴 and 𝐡 are real constant coefficient matrices of compatible dimensions.

Note that for arbitrary ranks of 𝐡 or (𝐡,𝐢) in (2.1), the decomposition 𝐡=βˆ‘π‘π‘–=1𝑒𝑖𝑏𝑇𝑖 can be used, with 𝑒𝑖 and 𝑏𝑇𝑖   being, respectively, the 𝑖th unity vector (i.e., the 𝑖th component is unity and the remaining ones are zero) and the 𝑖th row vector of 𝐡. The controllability of the bilinear system was related in that paper to that of the linear system (2.2) as follows.

Theorem 2.1. (summary of results in [17]). For the equivalent systems (2.1)-(2.2), the following controllability results hold.
(i)If the bilinear system (2.1) is controllable, then the pair (𝐴,𝐢) in (2.2) is controllable.(ii)If (𝐴𝑇,β„Ž) is not controllable, then the homogeneous bilinear system is not controllable.(iii)If (𝐴,𝐢) and (𝐴𝑇,β„Ž) are both controllable pairs and if β„Žπ‘‡πΆ, β„Žπ‘‡π΄βˆ’1𝐢≠0, then the homogenous bilinear system is controllable in 𝑅𝑛 outside the origin.(iv)The inhomogeneous bilinear system is not controllable in 𝑅𝑛 if(a)rank [controllability matrix of (𝐴𝑇,β„Ž)] = rank [controllability matrix of (𝐴𝑇,β„Ž);1]; (1≑𝑛-vector with every element being unity),(b)βˆ‘π‘›π‘–=1π‘Žπ‘–=1, with a(β‹…) being the elements of the last row of the 𝐴-matrix in the canonical state-space representation of system (2.2).(v)The inhomogeneous bilinear system is controllable in 𝑅𝑛, if (𝐴,𝐢) is a controllable pair, β„Žπ‘‡πΆ, β„Žπ‘‡π΄βˆ’1𝐢≠0, and at least one of the conditions (iv-a) or (iv-b) is not fulfilled.
Although these results are restricted to a special class of discrete bilinear systems, many natural discrete bilinear systems [10, 15, 33–35] do satisfy these assumptions. More general controllability conditions for bilinear systems can be found in [18, 36].

Remark 2.2. More general results were derived in [30] for the case in which π΄βˆ’1 does not exist. Such a paper studies the controllability conditions for three cases of violation of the conditions given in [37] (namely, (𝐴,𝐢) and (𝐴𝑇,β„Ž) are controllable pairs and π‘˜0=g.c.d{𝑖/β„Žπ‘‡π΄π‘–βˆ’1𝐢≠0,0<𝑖≀𝑛2}=1) (g.c.d denotes the greatest common divisor). Such cases include the following:
(a)rank ⌈(𝐴,𝐢)=𝑛, rank Ξ©(β„Žπ‘‡,𝐴)<𝑛;(b)rank ⌈(𝐴,𝐢)<𝑛, rank Ξ©(β„Žπ‘‡,𝐴)=𝑛; and(c)rank ⌈(𝐴;𝐢)<𝑛, rank Ξ©(β„Žπ‘‡,𝐴)<𝑛, with ⌈(𝐴,𝐢)≑[𝐢,𝐴𝐢,…,π΄π‘›βˆ’1𝐢] and Ξ©(β„Žπ‘‡,𝐴)≑[β„Ž,π΄π‘‡β„Ž,

2.2. Models for a Class of Bilinear Systems with Interconnected Subsystems

In this paper, the following multivariable invariant discrete-time structure of bilinear systems (…,𝐴(π‘›βˆ’1)π‘‡β„Ž]𝑇.) of homogeneous type is considered 𝑆𝑖 with the interconnection bilinear subsystems (π‘†π‘–βˆΆπ‘₯𝑖(π‘˜+1)=𝐴𝑖+𝑒𝑖(π‘˜)𝐡𝑖π‘₯𝑖(π‘˜)+𝐷𝑖𝑧𝑖(π‘˜),(2.3)𝑦𝑖(π‘˜)=𝐢𝑖π‘₯𝑖(π‘˜);π‘–βˆˆπΌ;𝐼=ξ€½1,2,…,𝑝;π‘˜=0,1,2,…(2.4)), which include linear and bilinear coupling terms, given by 𝐻i where π»π‘–βˆΆπ‘§π‘–(π‘˜+1)=𝑀𝑖+𝑒𝑖(π‘˜)𝑃𝑖𝑧𝑖(π‘˜)+𝑝𝑗=1𝐿𝑖𝑗+𝑒𝑖(π‘˜)𝑀𝑖𝑗𝑦𝑗(π‘˜);π‘–βˆˆπΌ;π‘˜=0,1,2,…,(2.5) are, respectively, the state vector of the π‘₯𝑖(π‘˜)βˆˆπ‘…π‘›π‘–,𝑧𝑖(π‘˜)βˆˆπ‘…π‘Žπ‘–,𝑦𝑖(π‘˜)βˆˆπ‘…π‘Ÿπ‘–,𝑒𝑖(π‘˜)βˆˆπ‘…-subsystems, the state vectors of the interaction 𝑆𝑖-subsystems, the corresponding output and input vectors at time 𝐻𝑖, and the different coefficient matrices of appropriate dimensions (see Figure 1).

The linear continuous models of [27] for Figure 1 are more general than those ones above since additional measurements from the interconnection subsystems are used to generate the inputs to the plant. Structures of this type arise from practical systems such as countercurrent heat exchangers having weak nonlinearities modeled as bilinear.

Now, the extended state vector of the overall system is defined as π‘˜ all π‘₯(π‘˜)≑π‘₯𝑇1(π‘˜),π‘₯𝑇2(π‘˜),…,π‘₯𝑇𝑝(π‘˜)ξ‚„π‘‡βˆˆπ‘…π‘›+π‘Ž,whereπ‘₯𝑖(π‘˜)≑π‘₯𝑇𝑖(π‘˜),𝑧𝑖𝑇(π‘˜)ξ‚„π‘‡βˆˆπ‘…π‘›π‘–+π‘Žπ‘–,(2.6) with π‘–βˆˆπΌ and the extended output vector is defined as π‘›β‰‘βˆ‘π‘π‘—=1𝑛𝑗,π‘Žβ‰‘βˆ‘π‘π‘—=1π‘Žπ‘—, with 𝑦(π‘˜)≑[𝑦𝑇1(π‘˜),𝑦𝑇2(π‘˜),…,𝑦𝑇𝑝(π‘˜)]π‘‡βˆˆπ‘…π‘Ÿ, Thus, the following multivariable structure, equivalent to (2.3)–(2.5), is deduced: π‘Ÿβ‰‘βˆ‘π‘π‘—=1π‘Ÿπ‘—. where the matrices of parameters have the following structures: π‘₯(π‘˜+1)=𝐴π‘₯(π‘˜)+𝑝𝑖=1𝑒𝑖(π‘˜)𝐡𝑖π‘₯(π‘˜);π‘˜=0,1,2,3,…,(2.7)𝑦(π‘˜)=𝐢π‘₯(π‘˜);π‘˜=0,1,2,3,…,(2.8) for all 𝐴=𝐴𝑇1,𝐴𝑇2,…,𝐴𝑇𝑝𝑇;𝐴𝑖=𝐴𝑖1,𝐴𝑖2,…,𝐴𝑖𝑖,…,𝐴𝑖𝑝𝐴𝑖𝑖=[𝐴𝑖𝐷𝑖𝐿𝑖𝑖𝐢𝑖𝑀𝑖]𝐴𝑖𝑗=[00𝐿𝑖𝑗𝐢𝑗0](for𝑗≠𝑖)𝐡𝑖=0π‘–βˆ’1,…,0,𝐡𝑇𝑖,0π‘βˆ’1ξ„Ώξ…ƒξ…Œ,…,0𝑇;𝐡𝑖=𝐡𝑖1,𝐡𝑖2,…,𝐡𝑖𝑝𝐡𝑖𝑖=[𝐡𝑖0𝑀𝑖𝑖𝐢𝑖𝑃𝑖]𝐡𝑖𝑗=[00𝑀𝑖𝑗𝐢𝑗0](for𝑗≠𝑖)𝐢=𝐢𝑇1,𝐢𝑇2,…,𝐢𝑇𝑝;𝐢𝑖=0π‘–βˆ’1,…,0,𝐢𝑖,0,0π‘βˆ’1ξ„Ώξ…ƒξ…Œ,…,0ξ‚„(2.9)

Remark 2.3. In general, the number of subsystems 𝑗,π‘–βˆˆπΌ. and the number of interaction subsystems 𝑆(β‹…) are not equal. Thus, assume that 𝐻(β‹…) and 𝑝1 are, respectively, the number of subsystems 𝑝2(𝑝1≠𝑝2) and the number of subsystems 𝑆𝑑,π‘‘βˆˆπΌ1={1,2,3,…,𝑝1}, Then, an extended state vector 𝐻𝑠,π‘ βˆˆπΌ2={1,2,…,𝑝2}. can be obtained similarly, if πœ‰(β‹…) and 𝑝=max(𝑝1𝑝2)
Taking into account the dimensions and the structural zeros of the 𝐼=𝐼1π‘ˆπΌ2.-matrices, they can be decomposed into a sum of dyads as 𝐡𝑖 where 𝐡𝑖=𝑛𝑖+π‘Žπ‘–ξ“π‘—=1𝑒𝑗+π‘™π‘–βˆ’1𝑏𝑇𝑗+π‘™π‘–βˆ’1(𝑖);forallπ‘–βˆˆπΌ,(2.10) is the unity 𝑒𝑗 vector, 𝑗th is the 𝑏𝑇𝑗(𝑖) vector of the 𝑗th-matrix, 𝐡𝑖, and the lower subscripts π‘–βˆˆπΌ are related to the first nonstructural zero rows of the 1𝑗,allπ‘—βˆˆπΌ,-matrix. If they have additional (nonstructural) rows being zero, the corresponding 𝐡𝑖-vectors can be neglected in (2.10). From (2.7), it is clear that the rows of the 𝑒(β‹…)-matrices have the following structures: 𝐡i𝑏𝑇𝑗(𝑖)β‰‘βŽ§βŽͺβŽͺβŽͺβŽͺ⎨βŽͺβŽͺβŽͺβŽͺ⎩[(π‘–βˆ’1)block0,…,0,π‘’π‘‡π‘—βˆ’π‘™π‘–+1𝐡𝑖,(π‘βˆ’π‘–)blocks0,…,0];if1𝑖≀𝑗<1𝑖+𝑛𝑖,[(π‘–βˆ’1)blocksξ„½ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…‚ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ„Ύπ‘’π‘‡π‘—βˆ’π‘™π‘–βˆ’π‘›π‘–+1𝑀𝑖1𝐢1,0,…,π‘’π‘‡π‘—βˆ’π‘™π‘–.𝑛𝑖+1𝑀𝑖(π‘–βˆ’1)πΆπ‘–βˆ’1,0,π‘’π‘‡π‘—βˆ’π‘™π‘–βˆ’π‘›π‘–+1𝑃𝑖,(π‘βˆ’1)blocksξ„½ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…‚ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ…ξ„Ύπ‘’π‘‡π‘—βˆ’π‘™π‘–βˆ’π‘›π‘–+1𝑀𝑖(𝑖+1)𝐢𝑖+1,0,…,π‘’π‘‡π‘—βˆ’π‘™π‘–βˆ’π‘›π‘–+1𝑀𝑖𝑝𝐢𝑖,0];if𝑙𝑖+𝑛𝑖𝑖≀𝑗<1𝑖+𝑛𝑖+π‘Žπ‘–,(2.11) Then one deduces from (2.11) that forallπ‘–βˆˆπΌ.𝑏𝑇𝑗+1π‘–βˆ’1(𝑖)π‘₯β‰‘βŽ§βŽͺβŽͺ⎨βŽͺβŽͺβŽ©π‘π‘‡π‘—(𝑖)π‘₯𝑖(π‘˜);if1≀𝑗≀𝑛𝑖,π‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–+1𝑦𝑙(π‘˜);if𝑛𝑖+1≀𝑗≀𝑛𝑖+π‘Žπ‘–,0;if𝑛𝑖+π‘Žπ‘–+1≀𝑗≀𝑛+π‘Ž,(2.12), when withπ‘˜=0,1,2,3,…, 𝑏𝑇𝑗(𝑖), 𝑝𝑇𝑗(𝑖) denote, respectively, the π‘šπ‘‡π‘—(𝑖,1) vector row of the matrices 𝑗th From (2.7) through (2.12) one obtains 𝐡𝑖,𝑃𝑖,and𝑀𝑖,forall𝐼,1∈𝐼. where the sequences of scalars π‘₯(π‘˜+1)=𝐴π‘₯(π‘˜)+𝑝𝑖=1𝑛𝑖+π‘Žπ‘’ξ“π‘—=1𝑒𝑗+π‘™π‘–βˆ’1πœˆπ‘—π‘–(π‘˜);π‘˜=0,1,2,…,(2.13) are defined by πœˆπ‘—π‘–(π‘˜)πœˆπ‘—π‘–(π‘˜)β‰‘βŽ§βŽͺ⎨βŽͺβŽ©π‘’π‘–(π‘˜)𝑏𝑇𝑗(𝑖)π‘₯𝑖(π‘˜);if1≀𝑗≀𝑛𝑖,𝑒𝑖(π‘˜)ξ‚ƒπ‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)ξ‚„;if𝑛𝑖+1≀𝑗≀𝑛𝑖+π‘Žπ‘–,(2.14) are considered as equivalent (primary) inputs for the equivalent linear systems (2.13) to be determined prior to the generation of the forallπ‘–βˆˆπΌ,π‘˜=0,1,2,…,-sequence in the design implementation.

2.3. Obtaining the 𝑒(β‹…)(β‹…)-Sequences from the 𝑒𝑖(π‘˜)-Sequences

Equations (2.13) and (2.14) constitute a feed-forward linear equivalent system with multiplicative inputs. But the 𝛾𝑗𝑖(π‘˜)𝑝-inputs cannot, in general, be determined from the 𝑒𝑖(π‘˜) equivalent (𝑛+π‘Ž)>𝑝-inputs. Then, it is necessary to take a set of p variables between the last ones, which will be optimized independently. Let us call πœˆπ‘—π‘–(π‘˜) Thus, let us define the sets

πœˆπ‘—π‘–(π‘˜),allπ‘–βˆˆπΌ. such that for the 𝑁𝑖=π‘βˆ’π‘–π‘ˆπ‘+𝑖,π‘βˆ’π‘–=1,2,…,𝑛𝑖,𝑁+𝑖=𝑛𝑖+1,𝑛𝑖+2,…,𝑛𝑖+π‘Žπ‘–ξ‚‡,allπ‘–βˆˆπΌ,(2.15) current sampling instant, π‘˜th and 𝑏𝑇𝑗𝑖(𝑖)π‘₯𝑖(π‘˜)β‰ 0,ifπ‘₯𝑖(π‘˜)β‰ 0,ifπ‘—π‘–βˆˆπ‘βˆ’π‘–,π‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦𝑖(π‘˜)β‰ 0,ifπ‘—π‘–βˆˆπ‘+𝑖,(2.16)

Then from (2.14), one has the auxiliary inputs π‘˜=0,1,2,…. with the real inputs for the nonsaturated case beingπœˆπ‘—π‘–(π‘˜)β‰‘βŽ§βŽͺβŽͺβŽͺ⎨βŽͺβŽͺβŽͺβŽ©ξ‚ƒπ‘π‘‡π‘—π‘–(𝑖)π‘₯𝑖(π‘˜)ξ‚„βˆ’π‘–π‘π‘‡π‘—(𝑖)π‘₯𝑖(π‘˜)πœˆπ‘—π‘–(π‘˜);π‘—βˆˆπ‘βˆ’π‘–;π‘–βˆˆπΌ,ξ‚ƒπ‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)ξ‚„βˆ’1Γ—ξ‚ƒπ‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)πœˆπ‘—π‘–(π‘˜)ξ‚„,π‘—βˆˆπ‘+𝑖;π‘–βˆˆπΌ;π‘˜=0,1,2,3,…,(2.17)

Remark 2.4. Note that the minimum energy control (i.e., 𝑒𝑖(π‘˜)β‰‘βŽ§βŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽͺ⎨βŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽ©ξ‚ƒπ‘π‘‡π‘—π‘–(𝑖)π‘₯𝑖(π‘˜)ξ‚„βˆ’1πœˆπ‘—π‘–(π‘˜);ifπ‘—π‘–βˆˆπ‘π‘–,ifthereexistssomeπ‘—π‘–βˆˆπ‘βˆ’π‘–suchthat𝑏𝑇𝑗𝑖(𝑖)π‘₯𝑖(π‘˜)β‰ 0,ξ‚ƒπ‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦𝑖(π‘˜)ξ‚„βˆ’1πœˆπ‘—π‘–(π‘˜);ifπ‘—π‘–βˆˆπ‘π‘–,ifthereexistssomeπ‘—π‘–βˆˆπ‘+𝑖suchthatπ‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)β‰ 0,0;if𝑏𝑇𝑗𝑖(𝑖)π‘₯𝑖(π‘˜)=0forallπ‘—π‘–βˆˆπ‘βˆ’π‘–andalsoπ‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)=0forallπ‘—π‘–βˆˆπ‘+𝑖,allπ‘–βˆˆπΌ;π‘˜=0,1,2,3,….(2.18) some 𝑒𝑖(β‹…)=0,) has been chosen when the system becomes insensitive to the corresponding control component. In fact, any arbitrary control could be applied.

Remark 2.5. Since there are π‘–βˆˆπΌ equivalent inputs, deals with an over determined problem when the (𝑛+π‘Ž) inputs are solved from them. Their solution cannot generally be satisfied exactly, and there exist many possible ways of defining the β€œbest” approximate solution if there are 𝑝<𝑛+π‘Ž equivalent nonzero inputs. Also, note that the decomposition of the 𝑝<𝑠≀𝑛+π‘Ž-matrices (here, into a sum of dyads, as shown in (2.10)) is not unique. Then, the least-squares technique (which is known to give the best approximation) can be considered by taking into account the overall control term in (2.13) instead of the control itself. Namely, 𝐡𝑖 where 𝑒𝑖(π‘˜)β‰‘βŽ§βŽͺ⎨βŽͺβŽ©πœˆπ‘‡π‘–(π‘˜)𝐡𝑖π‘₯(π‘˜)π‘₯𝑇(π‘˜)𝐡𝑇𝑖𝐡𝑖π‘₯(π‘˜);if𝐡𝑖π‘₯(π‘˜)β‰ 0,0,otherwise,allπ‘–βˆˆπΌ;π‘˜=0,1,2,3,…,(2.19)
Note that in general, the (πœˆπ‘–(π‘˜)≑[βˆ‘π‘–βˆ’1𝑗=1(𝑛𝑗+π‘Žπ‘—)zeros0,…,0,πœˆπ‘™π‘–(π‘˜),…,πœˆπ‘›π‘–,𝑖(π‘˜),πœˆπ‘›π‘–+1,𝑖(π‘˜),…,πœˆπ‘›π‘–+π‘Žπ‘–,𝑖(π‘˜),βˆ‘π‘π‘—=𝑖+1(𝑛𝑗+π‘Žπ‘—)zeros0,…,0]π‘‡βˆˆπ‘…π‘›+π‘Ž,allπ‘–βˆˆπΌ;π‘˜=0,1,2,3,….(2.20)) vector has only nonzero components. Equation (2.19) can be rewritten as π‘˜π‘’π‘–(π‘˜)≑[𝑛𝑖+π‘Žπ‘–ξ“π‘—=1{𝑏𝑇𝑗(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗(𝑖)+ξ‚ƒπ‘π‘‡π‘—βˆ’π‘›π‘–0(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)𝛿𝑗(𝑖)}2]βˆ’1Γ—[𝑛𝑖+π‘Žπ‘–ξ“π‘—=1πœˆπ‘—π‘–(π‘˜){𝑏𝑇𝑗(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗(𝑖)+ξ‚ƒπ‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)𝛿𝑗(𝑖)}],(2.21)

2.4. Two Alternative Interpretation Schemes of the Equivalent Linear System

Equation (2.18) leads to the following equivalences to the linear system (2.13): where𝛿(𝑖)≑1βˆ’π›Ώπ‘—(𝑖),with𝛿𝑗(𝑖)=1if1≀𝑗≀𝑛and0,if𝑛𝑖+1≀𝑗≀𝑛𝑖+π‘Žπ‘–allπ‘–βˆˆπΌ;π‘˜=0,1,2,…. where the π‘₯(π‘˜+1)=𝐴π‘₯(π‘˜)+𝑝𝑖=1𝑐𝑗+π‘™π‘–βˆ’1(π‘˜)πœˆπ‘—π‘–(π‘˜)(2.22)=𝐴π‘₯(π‘˜)+𝑝𝑖=1𝑒𝑗+π‘™π‘–βˆ’1πœˆπ‘—π‘–(π‘˜)+πœ‰(π‘˜),(2.23)-vectors (𝑛+π‘Ž) and 𝑐(β‹…)(β‹…) are defined in Appendix A.

Equations (2.22)-(2.23) provide two alternative interpretations of (2.13). Representation (2.22) involves an extended state-dependent control vector, while (2.23) includes a known deterministic input-dependent vector. These equivalent representations will lead to two different suboptimal control designs in the next section.

2.5. Controllability of the Bilinear System with Interconnections

The controllability outside the origin of the bilinear system (2.3)–(2.5) can be studied by splitting the problem as follows:

(i)controllability of the feed-forward linear system,(ii)capability of the πœ‰(β‹…)-sequences of actually acting upon the equivalent 𝑒(β‹…)(β‹…)-sequences.The following result on controllability conditions is useful as a previous requirement for optimization.

Theorem 2.6. (controllability of the bilinear system involving interconnections). Controllability of the bilinear system (2.3)–(2.5) is ensured on an interval 𝜈(β‹…)(β‹…), [π‘˜ξ…ž,π‘˜ξ…žξ…ž]βˆ©π‘…+, if the following set of conditions holds.
(i)The controllability grammian of the overall system is positive definite in π‘˜ξ…žξ…ž=π‘˜ξ…ž+π‘šπ‘Žπ‘₯1≀𝑖≀𝑝(𝑛𝑖+π‘Žπ‘–)(ii)[π‘˜ξ…ž,π‘˜ξ…žξ…ž]. with max[rank(βˆ‘π‘π‘™=1𝑀𝑠1𝐢1)]β‰₯max[rank(𝐿𝑖𝑗𝐢𝑗)](iii)All the pairs of block diagram, 1≀𝑠≀𝑝,1≀𝑖≀𝑝and1≀𝑗≀𝑝., are nonsingular.(iv)[(𝐴𝑇𝑖,𝑀𝑇𝑖),𝐡𝑖],π‘–βˆˆπΌ, (π‘βˆ—π‘‡π‘—π‘–(𝑖)πΆβˆ—π‘–(π‘˜)β‰ 0 diagram π‘βˆ—π‘‡π‘—π‘–(𝑖), (π΄βˆ’1𝑖,π‘€βˆ’1𝑖)πΆβˆ—π‘–(π‘˜)β‰ 0, allπ‘–βˆˆπΌ, where allπ‘˜ξ…žβ‰€π‘˜β‰€π‘˜ξ…žξ…ž are, respectively, the subvectors of the matrices πΆβˆ—(β‹…)andπ‘βˆ—(β‹…) in (2.3)-(2.4) associated with each subsystem 𝐢(β‹…)and𝐡(β‹…).

Sketch of the Proof
Since the pair 𝑆(β‹…) is controllable for all arbitrary matrix [𝐴+𝐡(β‹…)𝐺(β‹…),𝐡(β‹…)] if the pair 𝐺(β‹…) is controllable [29, 38], then the controllability of the pair [𝐴,𝐡(β‹…)] implies the controllability of the pair [𝐴𝑑,𝐡(β‹…)] if there exists a [𝐴𝑑+𝐴∘,𝐡(β‹…)],-matrix such that 𝐺(β‹…) with 𝐴∘=𝐡(β‹…)𝐺(β‹…), 𝐴∘=π΄βˆ’π΄π‘‘β€‰ block diagram 𝐴𝑑=. For the equality above relating ((𝐴1,𝑀1),(𝐴2,𝑀2),…,(𝐴𝑝,𝑀𝑝)) and 𝐴∘,𝐡(β‹…), to hold, the RouchΓ©-FrΓΆbenius theorem implies that condition (ii) must be fulfilled according to (2.9) and (2.11). Also, since controllability grammian of 𝐺(β‹…) on [𝐴𝑑,𝐡(β‹…)] block diagram controllability grammian [[block diagram [π‘˜ξ…ž,π‘˜ξ…žξ…ž]≑],[block diagram (𝐴1,𝑀1),𝐡1], (𝐴2,𝑀2),𝐡2, [block diagram …, which is positive if proposition (i) holds. This proves sufficiency of (i)-(ii) versus controllability of the equivalent feed-forward system. Propositions (iii)-(iv) ensure from Theorem 2.1 that controllability of the equivalent linear system implies controllability of the system (2.3)-(2.4) by allowing the determination of the scalars (𝐴𝑝,𝑀𝑝),𝐡𝑝]] from 𝑒𝑖(β‹…), πœˆπ‘—π‘–(β‹…), in order to drive each subsystem π‘–βˆˆπΌ from an arbitrary initial point to a predefined final state.

Remark 2.7. (2.5.1) If condition (iv) is such that the matrices 𝑆𝑖 and π΄βˆ’1𝑖, some π‘€βˆ’1𝑖, the same extensions referenced in Theorem 2.1(iii) are applicable (namely, see Remark 2.2).
(2.5.2) Theorem 2.6(iv) ensures the existence of an admissible control sequence which leads to the system outside the hyperplane of π‘–βˆˆπΌ of insensitivity of the subsystem π‘βˆ—π‘‡π‘™π‘–+π‘—π‘–βˆ’1(𝑖)π‘₯𝑖(π‘˜)β‰ 0 to the 𝑆𝑖-control [17, 28]. A weak additive signal can be added to the right-hand side of (2.3) at the 𝑒𝑖(β‹…) sample in case of the failure of condition (iv) in Theorem 2.6 at the next step.

3. Optimization Techniques

In this section, a regulation criterion which involves quadratic weighting terms in the state variables is given [1, 39]. The loss function used is π‘˜th where β€œπ½π‘=12π‘ξ“π‘˜=0π‘₯𝑇(π‘˜)𝑄(π‘˜)π‘₯(π‘˜);𝑄(β‹…)=𝑄𝑇(β‹…)β‰₯0;𝑁<∞,(3.1)” denotes positive semidefiniteness. The reason of choosing a finite-time horizon N is then seen from a practical context viewpoint.

In the sequel, the various inverse matrices which take place are assumed to exist. Because of their structures, this hypothesis is not strong.

3.1. Centralized Control

The overall system comprises subsystems, which are interconnected, and the implementation design must deal with the interactions that exist. Centralized control techniques are used for optimization. In the centralized approach, contrarily to the decentralized case [19, 27, 40], a central coordinator exists to take into account the interactions; namely, the coordinator is supposed to be available to the local controller to supply information to each control station from the remaining ones.

The philosophy involved is to define transformed state and input variables, which allows the redefinition of the loss function including quadratic terms of the redefined state and input variables. The idea of using transformations of variables was first pointed out in [1] for the linear continuous case. Its major interest appears in the case of constrained input sequences because, despite this fact, a recursive Riccati expression can be found leading to the optimal feedback solution. Thus, three transformations of variables are made on the equivalent feed-forward linear system of (2.22) (subsequently (2.23) will be invoked) to redefine the state vector equation of standard type (see (B.2) in Appendix B); that is, β‰₯0 On the other hand, such transformations are applied to the loss function (3.1) to obtain a standard performance index on a given planning horizon 𝑓(π‘˜+1)=𝐴(3)(π‘˜)𝑓(π‘˜)+𝐢(3)(π‘˜)𝜈(3)(π‘˜);π‘˜=0,1,2,….(3.2)

The necessary manipulation on (2.22)–(3.1) to derive (3.2)-(3.3) is detailed in Appendix B.

Now, since (3.3) is a quadratic criterion of standard type (i.e., it involves quadratic terms in both the input and the state variables), a Riccati-matrix sequence may be found, which leads to the optimal solution. The main optimization results are now summarized.

Theorem 3.1. (optimization of the system equivalent inputs). Assume that Theorem 2.6 holds. Then, the optimal equivalent input sequence for the equivalent feed-forward linear system with respect to the loss function (3.1) is given by the following expression: 𝐽𝑁=12π‘ξ“π‘˜=0𝑓𝑇(π‘˜)𝑄(3)(π‘˜)𝑓(π‘˜)+𝜈(3)𝑇(π‘˜)𝑅(3)(π‘˜)𝜈(3)(π‘˜)ξ‚„;𝑁<∞.(3.3) where the optimal equivalent redefined input sequence is πœˆβˆ—π‘—π‘–(π‘˜)=πœˆβˆ—(3)𝑗𝑖(π‘˜+1)βˆ’ξ‚ƒπΆπ‘‡π‘–(π‘˜)𝑄(π‘˜+1)𝐢𝑖(π‘˜)ξ‚„βˆ’1𝐢𝑇𝑖(π‘˜)𝑄(π‘˜+1)𝐴π‘₯(π‘˜)βˆ’π‘’π‘‡π‘—π‘–π‘…(3)βˆ’1(π‘˜+1)π‘Š(2)𝑇(π‘˜+1)𝐴π‘₯(π‘˜);π‘–βˆˆπΌ;π‘˜=0,1,2,…,𝑁,(3.4) with 𝜈(3)βˆ—π‘—π‘–(π‘˜+1)=βˆ’π‘’π‘‡π‘—π‘–ξ‚ƒπ‘…(3)(π‘˜+1)+𝐢(3)𝑇(π‘˜+1)𝑃(π‘˜+2)𝐢(3)(π‘˜+1)ξ‚„βˆ’1×𝐢(3)𝑇(π‘˜+1)𝑃(π‘˜+2)𝐴(3)(π‘˜+1)𝐴π‘₯(π‘˜);π‘˜=0,1,2,…,𝑁(3.5) being the recursive Riccati matrices; that is, 𝑃(β‹…)

Outline of the Proof
It is obvious by direct substitution while dealing with the aforementioned optimization strategies [1, 29, 38] applied to the feed-forward linear equivalent system of (2.22).

Corollary 3.2 (generation of the optimal input sequences). Under the same hypotheses as in Theorem 3.1, consider the sequences of scalars 𝑃(π‘˜)=𝑄(3)(π‘˜)+𝐴(3)(π‘˜)𝑃(π‘˜+1)Γ—{πΌβˆ’πΆ(3)(π‘˜)𝑅(3)(π‘˜)+𝐢(3)(π‘˜)𝑃(π‘˜+1)𝐢(3)(π‘˜)ξ‚„βˆ’1𝐢(3)(π‘˜)𝑃(π‘˜+1)}𝐴(3)(π‘˜);𝑃(𝑁)=𝑄(3)(𝑁);π‘˜=0,1,…,𝑁.(3.6) with 𝑒𝑖(π‘˜)β‰‘βŽ§βŽͺβŽͺβŽͺβŽͺβŽͺβŽͺ⎨βŽͺβŽͺβŽͺβŽͺβŽͺβŽͺβŽ©ξ‚ƒπ‘π‘‡π‘—π‘–(𝑖)π‘₯𝑖(π‘˜)ξ‚„βˆ’1πœˆβˆ—π‘—π‘–(π‘˜);allπ‘—βˆˆπ‘π‘–,π‘–π‘“π‘—π‘–βˆˆπ‘βˆ’π‘–,ξ‚ƒπ‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)ξ‚„βˆ’1πœˆβˆ—π‘—π‘–(π‘˜);allπ‘—βˆˆπ‘π‘–,π‘–π‘“π‘—π‘–βˆˆπ‘+𝑖,0;𝑖𝑓𝑏𝑇𝑗𝑖(𝑖)π‘₯𝑖(π‘˜)=0,π‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)=0βˆ€π‘—βˆˆπ‘π‘–;π‘˜=0,1,2,…,𝑁(3.7) being the πœˆβˆ—π‘—π‘–(π‘˜),π‘˜=0,1,2,… component of the optimal equivalent inputs obtained from Theorem 3.1.
Then, the following propositions hold.
(a) The optimal input sequence with respect to the loss function (3.1) is 𝑗𝑖th if the interval π‘’βˆ—π‘–(π‘˜)=𝑒𝑖(π‘˜),π‘˜=0,1,2,… is allowed as input definition domain, (βˆ’βˆž,∞)βŠ‚β„. (b)allπ‘–βˆˆπΌβ€‰β€‰If|𝑒𝑖(π‘˜)|≀𝑀<∞, then the optimal input sequence with respect to the loss function (3.1) is allπ‘–βˆˆπΌ,  π‘’βˆ—π‘–=sat[𝑒𝑖(π‘˜)], where the saturation function is defined by allπ‘–βˆˆπΌ,π‘˜=0,1,2,…;

Proof. It follows from Theorem 3.1 and the fact that, from (3.7), propositions (a)-(b) in Corollary 3.2 imply that sat𝑔(π‘₯)ξ€»β‰‘βŽ§βŽͺ⎨βŽͺβŽ©π‘€sgn𝑔(π‘₯)ξ€»;𝑖𝑓|||𝑔(π‘₯)|||>𝑀,𝑔(π‘₯);π‘œπ‘‘β„Žπ‘’π‘Ÿπ‘€π‘–π‘ π‘’.(3.8) belongs to the boundary of its definition domain (and vice versa), and the fact that the optimal Hamiltonian function associated with the loss function (3.1) is a strictly convex function of the equivalent input sequence 𝑒(β‹…)(β‹…).

Remark 3.3. (3.1.1) Note that the equivalent and the true input sequences are obtained in a centralized way because of the coupling between the different 𝜈(β‹…)(β‹…) through the states of the remaining 𝑆𝑖
(3.1.2) Also, note that (2.7) can be rewritten as𝑆𝑗,𝑖,π‘—βˆˆπΌ. where π‘₯(π‘˜+1)=𝐴π‘₯(π‘˜)+𝑝𝑗=1𝐻𝑗(π‘˜)𝑒𝑗(π‘˜);π‘˜=0,1,2,…,(3.9) By applying optimization techniques to (3.9), one could obtain, at first, an optimal 𝐻𝑖(π‘˜)≑𝐡𝑖π‘₯(π‘˜);allπ‘–βˆˆπΌ;π‘˜=0,1,2,….(3.10)-sequence. But this would be nonsense because the Riccati-matrix sequence would be dependent on the future measurements of the state vector.
(3.1.3) Due to the structure of the 𝑒(β‹…)(β‹…)-vectors which are state-dependent (see (A.1)) and so unknown in advance, both the optimal equivalent and the true inputs cannot be determined through Riccati-matrix sequence from the results in Theorem 3.1 and Corollary 3.2.
Suboptimal schemes are now presented which are alternative to the optimal one of Theorem 3.1.

3.2. Centralized Suboptimal Schemes
3.2.1. Centralized Suboptimal Modified Scheme 1

It consists of the following variations on the optimal scheme from Theorem 3.1 and Corollary 3.2:

(i)to use finite (sufficient small for the problem coherence) time-sliding optimization horizons. Namely, the time horizon for optimization is one-step advanced when the input is determined at each step. Only the first input associated with each optimization horizon is in fact applied;(ii)to approximate the 𝑐(β‹…)-vectors by their values by using each current value at the first sample of each optimization horizon; that is, 𝐢(β‹…)(β‹…), 𝐢(𝑑)=𝐢(π‘˜)

3.2.2. Centralized Suboptimal Modified Scheme 2

It is based on the application of (2.23) and (A.2) in Appendix A, by implementing the optimization scheme while neglecting the dependence of allπ‘˜β‰€π‘‘β‰€π‘˜+𝑁;π‘˜=0,1,2,…. on the πœ‰(β‹…)-optimal equivalent input sequence. This leads to a suboptimal Riccati-matrix sequence, which does not depend on the state vector, and thus, implementable. The presence of the deterministic disturbance 𝜈(3)(β‹…)(β‹…) obliges to use an additional vector (denoted by πœ‰(β‹…)) in the optimization procedure in Theorem 3.1 in order to maintain a Riccati-type solution. The costate associated with the Hamiltonian of the loss function (3.1) verifies the so-called β€œmodified Riccati transformation” [26]; namely, 𝜌(β‹…) Thus, by taking into account that now πœ‚(𝑑)(costate)≑𝑃(𝑑)𝑓(𝑑)βˆ’πœŒ(𝑑). one finds out the following result instead of Theorem 3.1. See Appendix B for particular mathematical details.

Theorem 3.4. (optimization of the equivalent inputs for the system representation including a deterministic disturbance). Under the same assumptions as in Theorem 3.1, it follows that the optimal equivalent input sequence for the equivalent feed-forward linear representation involving a deterministic disturbance with respect to the loss function (3.1) is given by 𝑓(π‘˜)≑𝐴π‘₯(π‘˜βˆ’1)+πœ‰(π‘˜βˆ’1), where the optimal equivalent redefined input sequence is πœˆβˆ—π‘—π‘–(π‘˜)=𝜈(3)βˆ—π‘—π‘–(π‘˜+1)βˆ’ξ‚ƒπ‘’π‘‡π‘—π‘–π‘„(π‘˜+1)π‘’π‘—π‘–ξ‚„βˆ’1𝑒𝑇𝑗𝑖𝑄(π‘˜+1)βˆ’π‘’π‘‡π‘—π‘–π‘…(3)βˆ’1(π‘˜+1)π‘Š(2)𝑇(π‘˜+1)𝐴π‘₯(π‘˜)+πœ‰(π‘˜)ξ‚„;π‘Žπ‘™π‘™π‘–βˆˆπΌ;π‘˜=0,1,2,…,(3.11) The recursive Riccati-matrix sequence is now given by πœˆβˆ—(3)𝑗𝑖(π‘˜)=𝑒𝑇𝑗𝑖𝑅(3)(π‘˜)+𝐢(3)𝑇(π‘˜)𝑃(π‘˜+1)𝐢(3)(π‘˜)ξ‚„βˆ’1𝐢(3)𝑇(π‘˜)Γ—ξ‚ƒπœŒ(π‘˜+1)βˆ’π‘ƒ(π‘˜+1)𝐴(3)(π‘˜)𝐴π‘₯(π‘˜βˆ’1)+πœ‰(π‘˜βˆ’1)ξ‚„+πœ‰(π‘˜);π‘Žπ‘™π‘™π‘–βˆˆπΌ;π‘˜=1,2,3,….(3.12)

Proof. It follows immediately from applying the three transformations of variables of Appendix B to (2.23) and (A.2), and from the fact that 𝑃(π‘˜)=𝑄(3)(π‘˜)+𝐴(3)(π‘˜)Γ—{𝑃(π‘˜+1)βˆ’π‘ƒ(π‘˜+1)𝐢(3)(π‘˜)𝑅(3)(π‘˜)+𝐢(3)𝑇(π‘˜)𝑃(π‘˜+1)𝐢(3)(π‘˜)ξ‚„βˆ’1𝐢(3)(π‘˜)𝑃(π‘˜+1)}𝐴(3)(π‘˜);𝑃(𝑁)=𝑄(3)(𝑁);π‘˜=0,1,2,…,𝑁,(3.13)𝜌(π‘˜+1)={𝐼+𝑄(3)(π‘˜)βˆ’π‘ƒ(π‘˜)𝐴(3)βˆ’1(π‘˜)𝑅(3)βˆ’1(π‘˜)𝐢(3)(π‘˜)𝐢(3)𝑇(π‘˜)𝐴(3)βˆ’π‘‡(π‘˜)}βˆ’π΄(3)βˆ’π‘‡(π‘˜){𝜌(π‘˜)+𝑃(π‘˜)βˆ’π‘„(3)(π‘˜)𝐴(3)βˆ’1(π‘˜)πœ‰(π‘˜)};𝜌(𝑁)=0;π‘˜=0,1,2,…,𝑁.(3.14) or equivalently 𝑓(π‘˜)≑𝐴π‘₯(π‘˜βˆ’1)+πœ‰(π‘˜βˆ’1), and the fact that the 𝑓(π‘˜+1)=𝐴(3)(π‘˜)𝑓(π‘˜)+𝐢(3)(π‘˜)𝜈(3)(π‘˜)+πœ‰(π‘˜);π‘˜=0,1,2,…,(3.15) vectors are now 𝐢(3)(β‹…)

Remark 3.5. (3.2.1) Note that Corollary 3.2 also applies here to obtain the optimal system input sequence under constraints (see (3.7) and (3.8)).
(3.2.2) The 𝐢(3)(π‘˜)=𝐢(3)≑𝐴𝑒𝑗1+𝑙1βˆ’1,𝐴𝑒𝑗2+𝑙2βˆ’1,…,𝐴𝑒𝑗𝑝+π‘™π‘βˆ’1ξ‚„withπ‘˜=0,1,2,3,….(3.16)-vector (related to the deterministic disturbance in the modified Riccati transformation is dependent on the 𝜌(β‹…)-sequence. To solve such a dependence, the 𝜈(3)(β‹…)-vector of (3.14) can be decomposed into 𝜌(π‘˜+1) being 𝜌(π‘˜+1)=π‘š(π‘˜)𝜌(π‘˜)+𝑀(π‘˜)πœ‰(π‘˜);π‘˜=0,1,2,…,(3.17), 𝑀(β‹…) matrices defined by𝑀(β‹…)βˆˆπ‘…(𝑛+π‘Ž)π‘₯(𝑛+π‘Ž) Thus, the following result yields.

Corollary 3.6 (useful implementability result for obtaining the optimal equivalent input). If the same assumptions from Theorem 3.4 are fulfilled, then the optimal equivalent input for the system interpretation including a deterministic disturbance with respect to the loss function (3.1) can be rewritten as follows (see (3.11)): 𝑀(π‘˜)=𝐼+𝑄(3)(π‘˜)βˆ’π‘ƒ(π‘˜)𝐴(3)βˆ’1(π‘˜)𝑅(3)βˆ’1(π‘˜)𝐢(3)(π‘˜)𝐢(3)𝑇(π‘˜)𝐴(3)βˆ’π‘‡(π‘˜)ξ‚„βˆ’1;π‘˜=0,1,2,3,…,𝑀(π‘˜)=𝐼+𝑄(3)(π‘˜)βˆ’π‘ƒ(π‘˜)𝐴(3)βˆ’1(π‘˜)𝑅(3)βˆ’1(π‘˜)𝐢(3)(π‘˜)𝐢(3)𝑇(π‘˜)𝐴(3)βˆ’π‘‡(π‘˜)ξ‚„βˆ’1×𝐴(3)βˆ’π‘‡(π‘˜)𝑃(π‘˜)βˆ’π‘„(3)(π‘˜)𝐴(3)βˆ’1(π‘˜);π‘˜=0,1,2,3,….(3.18)where the matrix πœˆβˆ—π‘—π‘–(π‘˜)={1+𝑒𝑇𝑗𝑖{𝑅(3)(π‘˜+1)+𝐢(3)𝑇(π‘˜+1)𝑃(π‘˜+2)𝐢(3)(π‘˜+1)ξ‚„βˆ’1𝐢(3)𝑇(π‘˜+1)𝑃(π‘˜+2)𝐴(3)(π‘˜+1)+𝑅(3)βˆ’1(π‘˜+1)π‘Š(2)(π‘˜+1)𝐹(π‘˜+1)+𝑒𝑇𝑗𝑖𝑄(π‘˜+1)π‘’π‘—π‘–ξ‚„βˆ’1}𝑒𝑗𝑖𝑄(π‘˜+1)𝐹(π‘˜+1)}βˆ’1Γ—{𝑒𝑇𝑗𝑖𝑅(3)(π‘˜+1)+𝐢(3)𝑇(π‘˜+1)𝑃(π‘˜+2)𝐢(3)(π‘˜+1)ξ‚„βˆ’1𝐢(3)𝑇(π‘˜+1)×𝑀(π‘˜+1)βˆ’π‘ƒ(π‘˜+2)ξ‚„πœ‰(π‘˜+1)+𝑀(π‘˜+1)𝜌(π‘˜+1)βˆ’π‘ƒ(π‘˜+2)𝐴(3)(π‘˜+1)𝐴π‘₯(π‘˜)ξ‚‡βˆ’ξ‚ƒπ‘’π‘‡π‘—π‘–π‘„(π‘˜+1)π‘’π‘—π‘–ξ‚„βˆ’1𝑒𝑇𝑗𝑖𝑄(π‘˜+1)𝐴π‘₯(π‘˜)βˆ’π‘’π‘‡π‘—π‘–π‘…(3)βˆ’1(π‘˜+1)π‘Š(2)𝑇(π‘˜)𝐴π‘₯(π‘˜)};allπ‘–βˆˆπΌ;π‘˜=0,1,2,3,…,(3.19) is partitioned as 𝐹(β‹…)βˆˆπ‘…(𝑛+π‘Ž)×𝑝 So that each vector 𝐹(π‘˜)≑𝐹1(π‘˜),𝐹2(π‘˜),…,𝐹𝑝(π‘˜)ξ‚„;π‘˜=0,1,2,3,….(3.20)𝐹𝑖(π‘˜)βˆˆπ‘…π‘›+π‘Ž

Proof (outline). Equation (3.19) can be obtained after direct calculations if (3.17) and (3.18) are substituted into (3.11) while taking Remark 3.5 into account.

3.3. Considerations about Implementation

The following remarks must be pointed out.

(1)The system implementation scheme, which has been reported in the paper, has appeared to be suboptimal twice. First of all, the optimization scheme is dependent on the state/input vectors (some estimates must be made) according to the two alternative interpretations of the feed-forward linear system (2.13). This fact is due to the nature of the decomposition methods which have been applied and also due to the over determination problem one must deal with when the system inputs are generated from the associate equivalent ones. The performance degradation (i.e., the optimality losses being inherent to the applied suboptimization procedures) could be studied through direct calculations. However, the hypotheses that have been taken into account allow the system implementation.(2)In summary, the steps that the designer ought to follow in the implementation environment are as follows:(a)to set up a finite-time sliding optimization horizon, 𝐹𝑖(π‘˜)≑𝑝𝑖=1𝑛𝑖+π‘Žπ‘–ξ“π‘—=1,𝑗≠𝑖{π‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗𝑖+[π‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)]𝛿𝑗𝑖}βˆ’1Γ—{π‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗𝑖+[π‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)]𝛿𝑗𝑖};allπ‘–βˆˆπΌ,π‘˜=0,1,2,….(3.21), [π‘˜,π‘˜+𝑁], integers,(b)to test the system controllability according to Theorem 2.6 and/or Corollary3.2,(c)to estimate the state vector (suboptimal modified scheme 1 of Section 3.2.1) and the equivalent deterministic disturbance (suboptimal modified scheme 2 of Section 3.2.2) on the optimization horizon 1≀𝑁<∞ To do that, the estimated system is chosen from (3.2) as [π‘˜,π‘˜+𝑁]. and the equivalent input estimates become Μ‚β€Œπ‘“π‘˜+1=𝐴(3)1,π‘˜+1Μ‚β€Œπ‘“π‘˜+𝐢(3)1,π‘˜πœˆ(3)π‘˜βˆ’1,Μ‚β€Œπ‘“π‘˜+𝑗=𝐴(3)1,π‘˜+π‘—Μ‚β€Œπ‘“π‘˜+π‘—βˆ’1+𝐢(3)π‘˜+π‘—βˆ’1𝜈(3)π‘˜+π‘—βˆ’1;π‘—βˆˆξ€Ί2,𝑁,(3.22)(d)to compute the Riccati matrix from (3.6) or (3.13), respectively, and the system matrices from (B.6) and (B.8) of Appendix B, or (B.8) and (3.16), respectively.(e)to generate the suboptimal input sequence Μ‚πœˆπ‘˜(3)ξ€Ίπ‘˜ξ€»=Μ‚πœˆπ‘˜βˆ’1(3)ξ€Ίπ‘˜βˆ’1ξ€»=𝜈(3)π‘˜βˆ’1,(3.23)Μ‚πœˆπ‘˜+𝑗(3)ξ€Ίπ‘˜ξ€»=Μ‚πœˆπ‘˜+𝑗(3)ξ€Ίπ‘˜βˆ’1ξ€»;allintegerπ‘˜β‰ π‘—,π‘—βˆˆξ€Ί1,𝑁,(3.24)Μ‚πœˆπ‘˜+𝑗(3)ξ€Ίπ‘˜ξ€»=0;ifπ‘˜=𝑗,(3.25) from (3.4) or (3.11), respectively.(f)to obtain the suboptimal input sequence πœˆβˆ—π‘—π‘–(π‘˜),alli∈𝐼,π‘˜=0,1,2,… by applying Corollary 3.2. One must have only input controls as inputs to the system; namely, p inputs.

Remark 3.7. (3.3.1) Note that in (3.22)–(3.25), the following system representation is assumed to be π‘’βˆ—π‘–(π‘˜),alli∈𝐼,π‘˜=0,1,2,3,…,𝑁, instead of (3.2), and the subscript β€œπ‘“π‘˜+1=𝐴(3)π‘˜π‘“π‘˜+𝐢(3)π‘˜πœˆ(3)π‘˜;π‘˜=0,1,2,…(3.26)” in the matrices is related to the suboptimal modified scheme 1. Similar expressions can be derived for the alternative interpretation scheme (3.15) of Section 3.2.2.
(3.3.2) Also, note that such β€œfictitious” states must be estimated in order to later reupdate the input sequence. In fact, only the 1 equivalent input is applied in each optimization horizon.
(3.3.3) The usual asymptotic stability tests via Lyapunov's theorem (which is applied to linear time-invariant optimal systems with respect to quadratic criteria) are not applicable here because the system under study is neither invariant nor linear. Besides, standard stability proofs for optimal regulators cannot be applied to the optimal scheme, which has been obtained because of its nonimplementability. The suboptimal modified schemes can lead to instability (or at least stability can not be proved) when 𝜈(3)(π‘˜βˆ’1) . But this fact does not affect the system stability in the current context of this paper since a finite-time sliding for the optimization horizon is used.
In particular, these circumstances make the implementation of a decentralized design quite difficult because some links of the equivalent feed-forward system (2.13) are cut off. However, stability can be ensured if π‘β†’βˆž is a Hurwitz's matrix and 𝐴 is a bounded sequence, if a saturation type rule is applied when computing the optimal redefined equivalent inputs 𝑒(β‹…) in the various schemes.

4. Numerical Simulation

4.1. Example 1

This first example implemented deals with the following ninth-order discrete model: 𝜈(3)(β‹…)(β‹…)𝑆1∢π‘₯1(π‘˜+1)≑𝐴1(𝑇)+𝑒1(π‘˜)𝐡1(𝑇)ξ‚„π‘₯1(π‘˜)+𝐷1(𝑇)𝑧1(π‘˜);𝑒1βˆˆπ‘…,𝑦1(π‘˜)≑π‘₯1(π‘˜);π‘₯1(β‹…)βˆˆπ‘…2π‘₯1,𝐻1βˆΆπ‘§1(π‘˜+1)≑𝑀1(𝑇)+𝑒1(π‘˜)𝑃1(𝑇)𝑧1(π‘˜)+𝐿11(𝑇)π‘₯1(π‘˜);𝑧1(β‹…)βˆˆπ‘…2π‘₯1,𝐻2βˆΆπ‘§2(π‘˜+1)≑𝑀2(𝑇)𝑧2(π‘˜)+𝐿21(𝑇)π‘₯1(π‘˜);𝑧2(β‹…)βˆˆπ‘…3π‘₯1,𝐻3βˆΆπ‘§3(π‘˜+1)≑𝑀3(𝑇)𝑧3(π‘˜)+𝐿31(𝑇)π‘₯1(π‘˜);𝑧3(β‹…)βˆˆπ‘…2π‘₯1,(4.1) Table 1 displays the entries of the constant discrete matrices related to the composite homogeneous bilinear structures allπ‘˜β‰₯0,π‘˜βˆˆπ‘§., 𝑆1, 𝐻𝑖, in (4.1). The simulations have been performed with the centralized suboptimal modified scheme 1, including free control type (namely, without saturation) only according to Corollary 3.2.

The steady state of the signal trajectory under no control leads to the following parameters: 𝐿21β‰‘βŽ‘βŽ’βŽ£00(βˆ’6.9022+44.5212𝑇)𝑇0⎀βŽ₯⎦, π‘¦βˆ—1,st=21.8,β€‰π‘¦βˆ—2,st=238.63,  π‘’βˆ—st=400 samples. The system performance is studied as a function of the optimization horizon, π‘βˆ—st=57 Also, the plant settings are: Initial conditions, 𝑁., π‘₯1(0)=[20.0,250.0]𝑇, 𝑧1(0)=[0.0,0.0]𝑇, and 𝑧2(0)=[0.0,0.0,0.0]𝑇; sampling period 𝑧3(0)=[0.0,0.0]𝑇 seconds and working horizon 𝑇=0.03 samples.

The obtained results are shown in Table 2 where the deviations (percent) are related to the normal operation signals (i.e., WH=150, π‘¦βˆ—1,st and π‘¦βˆ—2,st) and the CCT is the average time the computer needs to perform a computation iteration of the optimization horizon.

Exhaustive simulations about variations of the optimization horizon have shown that the discrete control improves the system response from a better transient characteristic point of view. However, there exist deviations of the steady-state signal tracking related to the no-control operation mode signals. Also, the experiments become time-consuming and memory-storage expensive as the optimization horizon does increase. For instance, some of the simulated examples have been plotted in Figure 2, consisting of the two components of the plant output and the waveform of the control input to the system (see Table 2 for the definitions of the involved signals related to the optimization horizon sizes).

4.2. Example 2

A discrete bilinear system which corresponds to the discretization of a bilinear continuous one for small sampling period and zero-order hold (ZOH) is given as defined in Section 2.2, by βˆ’ where π‘₯(π‘˜+1)=𝐴(𝑇)+𝑒(π‘˜)𝑁(𝑇)ξ‚„π‘₯(π‘˜)+𝐸(𝑇)𝑒(π‘˜)+𝑀(π‘˜),𝑦(π‘˜)=𝐢π‘₯(π‘˜),(4.2), π‘₯(π‘˜)≑[π‘₯𝑇1(π‘˜),π‘₯𝑇2(π‘˜),π‘₯𝑇3(π‘˜)]𝑇 and there exists a deterministic disturbance 𝑦(π‘˜)≑[𝑦𝑇1(π‘˜),0,0]𝑇 so that 𝑀(π‘˜)≑[𝑀𝑇1(π‘˜),0,0]𝑇 being 𝑀1(π‘˜)≑𝑆1(𝑇)𝑒2(π‘˜)+[𝑆2(𝑇)+𝑒(π‘˜)𝑆3(π‘˜)]π‘ž(π‘˜), the sampling period, 𝑇 denotes transpose and the sampling instant is defined as (β‹…)𝑇

In (4.2), the coefficients of the block matrices are composed of the following structures: π‘‘π‘˜+1=π‘‘π‘˜+𝑇. In particular, (4.2) and (4.3) correspond to the linearized discretization of an electromechanical system consisting of two amplidynes acting on the DC-motor shown in Figure 3, with a regulation mechanism incorporated as introduced in Section 3. This regulation scheme is not restricted to considering either the relevant control current or voltage to be constant, which emphasizes the interest of the bilinear modelling.

Figure 4 plots a typical tracking response output of the system as a function of the sampling period, for a fixed optimization horizon 𝐴(𝑇)=⎑⎒⎒⎒⎣𝐴1(𝑇)0𝑀1(𝑇)𝑃2(𝑇)𝐴2(𝑇)𝑅2(𝑇)𝑃3(𝑇)𝑅3(𝑇)𝐴3(𝑇)⎀βŽ₯βŽ₯βŽ₯⎦;𝑁(𝑇)=βŽ‘βŽ’βŽ’βŽ’βŽ£π‘1(𝑇)0𝑆2(𝑇)000000⎀βŽ₯βŽ₯βŽ₯⎦;𝐸(𝑇)=⎑⎒⎒⎒⎣𝐸1(𝑇)𝐸2(𝑇)𝐸3(𝑇)⎀βŽ₯βŽ₯βŽ₯⎦;𝐢=𝐼00ξ€».(4.3) samples and 𝑁=50 being unity in the quadratic regulation performance criterion (see (3.1)).

The control action implemented can be used as an alternative to the traditional strategies of linear control on electrical DC-machines. In general, it has been noticed that the discrete control implies better transients related to the linear approach because those transient responses are faster, that is, present lower settling times with greater shooting parameters. Besides, it has also been observed that the discrete control performance improves, in general, as the optimization horizon is increased for different experiments, with respect to that of the nominal plant when no control action is reached.

5. Conclusions

A multivariable invariant discrete-time bilinear system being composed of interconnected subsystems has been studied. An equivalent feed-forward linear system with equivalent inputs, which are derived from products state-input, has been given. Then, the system has been suboptimized with respect to a quadratic finite-time optimization horizon in order to drive each subsystem from any arbitrary initial point to a predefined final state.

The suboptimization has been made by neglecting either the time-dependence of the control vectors on the state vector (modified suboptimal scheme 1) or the dependence of a deterministic disturbance vector on the equivalent input sequence (modified suboptimal scheme 2). These approaches have effect only on the system implementability rather than on its stability. Besides, stability proofs lead to drawbacks when the optimization horizon is infinite because of the suboptimal real implementation. This is also translated into drawbacks when implementing decentralized expected results. The proposed suboptimal schemes have been proven by means of realistic examples.

Appendices

A. Vector Variation for the Two Alternative Interpretation Schemes of the Feed-Forward Linear Systems

Section 2.4 deals with two alternative representations of the feed-forward linear system (2.13). This appendix is devoted to the derivation of the vectors 𝑄 and 𝐢(β‹…)(β‹…) involved in this section.

Substituting (2.18) into (2.13) and grouping terms, one obtains

πœ‰(β‹…) or alternatively 𝑐𝑗+π‘™π‘–βˆ’1(π‘˜)≑𝑒𝑗𝑖+π‘™π‘–βˆ’1+𝑛𝑖+π‘Žπ‘–βˆ‘π‘—=1/𝑗≠𝑗𝑖{{𝑏𝑇𝑗𝑖(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗𝑖+[π‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+π‘βˆ‘π‘™=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)]𝛿𝑗𝑖}βˆ’1Γ—{𝑏𝑇𝑗(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗𝑖+[π‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+π‘βˆ‘π‘™=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)]𝛿𝑗𝑖}𝑒𝑗+π‘™π‘–βˆ’1},(A.1) where πœ‰(π‘˜)β‰‘βŽ§βŽͺβŽͺβŽͺβŽͺ⎨βŽͺβŽͺβŽͺβŽͺβŽ©π‘ξ“π‘–=1𝑛𝑖+π‘Žπ‘–ξ“π‘—=1/𝑗≠𝑗𝑖{𝑏𝑇𝑗𝑖(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗𝑖+[π‘π‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—π‘–βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)]𝛿𝑗𝑖}βˆ’1Γ—{𝑏𝑇𝑗(𝑖)π‘₯𝑖(π‘˜)𝛿𝑗𝑖+[π‘π‘‡π‘—βˆ’π‘›π‘–(𝑖)𝑧𝑖(π‘˜)+𝑝𝑙=1π‘šπ‘‡π‘—βˆ’π‘›π‘–(𝑖,𝑙)𝑦1(π‘˜)]𝛿𝑗𝑖}πœˆπ‘—π‘–(π‘˜)𝑒𝑗+π‘™π‘–βˆ’1};ifatleastanadmissibleπ‘—π‘–βˆˆπ‘π‘–existsatthecurrentsamplinginstant,0;otherwise,(A.2)

B. Derivation of the Optimization Equations

In the sequel, in order not to repeat tedious notation, the 𝛿𝑗𝑖≑1βˆ’π›Ώπ‘—π‘–,with𝛿𝑗𝑖=1,ifπ‘—π‘–βˆˆπ‘π‘–,and𝛿𝑗𝑖=0,ifπ‘—π‘–βˆˆπ‘+𝑖,allπ‘–βˆˆπΌ,π‘˜=0,1,2,….-index (related to the system inputs and associate equations of the feed-forward linear system 2.13)) will be denoted by the subscript 𝑗𝑖 Also, three modified auxiliary inputs 𝑖,π‘–βˆˆπΌ., 𝜈(1)(π‘˜) and 𝜈(2)(π‘˜), are calculated from 𝜈(3)(π‘˜) and introduced due to the bilinear terms.

B.1. First Transformation

Let us define new variables as 𝜈(π‘˜), with 𝐢(1)𝑗(π‘˜)≑𝐴𝐢𝑗(π‘˜βˆ’1);𝜈(1)𝑗(π‘˜)β‰‘πœˆπ‘—(π‘˜βˆ’1),𝑓(π‘˜)≑𝐴π‘₯(π‘˜βˆ’1)=π‘₯(π‘˜)βˆ’π‘ξ“π‘–=1𝐢𝑖(π‘˜βˆ’1)πœˆπ‘–(π‘˜βˆ’1)(B.1) and π‘—βˆˆπΌ.

Substituting (B.1) into (2.22), one obtains π‘˜β‰₯1 Also, the loss function (3.1) can be equivalently rewritten as 𝑓(π‘˜+1)=𝐴𝑓(π‘˜)+𝑝𝑖=1𝐢(1)𝑖(π‘˜)𝜈(1)𝑖(π‘˜);π‘˜=0,1,2,3,….(B.2) where 𝐽𝑁=12{π‘ξ“π‘˜=0𝑓𝑇(π‘˜)𝑄(π‘˜)βˆ’π‘ξ“π‘–=1π‘Ÿβˆ’1𝑖(π‘˜)β„Žπ‘–(π‘˜)β„Žπ‘‡π‘–(π‘˜)𝑓(π‘˜)+𝑝𝑖=1{π‘Ÿπ‘–(π‘˜)ξ‚ƒπœˆ(1)𝑖(π‘˜)+π‘Ÿβˆ’1𝑖(π‘˜)β„Žπ‘‡π‘–(π‘˜)𝑓(π‘˜)ξ‚„2+2𝑝𝑗=1/𝑗>𝑖𝐢(1)𝑇𝑖(π‘˜)π΄βˆ’π‘‡β„Žπ‘—(π‘˜)𝜈(1)𝑖(π‘˜)𝜈(1)𝑗(π‘˜)}};𝑁<∞,(B.3)π‘Ÿπ‘—(π‘˜)≑𝐢(1)𝑇𝑗(π‘˜)π΄βˆ’π‘‡π‘„(π‘˜)π΄βˆ’1𝐢(1)𝑗(π‘˜)=𝐢𝑇𝑗(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1),β„Žπ‘—(π‘˜)≑𝑄(π‘˜)π΄βˆ’1𝐢(1)𝑗(π‘˜)=𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1),(B.4).

B.2. Second Transformation

The following new variables are defined as follows: allπ‘—βˆˆπΌ;π‘˜=1,2,3,…𝐢(2)𝑗(π‘˜)≑𝐢(1)𝑗(π‘˜),𝜈(2)𝑗(π‘˜)β‰‘πœˆ(1)𝑗(π‘˜)+π‘Ÿβˆ’1𝑗(π‘˜)β„Žπ‘‡π‘—(π‘˜)𝑓(π‘˜)=πœˆπ‘—(π‘˜βˆ’1)+π‘₯𝑇(π‘˜βˆ’1)𝐴𝑇𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1)𝐢𝑇𝑗(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1)ξ‚„βˆ’1,𝐴(2)(π‘˜)β‰‘π΄βˆ’π‘ξ“π‘–=1π‘Ÿβˆ’1𝑖(π‘˜)𝐢(1)𝑖(π‘˜)β„Žπ‘‡π‘–(π‘˜)=π΄βˆ’π‘ξ“π‘–=1𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑖(π‘˜βˆ’1)ξ‚„βˆ’1𝐴𝐢𝑖(π‘˜βˆ’1)𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜),(B.5).

From (B.5), one has equivalently to (B.3);allπ‘—βˆˆπΌ;π‘˜=1,2,3,… where 𝑓(π‘˜+1)=𝐴(2)(π‘˜)𝑓(π‘˜)+𝐢(2)(π‘˜)𝜈(2)(π‘˜);π‘˜=0,1,2,3,…,(B.6) Also, taking into account the mentioned (B.5), (B.3) becomes as follows: 𝐢(2)(β‹…)≑[𝐢(2)1(β‹…),𝐢(2)2(β‹…),…,𝐢(2)𝑝(β‹…)]βˆˆπ‘…(𝑛+π‘Ž)π‘₯(𝑛+π‘Ž),𝜈(2)(β‹…)≑[𝜈(2)1(β‹…),𝜈(2)2(β‹…),…,𝜈(2)𝑝(β‹…)]βˆˆπ‘…π‘.(B.7) with 𝐽𝑁=12π‘βˆ‘π‘˜=0{𝑓𝑇(π‘˜)𝑄(2)(π‘˜)𝑓(π‘˜)+π‘βˆ‘π‘–=1ξ‚ƒπ‘Ÿπ‘–(π‘˜)𝜈(2)2𝑖(π‘˜)+2π‘βˆ‘π‘—=1/𝑗>𝑖𝑝𝑖𝑗(π‘˜)𝜈2𝑖(π‘˜)𝜈2𝑗(π‘˜)ξ‚„βˆ’2π‘βˆ‘π‘–=1π‘βˆ‘π‘—=1/𝑗>𝑖𝑓𝑇(π‘˜)𝑝𝑖𝑗(π‘˜)𝑑𝑇𝑗(π‘˜)𝜈2𝑖(π‘˜)+𝑑𝑇𝑖(π‘˜)𝜈2𝑗(π‘˜)ξ‚„};𝑁<∞(B.8)𝑝𝑖𝑗(π‘˜)=𝑝𝑗𝑖(π‘˜)≑𝐢(2)π‘‡π‘–π΄βˆ’π‘‡β„Žπ‘—(π‘˜)=𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1);𝑗≠𝑖,𝑑𝑖(π‘˜)β‰‘π‘Ÿβˆ’1𝑖(π‘˜)β„Žπ‘‡π‘–(π‘˜)=𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑖(π‘˜βˆ’1)ξ‚„βˆ’1𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜),(B.9)

Equation (B.8) can be compactly rewritten as forall𝑖,π‘—βˆˆπΌ;π‘˜=0,1,2,…. where the matrices involved are 𝐽𝑛=12π‘ξ“π‘˜=0𝑓𝑇(π‘˜)𝑄(2)(π‘˜)𝑓(π‘˜)+𝜈(2)𝑇(π‘˜)𝑅(2)(π‘˜)𝜈(2)(π‘˜)+2𝑓𝑇(π‘˜)π‘Š(2)(π‘˜)𝜈(2)(π‘˜)ξ‚„;𝑁<∞,(B.10) with 𝑄(2)(π‘˜)≑𝑄(π‘˜)βˆ’π‘ξ“π‘–=1{𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑖(π‘˜βˆ’1)ξ‚„βˆ’1𝑄(π‘˜)𝐢𝑖(π‘˜βˆ’1)𝐢𝑇𝑖(π‘˜βˆ’1)𝑄(π‘˜)βˆ’2𝑝𝑗=1/𝑗>𝑖{𝐢𝑇𝑗(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑖(π‘˜βˆ’1)ξ‚„βˆ’1𝐢𝑇𝑗(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1)ξ‚„βˆ’1×𝐢𝑖(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑗(π‘˜βˆ’1)𝑄(π‘˜)𝐢𝑖(π‘˜βˆ’1)𝐢𝑇𝑗(π‘˜βˆ’1)𝑄(π‘˜)}}(B.11)𝑅(2)(π‘˜)=𝑅(2)𝑖𝑗(π‘˜)ξ€»;𝑅(2)𝑖𝑗(π‘˜)=𝑅(2)𝑗𝑖(π‘˜)β‰‘βŽ§βŽͺ⎨βŽͺβŽ©π‘Ÿπ‘–(π‘˜),if𝑗=𝑖,𝑝𝑖𝑗(π‘˜),if𝑗≠𝑖,(B.12)π‘Š(2)(π‘˜)β‰‘π‘Š1(π‘˜)π‘Š2(π‘˜)(B.13)π‘Š1(π‘˜)≑𝑑𝑇1(π‘˜),𝑑𝑇2(π‘˜),…,𝑑𝑇𝑝(π‘˜)ξ‚„,π‘Š2(π‘˜)β‰‘ξ‚ƒπ‘Šπ‘–π‘—(π‘˜)ξ‚„;π‘Šπ‘–π‘—(π‘˜)=π‘Šπ‘—π‘–(π‘˜)β‰‘βŽ§βŽͺ⎨βŽͺ⎩0if𝑗=π‘–βˆ’π‘π‘–π‘—(π‘˜),if𝑗≠𝑖(B.14), and π‘Ÿπ‘–(β‹…),𝑝𝑖𝑗(β‹…), in (B.12) to (B.14); 𝑑𝑖(β‹…);𝑗≠𝑖, are given, respectively, in (B.4) and (B.9).

Note that the factorization of matrix 𝑖,π‘—βˆˆπΌin (B.13) is possible from (B.8).

B.3. Third Transformation

Although it is not necessary, this transformation of variables provides immediately a loss function (redefinition of (3.1)), which includes weighting terms associated with both the state vector and the transformed input. Let us define them as follows: π‘Š(2)(β‹…) Substitution of (B.15) and (B.16) into (B.6) yields directly (3.2). Also, substituting (B.15) into (B.10), one obtains (3.3) if the weighting matrices are defined by 𝐢(3)(π‘˜)≑𝐢(2)(π‘˜),𝜈(3)(π‘˜)β‰‘πœˆ(2)(π‘˜)+𝑅(2)βˆ’1(π‘˜)π‘Š(2)𝑇(π‘˜)𝑓(π‘˜),(B.15)𝐴(3)(π‘˜)≑𝐴(2)(π‘˜)βˆ’πΆ(2)(π‘˜)𝑅(2)βˆ’1(π‘˜)π‘Š(2)𝑇(π‘˜)βˆ€π‘˜=0,1,2,3,….(B.16) with 𝑄(3)(π‘˜)≑𝑄(2)(π‘˜)βˆ’π‘Š(2)(π‘˜)𝑅(2)βˆ’1(π‘˜)π‘Š(2)𝑇(π‘˜),𝑅(3)(π‘˜)≑𝑅(2)(π‘˜)(B.17) and 𝑄(3)(β‹…)β‰₯0

Acknowledgments

The authors are very grateful to the Spanish MEC for supporting this work through the research Projects DPI2006-00714 and DPI2006-01677. They are also grateful to the UPV/EHU and the Basque Government MEC for its partial support through Projects EHU 06/88 and S-PE06UN10, respectively. They would like also to thank the comments and suggestions of the anonymous reviewers who have helped to improve the previous version of this manuscript.