Abstract

This paper considers matrix inequality procedures to address the robust fault detection and isolation (FDI) problem for linear time-invariant systems subject to disturbances, faults, and polytopic or norm-bounded uncertainties. We propose a design procedure for an FDI filter that aims to minimize a weighted combination of the sensitivity of the residual signal to disturbances and modeling errors, and the deviation of the faults to residual dynamics from a fault to residual reference model, using the -norm as a measure. A key step in our procedure is the design of an optimal fault reference model. We show that the optimal design requires the solution of a quadratic matrix inequality (QMI) optimization problem. Since the solution of the optimal problem is intractable, we propose a linearization technique to derive a numerically tractable suboptimal design procedure that requires the solution of a linear matrix inequality (LMI) optimization. A jet engine example is employed to demonstrate the effectiveness of the proposed approach.

1. Introduction

In the past decade, great attention has been devoted to the design of model-based fault detection systems and their robustness [1, 2]. With the rapid development of robust control theory and optimization control techniques, more and more methods have been presented to solve the robust FDI problem. The -filter is designed such that the -norm of the estimation error is minimized (see [35] and the references therein). Some of the approaches used for this problem include frequency domain approaches [6], left and right eigenvector assignment [7], structure parity equation [8], and an unknown input observer with disturbances decoupled in the state estimation error [9]. Recently developed LMI approaches offer numerically attractive techniques [1012].

A reference residual model is an ideal solution for robust FDI under the assumption that there are no disturbances or model uncertainty. The idea is to design a filter for the uncertain system that approximates the solution given by the reference model [13]. In [14], a new performance index is proposed using such a reference residual model. Frisk and Nielsen [15] give an algorithm to design a reference model and a robust FDI filter that fits into the framework of standard robust -filtering relying on established and efficient methods. However, their framework consists in solving two optimization problems successively, which results in a suboptimal solution.

In this paper, we propose a performance index that captures the requirements of fault isolation and disturbance rejection as well as the design of the optimal reference model. The fault isolation performance is measured by the size of the deviation of the fault to residual dynamics from the reference dynamics model, while the disturbance rejection performance is measured by the size of the input to residual and disturbance to residual dynamics. In all cases, the norm is used as a measure. The design of the optimal reference model is incorporated in the robust FDI framework. We consider systems subject to norm-bounded or polytopic uncertainties. For systems described by polytopic and unstructured norm-bounded uncertainties, we derive an optimal FDI filter obtained as the solution of a QMI optimization. For systems described by structured uncertainties, we derive a suboptimal QMI-based solution. Since the solution of a QMI optimization is, in general, intractable, we propose a linearization technique to derive a suboptimal design procedure that requires the solution of a numerically tractable LMI optimization. This note extends our work in [16] by proposing algorithms for the design of suboptimal reference dynamics.

The structure of the work is as follows. After defining the notation, we review filter-based FDI techniques for residual signal generation and give the problem formulation in Section 2. Section 3 presents a matrix inequality formulation for the FDI problem, and gives the solution and the design of the optimal reference model for both norm-bounded and polytopic uncertainties in a form of QMI's. Section 4 gives a suboptimal solution in both cases through the use of an algorithm that necessitates solving LMI's. Finally, a numerical example is presented in Section 5, and Section 6 summarizes our results.

The notation we use is fairly standard. The set of real matrices is denoted by . For , we use the notation to denote transpose. For , () denotes that is positive (negative) definite, that is, all the eigenvalues of are greater (less) than zero. The identity matrix is denoted as and the null matrix is denoted as with the subscripts occasionally dropped if they can be inferred from context.

Let be the set of square integrable functions. The -norm of is defined as . A transfer matrix will be denoted as or and dependence on the variable will be suppressed. For a stable transfer matrix , we define In section 3, we use the following result.

Lemma 1 (see [17]). Let with sign controllable, , and . Then has a spectral factor (i.e., ) if and only if there exists symmetric that satisfies the following linear matrix inequality:

2. Problem Formulation

Consider a linear time-invariant dynamic system subject to disturbances, modeling errors and process, sensor and actuator faults modeled as where , , and are the process state and input and output vectors, respectively, and where and are the disturbance and fault vectors, respectively. Here, and are the component and instrument fault distribution matrices, respectively, while and are the corresponding disturbance distribution matrices [18].

We consider two types of uncertainties: norm-bounded and polytopic uncertainties. In the case of norm-bounded uncertainties, where represents the nominal model, , where and where , , , , , , and are known and constant matrices with appropriate dimensions. This linear fractional representation of uncertainty, which is assumed to be well-posed over (i.e., for all ), has great generality and is widely used in control theories.

In the case of polytopic uncertainties, where , are known constant matrices with appropriate dimensions.

A residual signal in an FDI system should represent the inconsistency between the system variables and the mathematical model. The objective is to design an FDI filter of the form where is the filter state and is the residual signal. Figure 1 illustrates this filter in the robust residual generation scheme.

By defining an augmented state the residual dynamics are given by or where , , and denote the dynamics from faults, disturbances, and inputs to the residual, respectively. Note that dependence on the uncertain data is indicated by a superscript .

Ideally, the residual signal is required to be sensitive only to faults. This corresponds to , , and . For fault isolation, it is further required that the fault signature can be deduced from the residual. This corresponds to , where is a set of transfer matrices with a special structure that depends on the nature of the faults, for example, if the faults can occur simultaneously, is the set of stable diagonal transfer matrices with nonzero diagonal entries. Unfortunately, characterizing a general structured set is intractable, and we will assume that we can define a subset such that subsequent optimizations over the structured state-space data sets , , , and are tractable. For example, if denotes the set of all stable diagonal transfer matrices with nonzero diagonal entries, we may define as the set of all diagonal matrices with negative entries, and , , and as the sets of all diagonal matrices. An example of this simplification procedure is given in Section 5 below. We also replace the requirement of nonzero diagonal entries for by a condition of the form for all .

Due to the presence of disturbances and modelling uncertainties, exact FDI is not possible. For robust FDI, we propose the following, more realistic, problem formulation.

Problem 1. Assume that the system dynamics in (4) are quadratically stable. For any , find a stable fault reference dynamics such that (to ensure the requirement of nonzero diagonal entries for ), and find a stable filter of the form given in (8), if it exists, such that the residual dynamics in (10) are quadratically stable and where for norm-bounded uncertainties and for polytopic uncertainties.

Recall that quadratic stability for the dynamics in (4) is equivalent to the existence of such that for all (see [19] for more details).

A modified version of Problem 1 uses a weighted cost function, say, of the form where , , and are constant or frequency-dependent weighting functions that can vary the emphasis between fault detection (small and ) and fault isolation (small ). In the sequel, we assume that any such weighting functions are absorbed in the system data.

Remark 1. The objective is to find the smallest for which (12) is satisfied. Indeed, by minimizing , we will ensure (which measures the disturbance rejection level), (which measures the effect of uncertainty), and (which measures the deviation of the fault to residual dynamics from the reference dynamics, and hence is a measure of fault isolation capability) to be small.

A poorly chosen reference model can result in a residual generator with poor robustness. Here, we incorporate its design into our scheme so as to improve the robustness of the FDI filter.

In some approaches, a common assumption is that and/or has full rank [20, 21]; in others, the assumption has full rank and is widely used [22, 23]. Here, we do not impose any of these assumptions. Note, however, that if does not have full column rank, for example, if , then this will have an adverse effect on the minimum values of since This would therefore limit the overall performance of the filter, which is measured by the value of .

3. Matrix Inequality Formulation

In this section, we derive a matrix inequality formulation of Problem 1. The main idea is to express (12) in terms of QMIs using the bounded real lemma and change of variables techniques, and then to derive necessary and sufficient conditions for solvability.

The dynamics in (12) can be written as It follows from the bounded real lemma that there exists a stable filter of the form given in (8) such that (12) is satisfied if and only if there exists such that and for all (see [24, Theorem 3]). We deal separately with norm-bounded and polytopic uncertainties.

3.1. Solution with Norm-Bounded Uncertainties

For norm-bounded uncertainties, we separate the terms involving modeling uncertainties from the other terms as where Using this separation, the inequality (15) can be rewritten as A calculation shows that , where The next result uses the fact that to remove explicit dependence on .

Lemma 2 (see [25]). Let be as described in (6) and define the subspaces Let , , , and be matrices with appropriate dimensions. We have and for every if there exist and such that and If is unstructured (i.e., if? ), then (22) becomes for some scalar . In this case, condition (23) is both necessary and sufficient.

When the uncertainty set is unstructured, then Lemma 2 implies that for some . Using a Schur complement argument shows that (19) is equivalent to where denotes terms readily inferred from symmetry. Next, we introduce a change of variables to linearize the above matrix inequality [26]. Assume that , that is, the filter order is equal to the order of the system plus the order of the reference model. Let where are symmetric matrices, and . Define . Then if and only if . From , we have the following calculations, where boldface letters are used to indicate optimization variables: where we have defined If and are invertible, the variables , , , , , can be replaced by the new variables , , , , , without loss of generality. We can now rewrite (19) as which is nonlinear in the variables. Note that the nonlinearities involve the terms and only. The constraint can be expressed as an LMI as follows: The constraint can be expressed as a quadratic matrix inequality using the next lemma.

Lemma 3. Let be as defined above. Then if and only if there exists such that

Proof. Let . Then if and only if . It is easy to show that can be written as follows: The result then follows from Lemma 1.

The next lemma summarizes our result so far by giving necessary and sufficient conditions for the solution of Problem 1, in the case that the uncertainty set is unstructured, in the form of a QMI feasibility problem.

Lemma 4. Assume that is unstructured. Then there exist a stable filter of the form given in (8) and a stable fault reference dynamics model , where is defined in (11), such that , residual dynamics in (10) are quadratically stable, and (12) is satisfied for if and only if there exist , , , , , , , , , , and symmetric matrices , , and such that (37), (38), and (39) are satisfied. If such variables exist, the filter dynamics are obtained by solving (36) where and are chosen such that .

Approximate solutions to these QMIs can be obtained by using algorithms with guaranteed global convergence [27, 28], as well as local numerical search algorithms that converge (without a guarantee) much faster [29, 30]. A related discussion of the solution algorithms for QMIs can also be found in [31]. In Section 4 below, we develop an alternative procedure for the approximate solution of these QMIs.

Remark 2. In the case that is preassigned to a known value, (37) becomes linear and (39) becomes irrelevant, therefore, the optimal solution is given in a form of an LMI optimization. This case has been considered in [16].

When is structured, we proceed as follows. By using (22) from Lemma 2, we get where Using a Schur complement argument and the expression of and in (20), we get

As we did for unstructured uncertainties, we use the same matrix to allow to change variables, it follows that if and only if . We multiply by and from left and right, respectively, to get if and only if Therefore, when is structured, we have the following sufficient condition for solvability.

Lemma 5. Suppose that has the structure defined in (6). Then there exist a stable filter of the form given in (8) and a stable fault reference dynamics model , where is defined in (11), such that , the residual dynamics in (10) are quadratically stable, and (12) is satisfied for if there exist , , , , , , , , , , , , and symmetric matrices , , and such that (38), (39), and (44) are satisfied.

3.2. Solution with Polytopic Uncertainties

In this section, we derive necessary and sufficient conditions for solvability of the robust FDI problem for a system subject to polytopic uncertainties, in the form of LMIs. Now, where are as defined in (17), but with superscript replaced by . We assume that the polytopic system is quadratically stable. Recall that (12) is satisfied if and only if (15) is satisfied for all . Now,

Noting that the change of variable defined in (36) is independent of , we can therefore use it in this scheme as well. Letting , it follows that where the are as defined in (27)–(33) and (34)-(35), but with the nominal model data replaced by . Therefore, for polytopic uncertainties, we can derive necessary and sufficient conditions for solvability of Problem 1 in the form of a QMI feasibility problem as follows.

Lemma 6. Let . Then there exist a stable filter of the form given in (8) and a stable fault reference dynamics model , where is defined in (11), such that , the residual dynamics in (10) are quadratically stable, and (12) is satisfied for if and only there exist , , , , , , , , , , , , and symmetric matrices , , and such that (38), (39), and are satisfied for .

4. Robust FDI Filter Design Using LMIS

In this section, we give a suboptimal solution to Problem 1. An optimal solution would necessitate the solution of a quadratic matrix inequality and is in general intractable. Here, we propose a linearization procedure to derive an upper bound on the optimal solution that requires solving an LMI optimization problem.

The following general result demonstrates that if we have one feasible solution to a QMI optimization, then we can construct an LMI optimization problem whose solution gives an upper bound on the QMI problem.

Lemma 7. Let and be given and let and . For , define Then

Proof. Let . Then by using a Schur complement argument, we get Let . Now, and it follows that That is, Using (53) and noting that and , we have A similar proof can be used to derive (51) and (52).

In order to simplify the subsequent analysis, we adopt the convention that variables appended with a subscript “” denote feasible values of the variables for the QMIs (37) and (39).

In (37), the only nonlinear terms are , , , and . We denote the matrix in (37) by , and set to be the matrix , with the nonlinear terms removed.

Let , , , and be given. We use Lemma 7 to derive an LMI formulation. We can write as where

Let denote the value of when and are replaced by and , respectively, let denote the value of when and are replaced by and , respectively, and define . Using (49) from Lemma 7, we get where

To linearize the matrix inequality in (39), we need Lemma 7 and the following lemma, whose proof is similar to that of Lemma 7 and is therefore dropped.

Lemma 8. Let and . For , define Then

Let , , and be given. Using (51) from Lemmas 7 and 8, it is easy to get where where , , . The next lemma summarizes the results of this section by giving a linearized formulation of the optimization problem defined in Lemma 4 using (60) and (64).

Lemma 9. Assume that is unstructured. Let , , , , , , and be given. Then there exist a stable filter of the form given in (8) and a stable fault reference dynamics model , where is defined in (11), such that , residual dynamics in (10) are quadratically stable, and (12) is satisfied for if there exist , , , , , , , , , , , and symmetric matrices , , and such that

Remark 3. This scheme can also be applied to Lemmas 5 and 6 to obtain a suboptimal solution involving linear matrix inequalities.

Next, we need to choose the initial parameters (, , , , , , and ) to reduce . The idea is to derive an algorithm where at each iteration, we solve the optimization problem given in Lemma 9, using the solution of this problem at the previous iteration, for initial parameters. The algorithm will use initial values , , , , , , and , which must guarantee that the LMI constraints (66) and (67) will be feasible.

From the above discussion, an algorithm for choosing the initial parameters can be listed as follows.

Algorithm 1. (1) Set initial values , , , and such that satisfies and .
(2) Find the solutions , , and of the optimization derived from Lemmas 3 and 4, which is linear since the matrix inequalities (37), (38), and (39) become linear when is fixed. (The matrices , , and always exist since the cost function in (12) is bounded because is bounded).
(3) Set , , and .
(4) Start loop.
(5) Since the initial values are feasible for (37), (38), and (39), the LMIs (66) to (68) have feasible solutions from (50) and (63) in Lemmas 7 and 8. Compute solutions (, , etc.) of (66) to (68) to minimize .
(6) Rename , , , , , , and , and go to Step 5.
(7) End loop.

Algorithm 1 is convergent, possibly to a local minimum, in the sense that the quantity ? is nonincreasing after each iteration. This can be easily shown using (50) and (52) from Lemma 7, and (63) from Lemma 8.

Remark 4. Lemmas 7, 8, and Algorithm 1 can also be applied to other problems involving QMIs and bilinear matrix inequalities (BMIs). The procedure potentially gives an improvement and seems to work well in practice.

Remark 5. In the case that we choose a diagonal structure for , we may use as initial values. This will ensure that is stable and . We can solve the LMI optimization problem given in Lemma 4. This will give , , , and so forth. These initial values are not unique and can be chosen using other criteria.

Remark 6. A more systematic technique for generating valid initial values is as follows: first, generate any , , , and a stable with the structure chosen, then, compute . If , redefine the matrices as , , , and , which fulfill that the conditions are stable and .

Remark 7. The requirement for to be diagonal is to ensure fault isolability in the case that all faults may occur simultaneously. If we ignore disturbances and uncertainty, and assuming perfect fault isolation, then so that fault only affects residual . If none of the faults can occur simultaneously, then we need only to impose the constraint that is upper (or lower) triangular. While it is not difficult to modify our analysis under these conditions, we only consider the case when is diagonal so that all faults may occur simultaneously since our contribution is focussed on reducing the effect of disturbances and uncertainties under the most stringent fault scenarios.

5. Numerical Example

In order to illustrate the efficiency of Algorithm 1 and the importance of the choice of , we consider a jet engine state-space model [32] supplied by NASA Glenn Research Center given as where The system is subject to three disturbances and three potential faults. Here, the setup is given by

Since no uncertainty parameters were given in this example, we assume an unstructured norm-bounded uncertainty, with matrices , , , , and randomly generated as

A square and diagonal structure for is necessary to get fault isolation in the case that the faults occur at the same time.

Remark 8. If , , and are chosen diagonal, then The terms in can be incorporated in . It follows that we can set . Therefore, the nonlinearity in (37) comes only from bilinear terms and .

To initialize Algorithm 1, is generated following Remark 6 as Then, by following the first two steps of Algorithm 1, we get the optimal as and the values given below for , , and , which will be used to initialize the main loop of the algorithm as follows.

Taking into account Remark 8, we implemented Algorithm 1 in Matlab to minimize . Table 1 shows the evolution of the optimization following Algorithm 1.

Algorithm 1 can clearly improve the result in a few iterations; is reduced by compared to the one obtained with a fixed . This shows that the choice of is essential in our FDI scheme. We get

Remark 9. In our numerical experimentation, other choices for have been used; however, all converged to the same solution but with different numbers of iterations.

In order to show that our filter is robust against disturbances and model uncertainties, we introduce a randomly generated given by as well as three disturbances. Simulated through MATLAB and SIMULINK, these disturbances are three-band-limited white noises with mean and standard deviation 2. Faults and are both simulated by a unit positive jump connected from the 14th second. Fault , simulated by a soft bias (slope ), is connected from the 20th second. Figure 2 gives the residual responses before the algorithm, where each residual is dedicated to a particular fault, while Figure 3 gives the optimized residual response using our algorithm.

The lines , , and represent the optimal trajectories that each residual must follow

In both figures, each fault can be distinguished from the others and the disturbances; however, in Figure 3, the faults can be more easily distinguished and each residual follows its optimal trajectory (green line) with more accuracy. Furthermore, the disturbances are more attenuated compared to Figure 2, and the jumps that indicate faults are clearer in Figure 3 since their amplitudes are bigger and therefore allow a better fault detection using thresholds [33, 34]. The isolation performance is clearly effective as each fault produces a deviation of its residual only, without modifying the trajectory of the others. This example illustrates that the designed filter satisfies the performance requirement of FDI which is sufficiently robust against disturbances and modeling errors. Figure 3 also justifies the efficiency of Algorithm 1 to improve the design of the reference model and therefore the overall performance of our filter.

6. Conclusion

This paper has addressed the problem of fault detection and isolation for linear time-invariant systems subject to faults, disturbances, and model uncertainties. We proposed a performance index that captures the FDI requirements. Through QMI formulations, we gave the design of an optimal filter for polytopic or unstructured norm-bounded uncertainties, and a suboptimal filter for structured norm-bounded uncertainties. Suboptimality in the latter case is inherited from the bounded real lemma, which gives only sufficient LMI conditions for structured uncertainties (see Lemma 2). By allowing the reference model to be variable in our formulation, we get its optimal design, which can be used in other schemes dedicated to fault isolation. The optimal design of this reference model is initially given in a form of a QMI optimization, then a suboptimal solution is obtained by using a linearization procedure which derives an upper bound on the optimal solution of the QMI formulation that requires the solution of an LMI optimization. Note, however, that we have no indication concerning the deviation of our design from the optimal filter. We have also illustrated the effectiveness of our algorithm using a jet-engine example.

Acknowledgment

This work was partially supported by the Ministry of Defence through the Data and Information Fusion Defence Technology Centre.