Control and Power Group, Department of Electrical and Electronic Engineering, Imperial College, London SW7 2AZ, UK
This paper considers matrix inequality procedures to address the
robust fault detection and isolation (FDI) problem for linear
time-invariant systems subject to disturbances, faults, and
polytopic or norm-bounded uncertainties. We propose a design
procedure for an FDI filter that aims to minimize a weighted
combination of the sensitivity of the residual signal to
disturbances and modeling errors, and the deviation of the faults
to residual dynamics from a fault to residual reference model, using the -norm as a measure. A key step in our
procedure is the design of an optimal fault reference model. We
show that the optimal design requires the solution of a quadratic
matrix inequality (QMI) optimization problem. Since the solution
of the optimal problem is intractable, we propose a linearization
technique to derive a numerically tractable suboptimal design
procedure that requires the solution of a linear matrix inequality
(LMI) optimization. A jet engine example is employed to
demonstrate the effectiveness of the proposed approach.
1. Introduction
In the past
decade, great attention has been devoted to the design of model-based fault
detection systems and their robustness [1, 2]. With the rapid development of
robust control theory and optimization
control techniques, more and more methods have been presented to solve the robust FDI problem. The -filter is designed such that the -norm of the estimation error is minimized (see [3–5] and the references therein).
Some of the approaches used for this problem include frequency domain
approaches [6], left and right eigenvector assignment [7], structure parity
equation [8], and an unknown input observer with disturbances decoupled in the
state estimation error [9]. Recently developed LMI approaches offer numerically
attractive techniques [10–12].
A reference residual model is an ideal solution for
robust FDI under the assumption that there are no disturbances or model
uncertainty. The idea is to design a filter for the uncertain system that
approximates the solution given by the reference model [13]. In [14], a new
performance index is proposed using such a reference residual model. Frisk and
Nielsen [15] give an algorithm to design a reference model and a robust FDI
filter that fits into the framework of standard robust -filtering
relying on established and efficient methods. However, their framework consists
in solving two optimization problems successively, which results in a
suboptimal solution.
In this paper, we propose a performance index that
captures the requirements of fault isolation and disturbance rejection as well
as the design of the optimal reference model. The fault isolation performance
is measured by the size of the deviation of the fault to residual dynamics from
the reference dynamics model, while the disturbance rejection performance is
measured by the size of the input to residual and disturbance to residual
dynamics. In all cases, the norm is used as
a measure. The design of the optimal reference model is incorporated in the
robust FDI framework. We consider systems subject to norm-bounded or polytopic
uncertainties. For systems described by polytopic and unstructured norm-bounded
uncertainties, we derive an optimal FDI filter obtained as the solution of a
QMI optimization. For systems described by structured uncertainties, we derive
a suboptimal QMI-based solution. Since the solution of a QMI optimization is,
in general, intractable, we propose a linearization technique to derive a
suboptimal design procedure that requires the solution of a numerically
tractable LMI optimization. This note extends our work in [16] by proposing
algorithms for the design of suboptimal reference dynamics.
The structure of the work is as follows. After
defining the notation, we review filter-based FDI techniques for residual
signal generation and give the problem formulation in Section 2. Section 3
presents a matrix inequality formulation for the FDI problem, and gives the
solution and the design of the optimal reference model for both norm-bounded
and polytopic uncertainties in a form of QMI's. Section 4 gives a suboptimal
solution in both cases through the use of an algorithm that necessitates
solving LMI's. Finally, a numerical example is presented in Section 5, and
Section 6 summarizes our results.
The notation we use is fairly standard. The set of
real matrices is
denoted by . For , we use the notation to denote
transpose. For , () denotes that is positive
(negative) definite, that is, all the eigenvalues of are greater
(less) than zero. The identity matrix
is denoted as and the null matrix is
denoted as with the
subscripts occasionally dropped if they can be inferred from context.
Let be the set of
square integrable functions. The -norm of is defined as . A transfer matrix will be denoted
as or and dependence on the variable will be
suppressed. For a stable transfer matrix , we define In section 3, we use the following result.
Lemma 1 (see [17]). Let with
sign
controllable, , and . Then
has a spectral
factor
(i.e.,
) if and only
if there exists symmetric
that satisfies
the following linear matrix inequality:
2. Problem Formulation
Consider a
linear time-invariant dynamic system subject to disturbances, modeling errors
and process, sensor and actuator faults modeled as where , , and are the process
state and input and output vectors, respectively, and where and are the
disturbance and fault vectors, respectively. Here, and are the
component and instrument fault distribution matrices, respectively, while and are the
corresponding disturbance distribution matrices [18].
We consider two types of uncertainties: norm-bounded
and polytopic uncertainties. In the case of norm-bounded uncertainties, where represents the
nominal model, , where and where , , , , , , and are known and
constant matrices with appropriate dimensions. This linear fractional
representation of uncertainty, which is assumed to be well-posed over (i.e., for all ), has great
generality and is widely used in control
theories.
In the case of polytopic uncertainties, where , are known
constant matrices with appropriate dimensions.
A residual signal in an FDI system should represent the
inconsistency between the system variables and the mathematical model. The
objective is to design an FDI filter of the form where is the filter
state and is the residual
signal. Figure 1 illustrates this filter in the robust residual generation
scheme.
Figure 1: Filter-based robust FDI scheme.
By defining an augmented state the residual
dynamics are given by or where , , and denote the
dynamics from faults, disturbances, and inputs to the residual, respectively.
Note that dependence on the uncertain data is indicated by a superscript .
Ideally, the residual signal is required to be
sensitive only to faults. This corresponds to , , and . For fault isolation, it is further required that the
fault signature can be deduced from the residual. This corresponds to , where is a set of
transfer matrices with a special structure that depends on the nature of the
faults, for example, if the faults can occur simultaneously, is the set of
stable diagonal transfer matrices with nonzero diagonal entries. Unfortunately,
characterizing a general structured set is intractable,
and we will assume that we can define a subset such that
subsequent optimizations over the structured state-space data sets , , , and are tractable.
For example, if denotes the set
of all stable diagonal
transfer matrices with nonzero diagonal entries, we may define as the set of
all diagonal
matrices with negative entries, and , , and as the sets of
all diagonal
matrices. An example of this simplification procedure is given in Section 5
below. We also replace the requirement of nonzero diagonal entries for by a condition
of the form for all .
Due to the presence of disturbances and modelling
uncertainties, exact FDI is not possible. For robust FDI, we propose the
following, more realistic, problem formulation.
Problem 1. Assume that the system dynamics in (4) are
quadratically stable. For any , find a stable fault reference dynamics such that (to ensure the
requirement of nonzero diagonal entries for ), and find a
stable filter of the form given in (8), if it exists, such that the residual
dynamics in (10) are quadratically stable and where for
norm-bounded uncertainties and for polytopic
uncertainties.
Recall that quadratic stability for the dynamics in
(4) is equivalent to the existence of such that for all (see [19] for
more details).
A modified version of Problem 1 uses a weighted cost
function, say, of the form where , , and are constant or
frequency-dependent weighting functions that can vary the emphasis between
fault detection (small and ) and fault
isolation (small ). In the
sequel, we assume that any such weighting functions are absorbed in the system
data.
Remark 1. The objective is to find the smallest
for which (12)
is satisfied. Indeed, by minimizing , we will ensure (which measures
the disturbance rejection level), (which measures
the effect of uncertainty), and (which measures
the deviation of the fault to residual dynamics from the reference dynamics,
and hence is a measure of fault isolation capability) to be small.
A poorly chosen reference model can result in a
residual generator with poor robustness. Here, we incorporate its design into
our scheme so as to improve the robustness of the FDI filter.
In some approaches, a common assumption is that and/or has full rank
[20, 21]; in others, the assumption has full rank
and is widely used [22, 23]. Here, we do not impose any of these assumptions.
Note, however, that if does not have
full column rank, for example, if , then this will have an adverse effect on the minimum
values of since This would
therefore limit the overall performance of the filter, which is measured by the
value of .
3. Matrix Inequality Formulation
In this
section, we derive a matrix inequality formulation of Problem 1. The main
idea is to express (12) in terms of QMIs using the bounded real lemma and change
of variables techniques, and then to derive necessary and sufficient conditions
for solvability.
The dynamics in (12) can be written as It follows from
the bounded real lemma that there exists a stable filter of the form given in
(8) such that (12) is satisfied if and only if there exists such that and for all (see
[24, Theorem 3]). We deal separately with norm-bounded and polytopic uncertainties.
3.1. Solution with Norm-Bounded Uncertainties
For
norm-bounded uncertainties, we separate the terms involving modeling
uncertainties from the other terms as where Using this
separation, the inequality (15) can be rewritten as A calculation
shows that , where The next result uses the fact that to remove
explicit dependence on .
Lemma 2 (see [25]). Let
be as described
in (6) and define the subspaces Let , , , and be matrices
with appropriate dimensions. We have and for every if there exist and such that and If is unstructured
(i.e., if
), then (22)
becomes for some scalar . In this case, condition (23) is both necessary and
sufficient.
When the uncertainty set is unstructured, then Lemma
2 implies that for some . Using a Schur complement argument shows that (19) is
equivalent to where denotes terms
readily inferred from symmetry. Next, we introduce a change of variables to
linearize the above matrix inequality [26]. Assume that , that is, the filter order is equal to the order of
the system plus the order of the reference model. Let where are symmetric
matrices, and . Define . Then if and only if . From , we have the following calculations, where boldface
letters are used to indicate optimization variables: where we have
defined If and are invertible,
the variables
, , , , , can be replaced
by the new variables , , , , , without loss of
generality. We can now rewrite (19) as which is
nonlinear in the variables. Note that the nonlinearities involve the terms and only. The
constraint can be
expressed as an LMI as follows:
The constraint can be
expressed as a quadratic matrix inequality using the next lemma.
Lemma 3. Let be as defined
above. Then if and only if
there exists such that
Proof. Let . Then if and only if . It is easy to show that can be written
as follows: The result then
follows from Lemma 1.
The next lemma summarizes our result so far by giving
necessary and sufficient conditions for the solution of Problem 1, in the
case that the uncertainty set is unstructured, in the form of a QMI feasibility
problem.
Lemma 4. Assume that
is
unstructured. Then there exist a stable filter of the form given in (8) and a
stable fault reference dynamics model , where
is defined in
(11), such that , residual dynamics in (10) are quadratically stable,
and (12) is satisfied for if and only if
there exist , , , , , , , , , , and symmetric matrices , , and
such that (37),
(38), and (39) are satisfied. If such variables exist, the filter dynamics are
obtained by solving (36) where and
are chosen such
that .
Approximate solutions to these QMIs can be obtained by
using algorithms with guaranteed global convergence [27, 28], as well as
local numerical search algorithms that converge (without a guarantee) much
faster [29, 30]. A related discussion of the solution algorithms for QMIs can
also be found in [31]. In Section 4 below, we develop an alternative procedure
for the approximate solution of these QMIs.
Remark 2.
In
the case that is preassigned
to a known value, (37) becomes linear and (39) becomes irrelevant, therefore,
the optimal solution is given in a form of an LMI optimization. This case has
been considered in [16].
When is structured,
we proceed as follows. By using (22) from Lemma 2, we get where Using a Schur
complement argument and the expression of and in (20), we
get
As we did for unstructured uncertainties, we use the
same matrix to allow to
change variables, it follows that if and only if . We multiply by and from left and
right, respectively, to get if and only
if Therefore, when is structured,
we have the following sufficient condition for solvability.
Lemma 5.
Suppose that
has the
structure defined in (6). Then there exist a stable filter of the form given in
(8) and a stable fault reference dynamics model , where
is defined in
(11), such that , the residual dynamics in (10) are quadratically
stable, and (12) is satisfied for if there exist , , , , , , , , , , , , and symmetric matrices , , and
such that (38),
(39), and (44) are satisfied.
3.2. Solution with Polytopic Uncertainties
In this
section, we derive necessary and sufficient conditions for solvability of the
robust FDI problem for a system subject to polytopic uncertainties, in the form
of LMIs. Now, where are as defined
in (17), but with superscript replaced by . We assume that the polytopic system is quadratically
stable. Recall that (12) is satisfied if and only if (15) is satisfied for all . Now,
Noting that the change of variable defined in (36) is
independent of , we can therefore use it in this scheme as well.
Letting , it follows that where the are as
defined in (27)–(33) and (34)-(35), but with the nominal model data replaced by . Therefore, for polytopic uncertainties, we can
derive necessary and sufficient conditions for solvability of Problem 1 in
the form of a QMI feasibility problem as follows.
Lemma 6. Let . Then there exist a stable filter of the form given
in (8) and a stable fault reference dynamics model , where
is defined in
(11), such that , the residual dynamics in (10) are quadratically
stable, and (12) is satisfied for if and only
there exist , , , , , , , , , , , , and symmetric matrices , , and such that (38),
(39), and
are satisfied
for .
4. Robust FDI Filter Design Using LMIS
In this
section, we give a suboptimal solution to Problem 1. An optimal solution
would necessitate the solution of a quadratic matrix inequality and is in
general intractable. Here, we propose a linearization procedure to derive an
upper bound on the optimal solution that requires solving an LMI optimization
problem.
The following general result demonstrates that if we
have one feasible solution to a QMI optimization, then we can construct an LMI
optimization problem whose solution gives an upper bound on the QMI
problem.
Lemma 7.
Let and be given and
let and . For , define Then
Proof. Let . Then by using a Schur complement argument, we
get Let . Now, and it follows
that That is, Using (53) and
noting that and , we have A similar proof
can be used to derive (51) and (52).
In order to simplify the subsequent analysis, we adopt
the convention that variables appended with a subscript “” denote
feasible values of the variables for the QMIs (37) and (39).
In (37), the only nonlinear terms are , , , and . We denote the matrix in (37) by , and set to be the
matrix , with the nonlinear terms removed.
Let , , , and be given. We
use Lemma 7 to derive an LMI formulation. We can write as where
Let denote the
value of when and are replaced by and , respectively, let denote the
value of when and are replaced by and , respectively, and define . Using (49) from Lemma 7, we get where
To linearize the matrix inequality in (39), we need
Lemma 7 and the following lemma, whose proof is similar to that of Lemma 7
and is therefore dropped.
Lemma 8.
Let and . For , define Then
Let , , and be given. Using
(51) from Lemmas 7 and 8, it is easy to get
where
where
, , . The next lemma
summarizes the results of this section by giving a linearized formulation of
the optimization problem defined in Lemma 4 using (60) and (64).
Lemma 9. Assume that
is
unstructured. Let , , , , , , and
be given. Then
there exist a stable filter of the form given in (8) and a stable fault
reference dynamics model , where
is defined in
(11), such that , residual dynamics in (10) are quadratically stable,
and (12) is satisfied for if there exist , , , , , , , , , , , and symmetric matrices , , and such that
Remark 3.
This scheme
can also be applied to Lemmas 5 and 6 to obtain a suboptimal solution
involving linear matrix inequalities.
Next, we need to choose the initial parameters (, , , , , , and ) to reduce . The idea is to derive an algorithm where at each
iteration, we solve the optimization problem given in Lemma 9, using the
solution of this problem at the previous iteration, for initial parameters. The
algorithm will use initial values , , , , , , and , which must guarantee that the LMI constraints (66)
and (67) will be feasible.
From the above discussion, an algorithm for choosing
the initial parameters can be listed as follows.
Algorithm 1. (1) Set initial
values , , , and such that satisfies and .
(2) Find the
solutions , , and of the
optimization derived from Lemmas 3 and 4, which is linear since the matrix inequalities (37), (38), and (39) become linear when is fixed. (The
matrices , , and always exist
since the cost function in (12) is bounded because is bounded).
(3) Set , , and .
(4) Start loop.
(5) Since the
initial values are feasible for (37), (38), and (39), the LMIs (66) to (68) have feasible solutions from (50) and (63) in Lemmas 7 and 8. Compute solutions (, , etc.) of (66) to (68) to minimize .
(6) Rename , , , , , , and , and go to Step 5.
(7) End loop.
Algorithm 1 is convergent, possibly to a local minimum, in the sense that the quantity γ is nonincreasing
after each iteration. This can be easily shown using (50) and (52) from Lemma 7, and (63)
from Lemma 8.
Remark 4. Lemmas 7, 8, and Algorithm 1 can also be applied to other problems involving
QMIs and bilinear matrix inequalities (BMIs). The procedure potentially gives an improvement and seems
to work well in practice.
Remark 5. In
the case that we choose a diagonal structure for , we may use as initial
values. This will ensure that is stable and . We can solve the LMI optimization problem given in
Lemma 4. This will give , , , and so forth. These initial values are not unique
and can be chosen using other criteria.
Remark 6.
A
more systematic technique for generating valid initial values is as follows:
first, generate any , , , and a stable
with the
structure chosen, then, compute . If , redefine the matrices as , , , and , which fulfill that the conditions are stable and .
Remark 7. The requirement for to be diagonal
is to ensure fault isolability in the case that all faults may occur
simultaneously. If we ignore disturbances and uncertainty, and assuming perfect
fault isolation, then so that fault only affects
residual . If none of the faults can occur simultaneously, then
we need only to impose the constraint that is upper (or
lower) triangular. While it is not difficult to modify our analysis under these
conditions, we only consider the case when is diagonal so
that all faults may occur simultaneously since our contribution is focussed on
reducing the effect of disturbances and uncertainties under the most stringent
fault scenarios.
5. Numerical Example
In order to
illustrate the efficiency of Algorithm 1 and the importance of the choice of , we consider a jet engine state-space model [32]
supplied by NASA Glenn Research Center given as where The system is subject to three disturbances and three
potential faults. Here, the setup is given by
Since no uncertainty parameters were given in this
example, we assume an unstructured norm-bounded uncertainty, with matrices , , , , and randomly
generated as
A square and diagonal structure for is necessary to
get fault isolation in the case that the faults occur at the same time.
Remark 8. If , , and are chosen diagonal,
then The terms in can be
incorporated in . It follows that we can set . Therefore, the nonlinearity in (37) comes only from
bilinear terms and .
To initialize Algorithm 1, is generated
following Remark 6 as Then, by
following the first two steps of Algorithm 1, we get the optimal as and the values
given below for , , and , which will be used to initialize the main loop of
the algorithm as follows.
Taking into account Remark 8, we implemented
Algorithm 1 in Matlab to minimize . Table 1
shows the evolution of the
optimization following Algorithm 1.
Algorithm 1 can clearly improve the result in a few
iterations; is reduced by compared to the
one obtained with a fixed . This shows that the choice of is essential in
our FDI scheme. We get
Remark 9.
In
our numerical experimentation, other choices for
have been used;
however, all converged to the same solution but with different numbers of
iterations.
In order to show that our filter is robust against
disturbances and model uncertainties, we introduce a randomly generated given by as well as
three disturbances. Simulated through MATLAB and SIMULINK, these disturbances
are three-band-limited white noises with mean and standard
deviation 2. Faults and are both
simulated by a unit positive jump connected from the 14th second. Fault , simulated by a soft bias (slope ), is connected
from the 20th second. Figure 2 gives the residual responses before the algorithm, where each residual is
dedicated to a particular fault, while Figure 3 gives the optimized residual
response using our algorithm.
Figure 2: Time response of the residual before the optimization.
Figure 3: Time response of the residual after the optimization.
The lines , , and represent the
optimal trajectories that each residual must follow
In both figures, each fault can be
distinguished from the others and the disturbances; however, in Figure 3, the faults can be
more easily distinguished and each residual follows its optimal trajectory
(green line) with more accuracy. Furthermore, the disturbances are more
attenuated compared to Figure 2, and the jumps that indicate faults are clearer
in Figure 3 since their amplitudes are bigger and therefore allow a better
fault detection using thresholds [33, 34]. The isolation performance is clearly
effective as each fault produces a deviation of its residual only, without
modifying the trajectory of the others. This example illustrates that the
designed filter satisfies the performance requirement of FDI which is
sufficiently robust against disturbances and modeling errors. Figure 3 also
justifies the efficiency of Algorithm 1 to improve the design of the reference
model and therefore the overall performance of our filter.
6. Conclusion
This paper has
addressed the problem of fault detection and isolation for linear
time-invariant systems subject to faults, disturbances, and model
uncertainties. We proposed a performance index that captures the FDI
requirements. Through QMI formulations, we gave the design of an optimal filter
for polytopic or unstructured norm-bounded uncertainties, and a suboptimal
filter for structured norm-bounded uncertainties. Suboptimality in the latter
case is inherited from the bounded real lemma, which gives only sufficient LMI
conditions for structured uncertainties (see Lemma 2). By allowing the reference model to be variable
in our formulation, we get its optimal design, which can be used in other
schemes dedicated to fault isolation. The optimal design of this reference
model is initially given in a form of a QMI optimization, then a suboptimal
solution is obtained by using a linearization procedure which derives an upper
bound on the optimal solution of the QMI formulation that requires the solution
of an LMI optimization. Note, however, that we have no indication concerning
the deviation of our design from the optimal filter. We have also illustrated
the effectiveness of our algorithm using a jet-engine example.
Acknowledgment
This work was
partially supported by the Ministry of Defence through the Data and Information
Fusion Defence Technology Centre.