Abstract

The trajectory tracking for a class of uncertain nonlinear systems in which the number of possible states is equal to the number of inputs and each input is preceded by an unknown symmetric deadzone is considered. The unknown dynamics is identified by means of a continuous time recurrent neural network in which the control singularity is conveniently avoided by guaranteeing the invertibility of the coupling matrix. Given this neural network-based mathematical model of the uncertain system, a singularity-free feedback linearization control law is developed in order to compel the system state to follow a reference trajectory. By means of Lyapunov-like analysis, the exponential convergence of the tracking error to a bounded zone can be proven. Likewise, the boundedness of all closed-loop signals can be guaranteed.

1. Introduction

During the last two decades, the control of systems using artificial neural networks (ANNs) has emerged as an effective and successful alternative to the conventional control techniques. The success of this approach lies on the universal approximation capability of ANNs which avoids the need for very time-consuming first principles modeling. Thus, it is possible to handle a broad class of nonlinear uncertain systems with little or (ideally) no a priori information.

The first deep insight about the identification and control of dynamic systems based on neural networks was provided by Narendra and Parthasarathy in [1]. However, they could not present a systematic procedure to analyze the stability of their neurocontrollers. This issue was addressed by Polycarpou and Ioannou [2], Rovithakis and Christodoulou [3], Kosmatopoulos et al. [4], and Yu and Poznyak [5]. They used Lyapunov-like analysis systematically in order to prove the stability of their algorithms. Based on these results, further refinements and improvements were accomplished in [69] and different applications were explored for robotics [10, 11], manufacturing systems [12], chemical process [13], power systems [14], and so on. It is worth mentioning that the vast majority of these studies are based on feedback linearization techniques. An inherent problem associated with these techniques is the possibility of the control singularity. A first approach to try to solve this problem is simply to focus only on a class of systems in which the gain function is known and constant [15, 16]. Certainly, to consider only this kind of systems could result into being very restrictive in practice. A generalized procedure to handle the control singularity consists of making modifications to the conventional adaptive algorithms [1721]. Nonetheless, such modifications could provoke discontinuities in the control signal or well they could require the use of projection techniques. In this last case, the design and implementation process of such controllers could become quite complicated. To avoid the utilization of the projection, an integral-type Lyapunov function was proposed in [22]. On the basis of this function, a singularity-free smooth adaptive neural controller was developed. Notwithstanding, due to the requirement of the integral operation, the practical implementation of this approach is difficult [23]. In [24, 25], the neurocontrol of systems with a unique input was studied. The singularity was avoided by maintaining the input weight always different from zero. However, a systematic procedure for this goal was not specified.

Note that for the case of systems with multiple inputs, in particular when the number of states is equal to the number of inputs, the avoidance of the singularity cannot be guaranteed only by maintaining the coupling matrix of the neural network (see (12)), that is, , different from zero. Evidently, a stronger condition is required. In fact, the necessary and sufficient condition to guarantee the nonsingularity is that the coupling matrix should always be invertible. To simplify the implementation of this condition, the input weight matrix and the sigmoidal function matrix can be constructed as square matrices. Besides, can be selected in such a way that its invertibility can be assured. Then, the problem now is focused on guaranteeing the invertibility of the input weight . In this paper, unlike [24, 25] where no concrete procedure was specified and as an alternative to the projection techniques presented in [1721], we propose a simple strategy to avoid the control singularity. Taking into account that a necessary and sufficient condition for the invertibility of the square matrix is that or equivalently , we define a positive threshold in such a way that when , the weights of the neural network are updated according to stable learning laws. However, at the instant when the condition is presented, the process of learning is immediately stopped. The effect of this modification on the stability of the identification error is thoroughly studied by means of Lyapunov analysis. The proposed strategy is applied to the identification and control of a class of uncertain nonlinear systems with multiple inputs each one subjected to an unknown deadzone.

The deadzone is a nonsmooth nonlinearity commonly found in many practical systems such as electrohydraulic systems [26], pneumatic servo systems [27], DC servo motors [28], and rudders and propellers [29]. When the deadzone is not considered explicitly during the design process, the performance of the control system could be degraded due to an increase of the steady-state error, the presence of limit cycles, or inclusive instability [3032].

A direct way of compensating the deleterious effect of the deadzone is by calculating its inverse. However, this is not an easy question because in many practical situations, both the parameters and the output of the deadzone are unknown. To overcome this problem, in a pioneer work [30], Tao and Kokotovic proposed to employ an adaptive inverse of the deadzone. This scheme was applied to linear systems in transfer function form. Cho and Bai [33] extended this work and achieved a perfect asymptotic adaptive cancellation of the deadzone. However, their work assumed that the deadzone output was measurable. In [34], the work of Tao and Kokotovic was extended to linear systems in a state space form with nonmeasurable deadzone output. In [35], a new smooth parameterization of the deadzone was proposed and a class of SISO systems with completely known nonlinear functions and with linearly parameterized unknown constants was controlled by using backstepping technique. In order to avoid the construction of the adaptive inverse, in [36], the same class of nonlinear systems as in [35] was controlled by means of a robust adaptive approach and by modeling the deadzone as a combination of a linear term and a disturbance-like term. The controller design in [36] was based on the assumption that maximum and minimum values for the deadzone parameters are a priori known. However, a specific procedure to find such bounds was not provided. Based on the universal approximation property of the neural networks, a wider class of SISO systems in Brunovsky canonical form with completely unknown nonlinear functions and unknown constant control gain was considered in [3739]. Apparently, the generalization of these results to the case when the control gain is varying and state dependent is trivial. Nevertheless, the solution to this problem is not simple due to the singularity possibility for the control law. In [40, 41], this problem was overcome.

All the aforementioned works about deadzone studied a very particular class of systems, that is, systems in strict Brunovsky canonical form with a unique input. In this paper, we consider a wider class of systems, that is, uncertain nonlinear systems with multiple inputs where each input is preceded by an unknown symmetric deadzone. This global system could be seen as formed by an unknown affine system (see (1)) whose inputs are the outputs of the different deadzones. By generalizing the model used in [36], the multiple deadzones can be represented by means of a diagonal matrix multiplied by the global input vector plus a disturbance-like vector (the diagonal matrix is composed by the unknown symmetric slopes of each deadzone). By using this model, a continuous time recurrent neural network is employed to identify the global unknown dynamics. On the basis of this neural network, an instantaneous mathematical model of the uncertain system can be obtained, and a singularity-free feedback linearization control law is developed in such a way that the system state is compelled to follow a bounded reference trajectory. Once again, by using Lyapunov analysis, the exponential convergence of the tracking error to a bounded zone can be shown. Likewise, the boundedness of all closed-loop signals can be guaranteed.

2. Preliminaries

2.1. Notation

Throughout this paper, we will use the following notation.

represents the identity matrix of dimension . Given , denotes the Euclidean norm of ; that is, . Given the following vector norm: , we will say that when is finite.

2.2. Description of the System

In this study, the system to be controlled consists of an unknown multi-input nonlinear plant in which each input is preceded by an unknown symmetric deadzone; that is, where is the measurable state vector for is an unknown but continuous nonlinear vector function, is an unknown but continuous nonlinear matrix function, represents an unknown but bounded deterministic disturbance, the th element of the vector , that is, , represents the output of the th deadzone, is the input to the th deadzone, and represent the right and left constant breakpoints of the th deadzone, and is the constant slope of the th deadzone. In accordance with [30, 31], the deadzone model (2) is a static simplification of diverse physical phenomena with negligible fast dynamics.

Note that is the actual control input to the global system described by (1) and (2). Hereafter, it is considered that the following assumptions are valid.

Assumption 1. The plant described by (1)-(2) is controllable.

Assumption 2. The th deadzone output, that is, , is not available for measurement.

Assumption 3. Although the th deadzone parameters , and are unknown constants, we can assure that , , and for .

2.3. Deadzone Representation as a Linear Term and a Disturbance-Like Term

The model of the th deadzone (2) can alternatively be described as follows [34, 42]: where is given by Note that (4) is the negative of a saturation function. Thus, although could not be exactly known, its boundedness can be assured. Consider that the positive constant is an upper bound for ; that is, .

Based on (3), in [43], the relationship between and can be expressed as where and is given by . Clearly, . Consider that the positive constant is an upper bound for .

3. Identification Process with Guaranteed Invertibility of

In this section, the identification problem of the unknown global dynamics described by (1) and (2) using a recurrent neural network is considered.

Note that an alternative representation for (1) is given by where is a Hurwitz matrix which can be selected for simplicity as , is a positive constant proposed by the designer, and are unknown constant weight matrices, is an activation vector function with sigmoidal components, that is, where , , and are positive constants which can be specified by the designer, is a sigmoidal function selected as where , , and are positive constants which can be specified by the designer, and is the unmodeled dynamics which can be defined simply as

Remark 4. Note that the structure for the sigmoidal function was selected in such a way that its invertibility can always be guaranteed.

Remark 5. Typically, and are considered as the optimal values of the weights which minimize the unmodeled dynamics . Although procedures to find such values are presented in [44, 45], in this study the aforementioned values are not required anymore. Thus, the design process is considerably simplified.

Assumption 6. On a compact set , unmodeled dynamics is bounded by ; that is, . The disturbance is also bounded; that is, . Both and are positive constants not necessarily a priori known.

Assumption 7. The input signal is bounded; that is, , where is a positive constant (not necessarily known).

By substituting (5) into (6), we get

Remark 8. It can be observed that by using the model (5), the actual control input appears now directly into the dynamics.

Since, by construction, is bounded, the term must also be bounded. Let us define the following expression: . Clearly, this expression is bounded. Let us denote an upper bound for as . This bound is a positive constant not necessarily a priori known. Now, note that the term can be alternatively expressed as where is an unknown weight matrix. In view of the above, (10) can be rewritten as Now, consider the following series-parallel structure for a continuous-time recurrent neural network where is the state of the neural network, is the control input as in (10), and and are the time-varying weight matrices. In order to solve the problem of identifying system (1)-(2) based on the recurrent neural network (12), given the measurable state and the input , we should be able to adjust on line the weights and by proper learning laws such that the identification error can be reduced to a bounded zone around zero and, at the same time, the invertibility of can be guaranteed. Specifically, we employ in this study the following learning laws: where , , , and are positive constants which can be selected by the designer, and is a positive constant adjustable by the designer.

Based on the learning laws (13) and (14), we can establish here the following result.

Theorem 9. If Assumptions 2, 3, 6, and 7 are satisfied, the constant is selected greater than , and the weight matrices , of the neural network (12) are adjusted by the learning laws (13) and (14), respectively, then (a)the identification error and the weights of the neural network (12) are bounded as follows: (b1)when , the norm of the identification error, that is, , converges exponentially fast to a zone bounded by the term where and (b2)when , the identification error is uniformly ultimately bounded with respect to the set where and is an upper bound for the term

Proof of Theorem 9. First, let us determine the dynamics of the identification error. The first derivative of is simply Substituting (12) and (11) into (21) yields the following: where and .
Consider the following Lyapunov function candidate: The first derivative of is Substituting (22) into (24) and taking into account that yield Since and , the first derivatives for and are clearly and , respectively. However, and are given by the learning laws (13) and (14). Therefore, by substituting (13) into and (14) into and the corresponding expressions into the right-hand side of (25), it is possible to obtain It is easy to show that Substituting (27) into (26) yields Let us consider the two possible cases for separately.
Case I (when ). On this condition, taking into account that if (29) is substituted into (28) and reducing the like terms, we can get Besides, it can be proven [46] that Substituting (31) into (30) yields or In view of the following bound as a function of can finally be determined for : This implies that the following bound for can be established (the demonstration of this intermediate result can be consulted in [43]) Since by definition and are positive constants, the right-hand side of the inequality (36) can be bounded by . Thus, and since by construction is a nonnegative function, the boundedness of , , and can be guaranteed. Because and are bounded, and must be bounded too and the first part of Theorem 9 has been proven. With respect to the second part of this theorem, from (23), it is evident that . Taking into account this fact and from (36), we get By taking the limit as of the inequality (37), we can guarantee that converges exponentially fast to a zone bounded by the term and the part (b1) of the Theorem 9 has been proven.
Case II (when ). Under this condition, (28) becomes simply It is easy to show that Substituting (39) into (38) yields Now, from the learning laws (13) and (14), the condition implies that and . This means that and become constant matrices and consequently they are bounded. From the above and taking into account Assumption 7, we can conclude the boundedness of the term Consider that a bound for such term is the constant (not necessarily known) and given , (40) becomes Note that if , then . This means that for and consequently . If clearly is bounded. Thus, the uniform ultimate boundedness of with respect to the set can be guaranteed and the proof is completed.

Remark 10. Note that the utilization of permits to guarantee that for . Hence, is an invertible matrix for . Certainly, the designer should select in such a way that this condition can be fulfilled.

4. Tracking Controller

In this section, an appropriate control law will be determined in such a way that the state of system (1)-(2) follows a given reference trajectory , and, at the same time, all closed-loop signals stay bounded.

Assumption 11. The reference trajectory and its first derivative are continuous and bounded. Besides, these variables are available for the design.

By simultaneously adding and subtracting the terms and into (11), we can get Let us define the term as According to this definition, (44) can be expressed simply as Note that if the learning laws (13) and (14) are used to adjust the weights and , respectively, then according to Theorem 9   and belong to . As and are known and bounded terms and is an unknown but bounded term, from (45), we can conclude that is an unknown but bounded term. Consider that the positive constant (not necessarily known) is an upper bound for ; that is, .

Now, let us define the tracking error as The first derivative of (47) is simply Substituting (46) into (48) yields By using the principle of feedback linearization, we propose the following control law: where is a Hurwitz matrix which can be selected by simplicity as , is a positive constant proposed by the designer such that . If (50) is substituted into (49), we can get We can analyze the dynamics of the tracking error given in (51) by proposing the following Lyapunov function candidate: The first derivative of (52) is Substituting (51) into (53) yields It is easy to show that Taking into account (55) and given , (54) becomes or By defining the following positive constants and , (57) can be expressed as Hence, According to (52) and (59), it can be established that From this last inequality, the boundedness of can be concluded. From the above and taking into account that and as, according to Assumption 11, is bounded, must also be bounded. This implies, according to (50), that belongs to and this last result agrees with Assumption 7. Finally, by taking the limit as in both sides of the inequality (60), we can guarantee that converges exponentially fast to a zone bounded by the term . In this way, the following theorem has been proven.

Theorem 12. If Assumptions 111 are satisfied, the constant is selected greater than , the weight matrices , of the neural network (12) are adjusted by the learning laws (13) and (14), and the control law (50) is applied to the system formed by (1)-(2), then(a)the tracking error and the state of system (1) are bounded (b)the norm of the tracking error, that is, , converges exponentially fast to a zone bounded by the term where and .

5. Numerical Example

In order to illustrate the strategy proposed in this paper, a simulation example is presented in this section. A second order nonlinear system (see (63)) is used as the unknown plant. Consider The initial condition for system (63) is , . Each input of this system is preceded by a deadzone. The parameters of each deadzone are as follows: , , , , , and . The states of the reference trajectory are selected as and . The parameters of the neural identifier and the control law are selected by trial and error as , , , , , , , , , , , and . The results of the tracking process are presented in Figures 1 and 2 for the first 10 seconds. In Figure 1, the state of the nonlinear system (63) is represented by dashed line whereas the reference trajectory is represented by solid line. The state (dashed line) and the reference trajectory (solid line) are presented in Figure 2 for comparison. The control signals and are shown in Figures 3 and 4, respectively.

6. Conclusions

In this paper, the exponential tracking for a class of nonlinear systems with unknown deadzones using recurrent neural networks was considered. Since physical model is not available, the neural networks are used to identify the unknown dynamics. The main novelty in this study is a systematic procedure for the modification of the learning laws of the synaptic weights in such a way that the avoidance of the control singularity can be guaranteed. This objective is achieved by continuously monitoring the determinant of the coupling matrix or more specifically the input weight matrix. By defining a threshold for the determinant of the input weight, a “dangerous” region next to the singularity can be established. When such region is reached, the learning process is immediately stopped. In this way, the invertibility of the coupling matrix is guaranteed. The effect of this modification on the identification error stability is rigorously studied by means of Lyapunov analysis. On the basis of the instantaneous mathematical model obtained by the identification process, a singularity-free feedback linearization control law is developed in order to compel the system state to follow a reference trajectory. By means of Lyapunov-like analysis, the exponential convergence of the tracking error to a bounded zone can be proven. Likewise, the boundedness of all closed-loop signals can be guaranteed. Certainly, the main attractiveness of the suggested approach is its simplicity. However, it must be mentioned that the turning off of the learning law could reduce the system performance. In fact, in such conditions, the control action becomes mainly proportional.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors would like to thank the anonymous reviewers for their valuable comments and suggestions to improve this paper. The first author would like to thank the financial support through a postdoctoral fellowship from The Mexican National Council for Science and Technology (CONACYT) under Grant 194775-IPN.