Abstract

In this study, the problem of controlling an unknown SISO nonlinear system in Brunovsky canonical form with unknown deadzone input in such a way that the system output follows a specified bounded reference trajectory is considered. Based on universal approximation property of the neural networks, two schemes are proposed to handle this problem. The first scheme utilizes a smooth adaptive inverse of the deadzone. By means of Lyapunov analyses, the exponential convergence of the tracking error to a bounded zone is proven. The second scheme considers the deadzone as a combination of a linear term and a disturbance-like term. Thus, the estimation of the deadzone inverse is not required. By using a Lyapunov-like analyses, the asymptotic converge of the tracking error to a bounded zone is demonstrated. Since this control strategy requires the knowledge of a bound for an uncertainty/disturbance term, a procedure to find such bound is provided. In both schemes, the boundedness of all closed-loop signals is guaranteed. A numerical experiment shows that a satisfactory performance can be obtained by using any of the two proposed controllers.

1. Introduction

The deadzone is a nonsmooth nonlinearity commonly found in many practical systems such as hydraulic positioning systems [1], pneumatic servo systems [2], DC servo motors, among others. When the deadzone is not considered explicitly during the design process, the performance of the control system could be degraded by an increase of the steady-state error, the presence of limit cycles, or inclusive instability [36]. A direct way of compensating the deleterious effect of the deadzone is by calculating its inverse. However, this is not an easy question because in many practical situations, both the parameters and the output of the deadzone are unknown. To overcome this problem, in a pioneer work [3], Tao and Kokotović proposed to employ an adaptive inverse of the deadzone. This scheme was applied to linear systems in transfer function form. Cho and Bai [7] extended this work and achieved a perfect asymptotic adaptive cancellation of the deadzone. However, their work assumed that the deadzone output was measurable. In [8], the work of Tao and Kokotović was extended to linear systems in a state space form with nonmeasurable deadzone output. In [9], a new smooth parameterization of the deadzone was proposed and a class of SISO systems with completely known nonlinear functions and with linearly parameterized unknown constants was controlled by using backstepping technique. In order to avoid the construction of the adaptive inverse, in [10], the same class of nonlinear systems as in [9] was controlled by means of a robust adaptive approach and by modeling the deadzone as a combination of a linear term and a disturbance-like term. The controller design in [10] is based on the assumption that maximum and minimum values for the deadzone parameters are a priori known. However, a specific procedure to find such bounds is not provided. In order to extend the class of systems previously considered in [9, 10], in this paper, we propose the development of two controllers based on universal approximation property of the neural networks. The first scheme utilizes a smooth adaptive inverse of the deadzone as in [9]. By means of Lyapunov analyses, the exponential convergence of the tracking error to a bounded zone is proven. The second scheme considers the deadzone as a combination of a linear term and a disturbance-like term as in [10]. Thus, the estimation of the deadzone inverse is not required. By using the Lyapunov-like analyses, the asymptotic converge of the tracking error to a bounded zone is demonstrated. Since this control strategy requires the knowledge of a bound for an uncertainty/disturbance term, a procedure to find such bound is provided. In both schemes, the boundedness of all closed-loop signals is guaranteed. A numerical experiment with a second-order nonlinear system shows that a satisfactory performance can be obtained by using any of the two proposed controllers.

2. Preliminaries and Problem Statement

In this study, the system which will be controlled is composed of an unknown nonlinear plant preceded by an actuator with an unknown deadzone in such a way that the deadzone output is the input to the plant. Consider that the n-order dynamics of the nonlinear plant can be represented as follows: where the scalar is the output of interest, for represents the th derivative of —each one of these derivatives are assumed measurable, is the deadzone output (and the input to the plant), is an unknown but continuous nonlinear function, is an unknown constant, and is an unknown but bounded disturbance. Defining the state variables as , (2.1) can be expressed as follows: where is the measurable state vector for , which is defined as . The nonsymmetric deadzone can be represented by where and are the right and left constant slopes for the deadzone characteristic, and represent the right and left constant breakpoints, and are the output and the input of the deadzone, respectively. Note that is the actual control input to the global system formed by the actuator and the plant. In accordance with [3, 4], the deadzone model (2.3) is a static simplification of diverse physical phenomena with negligible fast dynamics. Hereafter, it is considered that the following assumptions are valid.

Assumption 2.1. Without loss of generality, the unknown constant is assumed positive.

Assumption 2.2. The deadzone output is not available for measurement.

Assumption 2.3. Although the deadzone parameters , and are unknown constants, we can assure that , and .

The objective that we are trying to achieve is to determine a control signal such that the output of the plant (2.2), , follows a specified reference trajectory , and, at the same time, all closed-loop signals stay bounded.

2.1. Smooth Parameterization of the Deadzone

A direct way of compensating the deleterious effect of the deadzone is by calculating its inverse. From (2.3), the deadzone inverse can be obtained as Notwithstanding, clearly this is a discontinuous function. A smooth approximation of (2.4) was presented in [9] as where and is a positive constant chosen by the designer. Since both the parameters and the output of the deadzone are unknown, approximation (2.5) cannot be utilized directly. To overcome this problem, a smooth parameterization of the deadzone was proposed in [9]. For completeness, this parameterization is explained here. Note that (2.3) can be expressed alternatively as where Given that , , and are unknown, the deadzone output is approximated by where is an estimation of and . From (2.7) and (2.9), can be expressed as where . Although is an unknown term, its boundedness can be guaranteed [11]. Consider that the positive constant is an upper bound for , that is, . From (2.9), can be expressed in terms of as This expression can be utilized only if vector is properly determined. To avoid the singularity problem, note that and must always be different from zero.

2.2. Deadzone Representation as a Linear Term and a Disturbance-Like Term

For the particular case when , the deadzone model (2.3) can alternatively be described as [10, 12] where is given by Note that (2.13) is the negative of a saturation function. Thus, although could not be exactly known, its boundedness can be assured. Consider that the positive constant is an upper bound for , that is, .

3. Neurocontroller Design and Stability Analyses

Based on the universal approximation property of the neural networks, two control schemes are presented in this section to handle the trajectory tracking problem.

Assumption 3.1. The reference trajectory and their first n-derivatives are continuous and bounded. Besides, all these variables are available for the design.

Given the reference trajectory and their first -derivatives, the vector can be defined as . Let us define the filtered tracking error as where is the first element of the tracking error vector which is defined simply as and is a positive constant chosen by the designer. Note that can also be expressed as where is a constant vector given by .

Remark 3.2. Note that, from (3.1), can be considered as the input to a stable linear system whose output is . Consequently, if , then . Specifically, has the following properties proven in [13]: (i) converges exponentially to zero, if , (ii) if and where is a positive constant, then belongs to a compact set given by for , and (iii) if and , then will converge to within a time-constant .

The first derivative of can be calculated as where .

Now then, it is well known that any unknown continuous function can be approximated on a compact set by a neural network as follows [1416]: where is the activation vector function with sigmoidal components, that is, : where , and are positive constants which can be specified by the designer, is the approximation error which is bounded by for all , is a positive constant, and is an unknown constant vector. Typically, is considered as the optimal weight such that By substituting (3.4) into (2.2), the following alternative representation for the plant dynamics can be obtained: By substituting (3.7) into (3.3), we get

3.1. Scheme I

A control scheme which does not require the specific knowledge of the upper bound for the term is developed below.

In order to take into account the effect of the deadzone, the adaptive parameterization model (2.10) is substituted into (3.8): Consider that is chosen as where is an online estimation of and is selected as where is an online estimation of and is a positive constant. Note that where . If (3.12) is substituted into (3.9), the following is obtained: Now, by substituting (3.11) into (3.13) and reducing like terms, the filtered tracking error dynamics can be expressed as where and is an unknown term but bounded by the positive constant , that is, .

Once the filtered tracking error dynamics has been determined, the following Lyapunov function candidate is suggested: where , and , , and are positive constants. The first derivative of is The first derivative of can be calculated as follows. Since is a scalar and the transpose of a scalar is the same scalar, then Proceeding in a similar way for , it can be determined that Substituting (3.14), (3.18), and (3.19) into (3.16) yields Consider that the learning laws for , , and are chosen as where , , and are positive constants, , , and are ideally (but not necessarily) good estimations of , , and , respectively, and represents a smooth projection operation as in [17] or [18] in order to avoid that and can be equal to zero. In accordance with (3.21), (3.22), and (3.23) and taking into account that , , and , (3.20) can be expressed as In [11], it is mentioned that the projection operation has the following property: On the other hand, the truthfulness of the following inequalities was proven in [9] Likewise, it can be demonstrated that If the inequalities (3.25), (3.26), (3.27) are substituted into (3.24), we obtain If and defining and the following bound as a function of can finally be determined for : (3.30) can be rewritten in the following form: Multiplying both sides of the last inequality by , it is possible to obtain The left-hand side of (3.32) can be rewritten as or, equivalently, as Integrating both sides of the last inequality from to yields Adding to both sides of the last inequality, we obtain Multiplying both sides of the inequality (3.36) by yields and, consequently As and are positive constants, the right-hand side of the last inequality can be bounded by . Thus, and since by construction is a nonnegative function, the boundedness of , , , and can be guaranteed. Because , , and are bounded, , , and must be bounded too. If , from Remark 3.2, we can assure that and converges to . From (3.11), it can be seen that is formed by bounded terms and consequently . From (3.10), it can be seen that is the product of two bounded variables. Therefore, . As , and and are always different from zero, from (2.11), the boundedness of can be concluded. Now, note that the following is true: . Taking into account this fact and from (3.38), we get By taking the limit as of the last inequality, we can guarantee that converges exponentially fast to a zone bounded by the term . Based on this fact together with Remark 3.2, we can conclude that converges exponentially fast to a region around zero bounded by the term . Thus, the following theorem has been proven.

Theorem 3.3. If Assumptions 2.13.1 are satisfied, , and the control law (2.11), (3.10), (3.11) with the learning laws (3.21), (3.22), and (3.23) are applied to the system formed by (2.2)-(2.3), then the following hold:(a)the filtered tracking error, tracking error, the weights, the estimations of the control gain reciprocal and the deadzone parameters, and the control signal are bounded: (b)the actual tracking error converges exponentially fast to a region around zero bounded by the term:

3.2. Scheme II

A simpler scheme can be obtained by using the model (2.12) and the tuning error instead of the model (2.10) and the filtered tracking error , respectively. However, the implementation of this scheme requires necessarily the a priori knowledge of a good bound for the term . The development of this scheme is explained below.

If the deadzone model (2.12) is substituted into (3.8), we have Note that, by using the model (2.12), the actual control input appears now directly into the error dynamics (3.42) only multiplied by a constant gain .

Consider that is chosen as where is an online estimation of and is selected as where is an online estimation of ; , , and are positive constants, and represents a saturation function given by Note that where .

Substituting (3.46) into (3.42) yields If (3.44) is substituted into (3.47) and reducing like terms, the filtered tracking error dynamics can be expressed as where and is an unknown term bounded by the positive constant , that is, . Consider the following Lyapunov function candidate: where and are positive constants and is the tuning error defined as [10] Taking into account (3.18), the first derivative of can be calculated as It can be demonstrated that . By substituting (3.48) into the last equality and the resulting expression into (3.51), we get Consider that and are chosen as If (3.53) and (3.54) are substituted into (3.52), we obtain Now, from (3.50), Besides, Substituting (3.56) and (3.57) into (3.55) yields Considering that , (3.58) can be expressed as Note that, if and are selected in such a way that , then Because is a positive constant, the last inequality implies that . Based on this fact, it is possible to establish that and, therefore, . Since is a nonnegative function, , and belongs to . Because , and , and as and are constants, the boundedness of and can be guaranteed. From the definition of tuning error (3.50) and as is a bounded term, if , then . Keeping in view the above fact, and on the basis of Remark 3.2, the boundedness of can be assured. Now, it can be seen from (3.44) that is formed by bounded terms. Thus, . Likewise, it can be seen from (3.43) that is the product of two bounded variables and consequently is also bounded. On the other hand, an inspection of (3.48) reveals that . From (3.50), this means that is bounded too. Integrating both sides of (3.60) from to yields Note that the last inequality can be expressed as Since is a nonnegative function, the following is true: Substituting (3.63) into (3.62) yields taking the limit as of both sides of the last inequality, finally, we obtain This means that . As and , from Barbalat’s Lemma, we can conclude that converges asymptotically to zero. From definition (3.50), this implies that asymptotically converges to a region around zero bounded by . In view of the above and from Remark 3.2, we can conclude that converges asymptotically to a region around zero bounded by the term . Thus, the following theorem has been proven.

Theorem 3.4. If Assumptions 2.13.1 are satisfied, , and the control input (3.43), (3.44) with the learning laws , are applied to the system (2.2)-(2.3), then (a)the tuning error, the filtered tracking error, tracking error, the weights, the reciprocal estimation, and the control signal are bounded: (b)the actual tracking error converges asymptotically to a region around zero bounded by the term where and are positive constants chosen by the designer.

3.2.1. Estimation of a Bound for

Certainly, compared with the scheme I, the structure of the scheme II is simpler. Nevertheless, the implementation of this last scheme requires a good estimation of in order to guarantee the theoretical performance provided by Theorem 3.4. Here, we propose an offline practical procedure to achieve this goal.

Consider that some experimental data are available (see Remark 3.5). By substituting (2.12) into (3.7), we obtain Certainly, if , , and could be known, can be completely determined. By hypothesis, is known. However, because of the noise, must not be calculated directly from . Instead, a robust differentiation strategy should be used such as sliding modes [19], high-order sliding modes [20, 21], or smoothing by least squares, among others. Hereafter, the estimation of is denoted by . Once is obtained, let us consider the following regression model: where is simply an error term. By defining , and , (3.69) can be expressed as By using least-squares method, can be estimated as where and . Once is determined, can be approximated as Next, can be estimated from (3.72).

Remark 3.5. These experimental data can be generated by trying to use the neurocontroller II with relatively large values for the constant parameters. A first attempt could be to let and . If the tracking is not satisfactory, then larger values could be tried.

Remark 3.6. Alternatively, in [22], the determination of an error term is achieved by using of sliding modes.

4. Numerical Example

In this section, the proposed neurocontrollers are tested by simulation on the following second order nonlinear system: The initial condition for system (4.1) is is the deadzone output whose parameters are ; , the disturbance term is selected as . The following reference trajectory is proposed . The first and second derivative of can be calculated analytically. In another case, a robust differentiation method must be used. It is very important to mention that the nonlinear system (4.1), the exact values for deadzone parameters, and the disturbance term are assumed completely unknown for the neurocontrollers during the design and simulation process. That is, the system (4.1) is only used as a data generator.

With respect to the tuning process, we must recognize that, similarly to many techniques of adaptive control, we do not have a systematic procedure in order to find the proper values for the controller parameters. Instead, an approach by trial and error is used. Thus, through various iterative simulations, the following values for the parameters of the neurocontroller I were found: Notice that a great freedom is allowed in order to select , , and . However, the designer should be aware of that as these parameters take values increasingly different from the optimal ones, the parameter in Theorem 3.3 becomes larger. On the other hand, we have seen that by setting , , and , a more regular behavior of the closed-loop system can be obtained [23, 24].

The simulation is carried out by means of Simulink with ode45 method, relative tolerance equal to , and absolute tolerance equal to . A Simulink block diagram for the neurocontroller based on scheme I is shown in Figure 1. The results of the tracking process are presented in Figures 24 for the first 20 seconds of the simulation. In Figure 2, the output of nonlinear system (4.1), , is represented by dashed line whereas the reference trajectory is represented by solid line. In spite of the difference between and , the tracking process shows a satisfactory behavior. This can be verified more specifically in Figure 3 where the actual tracking error, , is illustrated. In order to appreciate better the quality of the tracking process, a zoom of Figure 3 is presented in Figure 4. From Figures 3 and 4, we can appreciate that the actual tracking error converges fast to a zone bounded by 0.02. Finally, the control signal acting as the input of deadzone is shown in Figure 5.

For the case of the neurocontroller II, its parameters are selected very similarly to the neurocontroller I parameters in order to provide a proper reference for comparison. The parameters of the neurocontroller II are A Simulink block diagram for the neurocontroller based on scheme II is shown in Figure 6. Since a very similar performance is obtained, the signals of the actual tracking error for the two neurocontrollers are presented together in Figure 7. In Figure 8, a zoom of Figure 7 is presented. As can be seen from Figures 7 and 8, the actual tracking error for the controller II converges fast to a zone bounded by 0.02. This is a remarkable behavior since Theorem 3.4 only guarantees the asymptotical convergence to a zone bounded by 0.2. Finally, the control signals of the two neurocontrollers are displayed in Figure 9.

So far, the neurocontroller II has been used in a nonrigorous way. In order to guarantee its stability, a bound for must be first determined. By using the procedure presented in Section 3, can be estimated as , . With these parameters, the maximum value for is estimated as 7.5819 and a very conservative bound for can be established as . In view of the above and in accordance with the selected values for the neurocontroller II parameters and from Theorem 3.4, the stability of the closed-loop system can be guaranteed.

As the comparison between two empirically tuned techniques is necessarily limited, the main objective of this numerical example was rather to show that by using any of the two proposed controllers, a good performance can be obtained. Notwithstanding, it must be mentioned that the tuning process was easier for the case of Scheme II.

5. Conclusions

In this paper, two adaptive schemes based on universal approximation property of the neural networks are proposed in order to control an unknown SISO nonlinear system in Brunovsky canonical form with unknown deadzone input. The objective is to determine a control signal such that the output of the unknown plant follows a specified reference trajectory, and, at the same time, all closed-loop signals stay bounded in spite of the presence of unknown but bounded disturbances. The first scheme utilizes a smooth adaptive inverse of the deadzone while the second scheme considers the deadzone as a combination of a linear term and a disturbance-like term. For the first case, the exponential convergence of the tracking error is guaranteed by using a Lyapunov analyses. For the second case, only the asymptotical convergence can be guaranteed theoretically. However, a numerical example shows that a similarly satisfactory performance can be obtained by using any of the two proposed controllers. The designer should select between these two schemes in accordance with the particular features of his application and considering the following:(i)Scheme I does not need the specific knowledge of a bound for the unknown dynamics and/or the disturbance term.(ii)Scheme I can handle the case when .(iii)The tuning and implementation process is easier for Scheme II.Finally, it must be mentioned that a compromise should be established between the accuracy of the tracking process and the smoothness of the control signal.

Acknowledgment

J. H. Pérez-Cruz would like to thank the financial support through a postdoctoral fellowship from the Mexican National Council for Science and Technology (CONACYT).