#### Abstract

A novel active disturbance rejection control (ADRC) controller is proposed based on support vector regression (SVR). The SVR-ADRC is designed to force an underactuated autonomous underwater vehicle (AUV) to follow a path in the horizontal plane with the ocean current disturbance. It is derived using SVR algorithm to adjust the coefficients of the nonlinear state error feedback (ELSEF) part in ADRC to deal with nonlinear variations at different operating points. The trend of change about ELSEF coefficients in the simulation proves that the designed SVR algorithm maintains the characteristics of astringency and stability. Furthermore, the path following errors under current in simulation has proved the high accuracy, strong robustness, and stability of the proposed SVR-ADRC. The contributions of the proposed controller are to improve the characteristics of ADRC considering the changing parameters in operating environment which make the controller more adaptive for the situation.

#### 1. Introduction

AUVs are unmanned submarines that carry their own power source and a computer unit, running software, and control solutions that allow the execution of a mission without human intervention [1]. Now, they are rapidly becoming popular in the oceanographic sampling community. Compared with ship-based sampling, the convenience of deployment, low operational cost, and high-resolution sampling capabilities have made them valuable tools for studying the mysterious ocean.

The control problem of underactuated AUV has been one of the active research areas because of its intrinsic nonlinear feature and practical requirements [2]. When the vehicle is sailing under water, the motions of six degrees have coupling phenomenon. In addition, it usually has characteristics of large inertia and time delay, nonlinearity, and underactuation. And its tracking motion is strongly influenced by the hydrodynamic perturbations, as well as the effects of wave and current flow disturbances, which make steering control becomes a difficult task [3]. On the other hand, the motion parameters of AUV are necessary factors which have to be considered, for example, turning rate and turning radius [4]. Otherwise, the control accuracy of path following will be reduced a lot if these parameters reach the design limit [5]. It can be concluded that the design of tracking controller with high performance is always complicated and the relative research work is significant.

At present, some certain theoretical study results have been achieved on the AUV tracking control, for example, the fuzzy slid mode control, the neural network adaptive PID, the backstepping control, and so on [6]. Zhou et al. [7] designed three adaptive neural network controllers which are based on the Lyapunov stability theorem to estimate uncertain parameters of the vehicle’s model and unknown current disturbances. These controllers are designed to guarantee that all the error states in the path following system are asymptotically stable. Lapierre and Jouvencel [8] designed a kinematic controller and extended it to cope with vehicle dynamics by resorting to backstepping and Lyapunov-based techniques. To a certain extent, these control algorithms have obtained some achievements. However, when it comes to solve the disturbance problems of exterior interfere and uncertain model of AUV, the above control algorithm is incapable of realizing high performance [9]. In this paper, an algorithm called SVR-ADRC is proposed to deal with these two disturbances.

Considering the particular steering scheme, obvious nonlinearity, and disturbance sensitivity, a new adaptive ADRC control algorithm using the support vector regression is designed to improve the control performance. In the case of path following, the interior and exterior disturbances can be estimated by a designed ESO observer [10]. Then, this estimation is used to compensate the NLSEF in order to improve control quality, where a tracking differentiator (TD) is designed to arrange the transient process and optimize control command. The system output at next sampling point can be obtained by using the support vector regression identification method which takes account of nonlinear relationship between control input and system output. Meanwhile, the local linearization and control parameter self-turning of ADRC are realized based on the quadratic performance index; thus the adaptability of ADRC will be greatly enhanced.

The idea of ADRC technique, which originated from PID control algorithm, is proved to be very effective because it does not entirely depend on mathematical model of the plant and it can compensate the internal and external disturbances dynamically [11]. Han [12] proposed the ADRC technique to improve the control performance in the real practice of engineering. Combining with the special nonlinear feedback structure, it can realize good control quality, such as small exceeding value, fast response, and strong robustness. And its algorithm is simple and suitable for digital realization. Additionally, a normally complex and nonlinear system can be considered as a linear set of integrators, which is easier to control with classical techniques. Some successful practical realizations of ADRC are presented in [13, 14].

In recent years, Support Vector Machines (SVMs) have been proposed as learning-from-samples tools for a number of problems, including classification and regression. Since Vapnik [15] firstly advanced SVM in the 1990s, it has been extensively applied and improved in solving classification and regression problems owing to its favorable performance [16]. Compared with neural network (NN) algorithm, SVM has several merits. Firstly, unlike NN using empirical risk minimization (ERM), SVM is based on the criteria of structural risk minimization (SRM), so that better generalization ability can be achieved. Secondly, SVM can guarantee a global solution by adopting convex quadratic programming, while NN is apt to fall into local optimization. Thirdly, SVM can easily overcome the curse of dimensionality with the help of kernel function, which is much too difficult for NN in [13].

In the traditional ADRC method, the adaptability in different operating environments cannot fulfill the accuracy required, because the parameters of NLSEF in ADRC method will be obtained as a fixed constant which is achieved based on experience or experimental data. Even in different operating conditions, the vehicle will also move under these set parameters, where some unexpected tracking error happens. If the parameters in NLSEF are changed by some method according to different conditions which mean different parameters and nonlinear fitting functions, the effect of controller will be improved obviously. In practical implementation, this method can be realized easily.

This paper makes an effort to apply SVR-ADRC to the path following for an underactuated AUV with the disturbances of ocean current and model uncertainty. Firstly, the desired course angle for the steering control is derived by using line-of-sight (LOS) guidance law. Secondly, the ultimate control command is computed with ADRC, where the coefficients of NLSEF in SVR-ADRC are adjusted along with output of SVR to deal with nonlinear variations at different operating points. Finally, the computer simulation proves that the controller has satisfying path following characteristics, including high accuracy and strong robustness.

#### 2. Math Model

It is well known that establishing an accurate dynamic model of AUV is of prime importance for their maneuvering prediction and control application. The notation of math model according to SNAME is mentioned in Table 1.

From the control viewpoint, the 6-degree-of-freedom (DOF) nonlinear dynamics of AUV, together with hydrodynamic coefficients’ uncertainties, makes underwater vehicles a challenging system to be accurately modeled and controlled [17]. Research solutions based on the AUV dynamic model such as control algorithm, fault tolerance schemes, and navigation systems have great importance for them moving through an unknown, unstructured, and probably dangerous environment.

##### 2.1. AUV Model

The modeling method chosen in this work is a geometrical-based analysis. It consists mainly of finding the parameters of the model, which have been well defined through physical laws describing the motion of a rigid body in a liquid environment.

The dynamic model of AUV can be simplified as

The kinematic equation is

In this mathematical model, and represent the length and mass of AUV, respectively, , and are the inertia moment, and , and are the translational velocity and the angular velocity in body-fixed coordinates.

##### 2.2. SVR-ADRC

###### 2.2.1. SVR

The SVR maps the input space to the high-dimensional feature space, and, in the feature space, the optimal linear regression function is built; then it can infer the output of any input. So, it has described the nonlinear relationship between the input and output space. With the training sample set, the regression function, which is used to describe the nonlinear relationship between the input and output space, is built by the machine learning; then the regression estimation of the output is finished.

Given a set of data points, , such that is an input and is a target output, where . Choosing a nonlinear mapping function , then map the input space to the high-dimensional feature space ; thus the optimal linear regression function can be constructed in . Consider

Based on the theory of SVM, the optimized goal can be achieved, and the standard form of SVR is

Then, an optimization problem has been formed:

In order to solve the dual problem of (5), we introduce a kernel function as follows:

Then, the solution of original problem can be described as follows:

Considering the stability of the system, we design as follows:

Combined with (3), the approximate function can be rewritten as

###### 2.2.2. ADRC

ADRC is a relatively new control design concept and method. It is well known that the primary reason for using feedback control is to deal with the variation and uncertainties of the plant dynamics and unknown disturbance from the outside [18].

The controller contains a tracking differentiator (TD), an extended state observer (ESO), and a nonlinear state error feedback (NLSEF) [19].

The TD is functioned as below. The input signal passes through it and there are two or more outputs; one output signal is tracking the input signal, and other signals are the th order differential signals.

The input signal is assumed as ; then the tracking differentiator which is derived from second-order nonlinear system can be shown as follows:

With the parameter increasing, the solution of (10) is which is adequately approaching to the input signal in any finite time. Thus, which is the differentiation of can be regarded as the differential signal of input signal .

Usually (10) can be changed as follows: where is a time optimal integrated function, whose detailed expression is described as follows:

The ESO was first proposed by Han [12] to estimate the total dynamics online, including the internal nonlinear dynamics and the external disturbance. The ESO is a primary component in ADRC. The observer results determine the control quality. A three-order ESO for the path following system is proposed as follows: where and are usually chosen as functions, and the specific form is shown as

In the ESO, can track the real-time action variable of the acceleration in an open loop system. If the system has observability, and the acceleration is playing a role in it, the effect will be reflected in the output of the system; accordingly, the action variable may be extracted from output information. Based on the estimated value of state of the extended state observer, control variable can be regarded as (15) if the parameter is known. Consider where is an initial control variable and is a predefined parameter.

The disturbance is assumed as in Figure 1. and are state variables of AUV; meanwhile is the real-time action variable of unknown disturbance and uncertain model.

If can compensate for the uncertain model and disturbance and define

Considering , , the nonlinear allocation can be written as follows [20]: where and are control gains.

###### 2.2.3. The Control Algorithm of SVR-ADRC

Complicated structure, multiparameter, and lacking adaptivity are the disadvantages of ADRC. It is known that the parameters of TD and ESO own a big range of adaptability, but the coefficients of ELSEF need to be adjusted according to different system work plot.

The quadratic performance index is defined as where and are, respectively, weighted coefficient of the tracking error and control increment, is control expectation, and is the output of control.

The relationship between the input and output can be expressed as where is the system disturbance.

Define .

Then, (9) can be described as follows:

With the help of practical linearization, a new formula can be obtained as follows: where

Considering the form of , we define

Thus, (18) can be changed to where

#### 3. The Path Following Control of SVR-ADRC

##### 3.1. The Steering Control Based on SVR-ADRC

is the transient process time of tracking control. Combined with (10), the relationship between and can be written as follows: Then, the max tracking velocity of the system can be calculated as

Considering the relationship between the max tracking velocity and limit turning rate, a coordinated control scheme can be designed for the steering control system.

The function of steering control is to adjust the rudder angle of AUV based on the system input [21]. The SVR can establish a function that describes the relationship between the input and output,* *and, with the function we obtained, the quadratic index can provide the parameters of NLSEF to improve the performance of it.

According to Figure 2, there are four parts, and they are, respectively, TD (30), ESO (31), NLSEF (33), and SVR (32), which provides the variables and as the time changes by combining (26). Consider where is heading-turning rate coordinated control, and are filter parameters, and and are optimal parameters.

Finally, the control input can be calculated as

##### 3.2. The Path Following Control Based on SVR-ADRC

Based on the steering controller designed above, the path following control architecture can be obtained as in Figure 3. The path following controller consists of two parts: the controller of inner loop is the steering controller and the other is the outer loop which is used to calculate the heading command for the inner loop from the path following error.

The inner loop controller is the steering controller which is introduced above, and the outer loop can provide the actual heading command with the path command and path error obtained by using the path planning and the position information calculated from the output of the system. Then, actual heading command is regarded as the input of the steering controller. Through the steering controller, the actual heading command can be translated into the rudder angle of AUV, and thus the AUV can achieve the expect motion.

Like the steering controller, the path error controller also has four parts, which are, respectively, TD (35), ESO (36), NLSEF (38), and SVR (37) which provides the variable and as the time changes by combining (26). Consider where and are optimal parameters, is the path following error, and is the expected path error. Consider

Finally, combined with the azimuth angle and modified heading command , the actual heading command can be obtained as follows:

#### 4. Simulation Results

Two simulations are designed to demonstrate the path following control of the AUV using the SVR-ADRC and ADRC, respectively. In the simulation, the depth of AUV is assumed to have no change and keep moving in the horizontal plane with the ocean current disturbance, while the vehicle implements the searching and patrolling mission. Furthermore, there is a current interference around the AUV during mission. The current speed is 0.2 m/s, and its angle is 0 degree (NED).

It is assumed that the speed of AUV is 2 knot, initial position is at (0, 0) m, and initial heading is 50 degree. The planned path, which consists of three semicircles and four straight lines, starts at the point of (10, 100) m and the end is at (460, 100) m.

Figure 4 shows that the path following result by SVR-ADRC is better than ADRC. The position errors in the simulation of ADRC are more obvious, especially in the connected areas between the semicircle and straight line. In these areas, surge and overshoot happen. At (10, 600) in Figure 4, there is about 2.5 m distance error the same as the points (310, 600) and (160, 100). Additionally, during the path following mission, the position errors are decreased at the connected points (160, 600), (310, 100), and (460, 600). However, the position errors are eliminated with the help of SVR-ADRC.

Figure 5 shows the heading responses of the AUV with the two control algorithms. With ADRC, the heading angle has relatively obvious oscillation which becomes worse when the vehicle is turning around near the connected point. Without changing the parameters of the ADRC, the efficiency, accuracy, and rapidity of the path following could be affected under current and trajectory transition on path. On the contrary, the heading response using the SVR-ADRC algorithm indicates better feasibility and robustness. It is demonstrated that the robustness and adaptability are guaranteed by SVR-ADRC which can regulate the parameters in ADRC.

Figure 6 shows different position errors in the path following mission. Combined with Figure 1, it is indicated that the improvement of SVR-ADRC is significant. Figure 7 shows that the SVR-ADRC has better interference immunity than the ADRC. When the AUV is coming close to the first point and moving onto the path, the rudder angle will be actuated obviously and has a big overshooting with ADRC. But, with the other control algorithm, it shows characteristics of stability, continuity, and robustness.

Compared with sliding mode control method from Figure 4 to Figure 7, the advantages of SVR-ADRC are obviously shown in stability, rapidity, and accuracy. Though the sliding mode control also has good performance in some aspects, it has not restricted the negative effect resulting from the external disturbance. In simulation, the chatting phenomenon is very obvious with sliding mode control method.

In Figure 8, and are changed along with the path following control. When it does not arrive at the semicircle, the current has almost no influence on the heading of AUV, so the parameters have almost no changing. On the other hand, when the vehicle is moving to the starting point and on the semicircle, the current has a big influence on the motion of AUV, so the parameters change rapidly. In addition, the changing parameters in Figure 8 prove that the SVR is stable and convergent.

In Figure 9, and are also changed along with the path error control. Like and in the steering control, when the AUV is sailing in the straight line, there is no current influence, so the parameters have little fluctuate. On the other hand, when it is sailing in the semicircle, the ocean current leads the parameters of NLSEF in Figures 8 and 9 to have fluctuation.

#### 5. Conclusions

In this paper, a nonlinear path following in the horizontal plane for an underactuated AUV in the presence of ocean current and uncertain parameters of ADRC was proposed by using the simulation result. The designed SVR-ADRC path following controller is robust to nonlinear motion, the model parameter perturbations, and the external disturbances. The continuous changing parameters of NLSEF by SVR in the simulation indicated its stabilization and a better control effect. Finally, the simulation experiments had also verified good following result. And it was demonstrated that the proposed algorithm had high accuracy, strong robustness, and stability. In the future work, the SVR-ADRC will be applied to path following in 3D. In addition, more attention should be paid to the development of SVR-ADRC.

#### Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

#### Acknowledgments

This work was supported by the National Natural Science Foundation of China (NSFC-51179038), Program for New Century Excellent Talents in University in China (NCET-10-0053), and the Fundamental Research Funds for the Central Universities (HEUCFX041401).