Distributed Cooperative Sliding Mode Fault-Tolerant Control for Multiple High-Speed Trains Based on Actor-Critic Neural Network
This article investigates the cooperative fault-tolerant control problem for multiple high-speed trains (MHSTs) with actuator faults and communication delays. Based on an actor-critic neural network, a distributed sliding mode fault-tolerant controller is designed for MHSTs to handle actuator faults. To counteract the negative effects of unknown disturbances and time delays on the train control system, a distributed radial basis function neural network (RBFNN) with an adaptive compensation term for the approximation error is designed to approximate the nonlinear disturbances and to predict the time delay. Using the tracking error computed online, an actor-critic structure built on the RBFNN estimates the switching gain of the distributed controller, which reduces the chattering caused by sliding mode control. The global stability of the closed-loop system and the ultimate boundedness of all its signals are established by rigorous mathematical proof. Simulations show that the proposed method is more effective and robust than the other fault-tolerant control methods considered, which supports the safe operation of MHSTs under moving block conditions.
1. Introduction
In recent years, high-speed trains (HSTs) have become increasingly popular because of their high speed, high efficiency, and low energy consumption . With the rapid development of HSTs, the requirements for system reliability and safety are becoming stricter. During train operation, actuators in the traction and braking systems are frequently exposed to high temperature and violent vibration, so they may fail after long periods of high-load operation. If the failures are not detected and treated in time, they may delay or stop trains; in critical conditions, they may even lead to derailment, overturning, and other major accidents . Thus, a fault-tolerant control scheme is urgently needed to guarantee the safe operation of the train control system.
In recent years, various fault-tolerant control schemes have been developed for a single train, including backstepping control , neural network control , and sliding mode control (SMC) . In , an adaptive backstepping fault-tolerant control scheme was designed for HSTs with unknown parameters, actuator faults, and disturbances, and a piecewise time-varying indicator function was used to describe the train motion model. In , a neuroadaptive fault-tolerant control method was proposed for HSTs with actuation notches and antiskid constraints, and the radial basis function neural network (RBFNN) was used to learn the nonlinear parameters of the system. In , an adaptive sliding mode fault-tolerant control (SMFTC) scheme was designed to handle actuator uncertainties and faults of HSTs simultaneously, and a dynamic model with input distribution matrix uncertainty was established to describe the properties of the train system. With the increasing pressure of urban traffic, the number of HSTs is gradually increasing, and single-train operation control methods no longer meet efficiency and safety requirements, resulting in frequent train delays and passenger dissatisfaction . To improve the operating efficiency and capacity of the rail transit system, cooperative control among multiple high-speed trains (MHSTs) has attracted widespread attention . For MHSTs, the purpose of cooperative control is to coordinate train behaviour based on more information, using an intelligent control algorithm to coordinate each train's state, speed, and next action so that no accidents occur between neighboring trains . 
In , an adaptive fuzzy fault-tolerant control scheme with proportional and integral-based sliding mode technique was proposed for MHSTs with actuator faults, and a novel fuzzy system with minimal computational burden was designed to approximate the unknown disturbances. In , a distributed cooperative control method was proposed for MHSTs under moving block mode, and a virtual marshalling topology was used to establish the cooperative control model. In , a robust distributed controller with disturbance observer was designed to solve the cruise control problem of MHSTs under external disturbances, and the observer topology was used to approximate the unknown disturbances.
In addition to controller design, cooperative control for MHSTs places high demands on communication, because packet losses and time delays occur during data transmission . These difficulties may worsen considerably as speed increases in real engineering . To address these problems, in , a recursive filter was designed for train control systems with measurement noise and packet dropouts, and the upper bound of the filtering error was obtained by deriving the filter gain. In , a T-S fuzzy model was established to describe the nonlinear train networked control system, and the least mean square algorithm was used to predict the network delays to improve the control performance. So far, few works have addressed cooperative control for MHSTs while considering actuator faults and communication delays together. Therefore, it is important to design an appropriate distributed fault-tolerant control scheme that handles both actuator faults and time delays for MHSTs in order to improve operating efficiency and safety. In control theory, owing to its strong robustness, SMC has been widely applied to the cooperative control of complex nonlinear systems such as multilateral telerobotics , multiple quadrotors , and MHSTs . However, the application of conventional SMC is limited in practice mainly by its chattering phenomenon. To address this drawback, parameter tuning methods such as fuzzy inference  and reinforcement learning  have been used to adjust the control gains and thereby reduce the chattering caused by the switching control term of SMC. 
In view of the problems above, our main contributions are summarized as follows:
(1) Considering the complexity of the actual operating conditions, a method that combines the distributed SMFTC with neural networks is proposed for the cooperative control of MHSTs with actuator faults and communication delays.
(2) With a compensation mechanism for the approximation error, a distributed RBFNN is designed to deal with the unknown nonlinear disturbances from train traction and braking, and a time delay prediction model based on the RBFNN is established to predict the forward channel delay between controller and actuator.
(3) Based on the currently observed input-output data, an actor-critic neural network is designed to tune the switching gains of the distributed sliding mode controller adaptively in order to overcome the chattering problem.
The remainder of this article is organized as follows. In Section 2, the problem formulation for MHSTs with actuator faults is discussed. In Section 3, a distributed collaborative SMFTC scheme with an actor-critic neural network is designed, and the closed-loop stability is analyzed. The simulation results and the conclusions are presented in Sections 4 and 5, respectively.
2. Problem Formulation
In this section, the dynamic equation of MHSTs is described, the actuator fault model is given, and the constraints on cooperative control of MHSTs are introduced.
2.1. Dynamic Model of MHSTs
Considering the operation process of MHSTs, their dynamic model is described as follows: where is the number of trains running on the same railway, and are the position and speed of train , respectively, is the acceleration factor, is the unit control force of train , and and are the unit basic resistance and additional resistance of train , respectively, which can be described as follows : where , , and are the time-varying resistance coefficients, which are closely related to the operation condition of the train, and , , and are the gradient angle, curved track radius, and tunnel length, respectively.
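The displayed model equations are not reproduced in this text, so the following is only a minimal sketch of one common form that matches the definitions above: position integrates speed, and acceleration equals the acceleration factor times the unit control force minus the unit basic and additional resistances. The quadratic (Davis-type) basic resistance, the coefficient values, the forward-Euler step, and all function and parameter names are assumptions made for illustration, not the authors' exact model.

```python
def basic_resistance(v, a=0.8, b=0.008, c=0.0016):
    """Unit basic resistance as a quadratic in speed.

    The quadratic form and the default coefficients are illustrative
    assumptions; the article treats the coefficients as time-varying.
    """
    return a + b * v + c * v * v


def step_train(x, v, u, xi=1.0, f_add=0.0, dt=0.1):
    """One forward-Euler step of x' = v, v' = xi * u - f_basic(v) - f_add.

    x, v  : position and speed of one train
    u     : unit control force (traction positive, braking negative)
    xi    : acceleration factor (hypothetical default)
    f_add : unit additional resistance from gradient, curve, and tunnel
    """
    accel = xi * u - basic_resistance(v) - f_add
    return x + v * dt, v + accel * dt


if __name__ == "__main__":
    # A control force that exactly balances resistance keeps the speed constant.
    x, v = 0.0, 10.0
    u_balance = basic_resistance(v)
    for _ in range(5):
        x, v = step_train(x, v, u_balance)
    print(x, v)
```

Because the resistance grows with the square of speed in this sketch, the force needed to hold speed rises quickly at high speed, which is the nonlinearity the controller later has to approximate.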
2.2. Actuator Fault Model
As a train runs for a long time, many components in the traction control system suffer performance degradation to varying degrees, which causes various faults . Among them, the most common is partial failure of the actuator. If it is not dealt with in time, the tractive and braking forces can easily deviate from their expected values, resulting in various train accidents . In general, the dynamic model of MHSTs with actuator partial failures can be described as follows: where is the actuator effective factor of train , which can be described as follows: where is the fault occurring time instant, . It follows from (4) that, after an actuator failure, the control force deviates from the expected value, which affects the stability of the train system.
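As a hedged illustration of a partial loss of effectiveness, the sketch below scales the commanded force by a factor that equals 1 before the fault time and a constant in (0, 1] afterwards. The constant post-fault value, the default numbers, and the function names are assumptions; the article's effective factor may vary with time after the fault.

```python
def effectiveness(t, t_fault, rho_after):
    """Actuator effective factor: 1 before the fault, rho_after from t_fault on.

    rho_after is assumed constant and must lie in (0, 1]; this is a
    simplification chosen for illustration.
    """
    if not 0.0 < rho_after <= 1.0:
        raise ValueError("rho_after must lie in (0, 1]")
    return 1.0 if t < t_fault else rho_after


def delivered_force(u_cmd, t, t_fault=50.0, rho_after=0.7):
    """Force actually applied by a partially failed actuator."""
    return effectiveness(t, t_fault, rho_after) * u_cmd


if __name__ == "__main__":
    for t in (10.0, 49.9, 50.0, 80.0):
        print(t, delivered_force(100.0, t))
```

A fault-tolerant controller does not observe the factor directly; it sees only that the train responds less strongly than the command would imply.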
2.3. Constraints on Cooperative Operation of MHSTs
In order to ensure the operation safety of MHSTs on the same line, there must be a certain safety interval between each train . As shown in Figure 1, when the following trains run on the same line, the radio block center (RBC) can collect real-time information such as train speed and position via the global system for mobile communications for railways (GSM-R). Afterwards, the RBC transmits the information to the vehicle equipment of the neighboring train to activate the distance monitoring module so that the multiple trains can calculate the minimum safe distance and track the preceding trains under moving block mode.
Under the moving block condition, the between neighboring trains is given as follows:where is the additional safety distance, is the braking distance, is the proper redundant safe distance, and the is the length of the train, as shown in .where is the running time of the train after braking, is the actual speed of the train, and are the initial and final speeds of the train in the sampling time during braking, respectively, is the average acceleration of the train in the sampling process, and is the constant coefficient.
In the process of cooperative operation of MHSTs, the constraint conditions required by the train control system can be described as follows :where and are the actual running distance of the preceding and following trains, respectively.
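The distance formulas themselves are missing from this text. The sketch below therefore only illustrates the structure described above, under two stated assumptions: the minimum safe distance is the sum of the additional safety distance, the braking distance, the redundant safe distance, and the train length, and the braking distance follows the constant-deceleration relation v²/(2a). All names and numbers are hypothetical.

```python
def braking_distance(v, decel):
    """Distance to stop from speed v at constant deceleration (assumed model)."""
    if decel <= 0.0:
        raise ValueError("decel must be positive")
    return v * v / (2.0 * decel)


def min_safe_distance(v_follow, decel, l_extra, l_redundant, l_train):
    """Assumed additive form of the moving-block minimum safe distance."""
    return l_extra + braking_distance(v_follow, decel) + l_redundant + l_train


def headway_ok(x_preceding, x_following, d_min):
    """True when the gap between neighboring trains is at least d_min."""
    return x_preceding - x_following >= d_min


if __name__ == "__main__":
    d_min = min_safe_distance(40.0, 1.0, 50.0, 100.0, 200.0)
    print(d_min, headway_ok(5000.0, 3000.0, d_min))
```

Under these assumptions the braking term dominates at high speed, so the admissible headway has to grow roughly with the square of the following train's speed.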
3. Design of Distributed Collaborative SMFTC Scheme Based on Actor-Critic Neural Network
Under the condition of the actuator faults and communication delays, the control objective is to design a distributed cooperative fault-tolerant control scheme, which enables MHSTs to accurately track the desired speed and position so that the headway distances of each train with its neighbors are maintained in proper ranges. The diagram of the proposed fault-tolerant control scheme is illustrated in Figure 2.
Figure 2 shows that the distributed controller sends the control law and forward timestamp into a process data packet to the train traction system via the multifunction vehicle bus (MVB) network. Under the partial failure conditions, the actuators in the traction system execute the control law and record the actual forward channel time delay . Then, the sensors detect the actual output of the train at regular intervals and send , feedback timestamp , and into a process data packet to the train control system via the MVB network. Further, the distributed controller calculates the latest control law combined with the key information including the position information of other trains from the RBC and GSM-R, the gain parameters from the actor-critic neural network, the disturbance estimation and time delay prediction from the distributed RBFNN, and the ideal reference input.
3.1. Design of Distributed Sliding Mode Fault-Tolerant Controller
In order to realize the accurate tracking of the first train to the desired position and speed, the tracking error signal can be defined as follows:where and are the position and speed tracking error of the first train, respectively, and are the desired position and speed of the first train, respectively, and and are the actual position and speed of the first train, respectively.
For tracking error signal (8), the following sliding surface is designed:where .
The time derivative of is derived as follows:
Using (1) to substitute for , (10) can be rewritten as follows: where , is the unknown nonlinear function. As can be seen from (2), as the train speed increases, the nonlinear characteristic of becomes more pronounced, which makes the controller more difficult to design.
Owing to its simple structure, fast learning speed, and strong approximation ability, the RBFNN is widely used in real-time control . To remove the dependence of the controller on the train parameters, we adopt the RBFNN to approximate the during the train traction and braking process. Based on the Lyapunov theory, we use the train speed information collected by sensors to train the RBF neural network online. The neural network algorithms are described as follows: where is the input of the network, is the number of input nodes, is the number of hidden layer nodes, is the output of the Gaussian function, is the center vector of the hidden layer neurons, is the width of the Gaussian function, is the ideal weight, and is the approximation error and meets .
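To make the error and sliding-surface definitions concrete, here is a minimal sketch for the first train. It assumes the errors are "desired minus actual" and that the surface is the linear combination s = c·e1 + e2 with c > 0; the sign convention and the names are assumptions, since the displayed equations are not available in this text.

```python
def tracking_errors(x_d, v_d, x, v):
    """Position and speed tracking errors (desired minus actual, assumed sign)."""
    return x_d - x, v_d - v


def sliding_surface(e_pos, e_vel, c):
    """Linear sliding surface s = c * e_pos + e_vel with design gain c > 0."""
    if c <= 0.0:
        raise ValueError("c must be positive")
    return c * e_pos + e_vel


if __name__ == "__main__":
    e1, e2 = tracking_errors(x_d=100.0, v_d=20.0, x=98.0, v=19.0)
    print(sliding_surface(e1, e2, c=0.5))
```

With this form, and with the speed error equal to the time derivative of the position error, holding s at zero forces e2 = -c·e1, so the position error decays exponentially at rate c.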
To achieve this control objective, we define the input of RBFNN as , and the actual output of RBFNN is expressed as follows:where is the estimated weight; we define .
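The forward pass of a Gaussian RBF network of the kind described above can be sketched as follows. The sketch uses the standard Gaussian basis exp(-||x - c_j||² / (2 b²)) with a shared width and a linear output layer; the centers, width, and weights are hypothetical, and the Lyapunov-based weight adaptation from the article is deliberately not reproduced.

```python
import math


def gaussian_features(x, centers, width):
    """Hidden-layer outputs h_j = exp(-||x - c_j||^2 / (2 * width^2))."""
    feats = []
    for c in centers:
        dist_sq = sum((xi - ci) ** 2 for xi, ci in zip(x, c))
        feats.append(math.exp(-dist_sq / (2.0 * width ** 2)))
    return feats


def rbf_output(weights, features):
    """Network output as the weighted sum of hidden-layer outputs."""
    return sum(w * h for w, h in zip(weights, features))


if __name__ == "__main__":
    centers = [[0.0], [40.0], [80.0]]   # hypothetical centers over speed in m/s
    weights = [0.8, 3.5, 11.0]          # hypothetical estimated weights
    h = gaussian_features([40.0], centers, width=20.0)
    print(rbf_output(weights, h))
```

Each hidden node responds most strongly when the input is near its center, so the estimate at a given speed is dominated by the weights of the nearest centers.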
For the multitrain system (3), in terms of (11) and (13), the fault-tolerant controller of the first train is designed in the following form:where and are constructed in the following form:where and .
If the control law is designed as (16), the adaptive laws of the first train will be designed as follows:where , , and .
In order to realize the cooperative control of MHSTs, we assume that all states of MHSTs are measurable and define the tracking error signal of the following trains aswhere is the desired distance headway of each train with its neighbors.
Based on the tracking error signal (18), the sliding surface is designed as follows:where .
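For the following trains, a plausible reading of the error definition is that each train tracks its immediate predecessor at the desired headway. The sketch below computes the position and speed errors along the chain under that assumption; the exact expression in the article is not available here, and the function name is hypothetical.

```python
def headway_errors(positions, speeds, d_headway):
    """Errors of trains 2..n relative to their immediate predecessors.

    positions and speeds are ordered from the first (leading) train to the
    last. The position error is the gap minus the desired headway; the speed
    error is the speed difference to the predecessor (assumed definitions).
    """
    errors = []
    for i in range(1, len(positions)):
        e_pos = positions[i - 1] - positions[i] - d_headway
        e_vel = speeds[i - 1] - speeds[i]
        errors.append((e_pos, e_vel))
    return errors


if __name__ == "__main__":
    print(headway_errors([3000.0, 2000.0, 990.0], [80.0, 80.0, 79.0], 1000.0))
```

Each error pair can then be combined into a sliding surface in the same way as for the first train, so every follower needs only its predecessor's state rather than the state of the whole fleet.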
The derivative of can be expressed as follows:
For the convenience of calculation, we define the and as follows:where and .
Referring to the same design idea as (16), the control law of the following trains is designed as follows:and the adaptive parameters are updated as follows:where , , and .
3.2. Stability Analysis
Theorem 1. Consider the dynamic model (3) and constraint conditions (7) of MHSTs with actuator faults and communication delays. If the sliding surfaces are selected as in (9) and (19), the distributed control laws are designed as in (16) and (23), and the adaptive parameters are updated according to (17) and (24), then all signals of the closed-loop system are bounded and the position and speed tracking errors converge to zero. Furthermore, the whole closed-loop system is ultimately stable.
Proof of Theorem 1. The Lyapunov function candidate is selected as follows:where , , and .
The derivative of can be calculated as follows:Substituting (22) into (26), we have the following:Substitute (23) and (24) into (27), thenWhen meets , , then .
As and , is bounded, then , , , and are bounded. According to , the following inequality can be obtained:From (29), we obtainWhen , is bounded because of the boundedness of . According to the Barbalat lemma , as , then , . The proof is completed.
3.3. Design of Actor-Critic Neural Network
In sliding mode controller design, ensuring the stability of the closed-loop system under large disturbances requires a large switching gain, which causes chattering. In (28), the switching gain is used to compensate for the uncertain disturbances to ensure . Because the unknown disturbances are time-varying, the switching gain should also be time-varying in order to reduce chattering. Because of their fast convergence and good optimization ability, reinforcement learning algorithms are widely used in the field of artificial intelligence . Among them, the actor-critic learning algorithm is often used for policy approximation in continuous control problems . To overcome the problem mentioned above, we use the RBFNN to construct a reinforcement learning algorithm with an actor-critic structure to estimate the online, so as to counteract the unknown disturbances; the train speed and speed tracking error are used to train the actor-critic neural network.
3.3.1. Design of Critic Neural Network
The state value function is defined as follows:where is the estimate value of the ideal weight , is the Gaussian basis function, and is the input of network with .
Considering the influence of the system tracking error on control performance, the utility function is defined as follows: where is the enhanced signal coefficient of the tracking error and is the enhanced signal of the tracking error, which is given as follows : where is the error tolerance. When the error function is less than , the tracking performance is regarded as ideal; otherwise, the tracking performance is regarded as poor.
The temporal difference (TD) error measures the quality of the decisions made by the actor neural network and is defined as follows: where is the discount factor with . If the is small, the agent is more concerned with maximizing the immediate return. In contrast, as the approaches 1, the agent gives more weight to future returns.
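The reinforcement signal and TD error can be illustrated with the short sketch below. It assumes a common encoding in which the signal is 0 inside the tolerance band and -1 outside it, scales it by a coefficient to form the utility, and uses the standard one-step TD error r + γ·V(next) − V(now). The encoding and all names are assumptions; the article's exact expressions are not available in this text.

```python
def reinforcement_signal(error, tolerance):
    """0 when the tracking error is within tolerance, -1 otherwise (assumed)."""
    return 0.0 if abs(error) <= tolerance else -1.0


def utility(error, tolerance, coeff=1.0):
    """Utility as the scaled reinforcement signal; coeff is hypothetical."""
    return coeff * reinforcement_signal(error, tolerance)


def td_error(reward, v_next, v_now, gamma):
    """One-step temporal difference error with discount factor gamma in (0, 1)."""
    if not 0.0 < gamma < 1.0:
        raise ValueError("gamma must lie in (0, 1)")
    return reward + gamma * v_next - v_now


if __name__ == "__main__":
    r = utility(error=0.3, tolerance=0.1, coeff=2.0)
    print(r, td_error(r, v_next=-1.0, v_now=-1.5, gamma=0.9))
```

A TD error near zero means the critic's value estimate already agrees with what was observed, so neither network needs much correction at that step.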
The cost function of the system is defined as follows:
The partial derivatives of (36) are obtained as follows:
The partial derivatives of (35) can be obtained as follows:
According to the gradient descent method, the weight update law of critic neural network is given bywhere is the learning rate of .
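A minimal sketch of a gradient-descent critic update is given below. It assumes a linear-in-weights critic V = Wᵀh, the cost E = ½δ², and the usual semi-gradient simplification in which the bootstrapped target is treated as a constant, which gives the step W ← W + α·δ·h. The article's exact gradient expression may differ, and the names are hypothetical.

```python
def critic_value(weights, features):
    """State value estimate as a weighted sum of Gaussian basis outputs."""
    return sum(w * h for w, h in zip(weights, features))


def critic_update(weights, features, delta, lr):
    """Semi-gradient step on E = 0.5 * delta^2 for a linear critic.

    With the TD target held fixed, dE/dW = -delta * h, so gradient descent
    moves each weight by +lr * delta * h.
    """
    return [w + lr * delta * h for w, h in zip(weights, features)]


if __name__ == "__main__":
    w = [0.0, 1.0]
    h = [1.0, 0.5]
    w = critic_update(w, h, delta=2.0, lr=0.1)
    print(w, critic_value(w, h))
```

A positive TD error raises the value estimate for the current features and a negative one lowers it, in proportion to how strongly each basis function is active.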
3.3.2. Design of Actor Neural Network
In order to improve computational efficiency, the same RBFNN is used to learn value function and action function. Therefore, the action function of actor neural network is defined as follows:where is the estimated value of the ideal weight and is the estimation of the switching gain.
In general, the output of the actor neural network needs to be superimposed with a Gaussian signal to act on the sliding mode controller; the specific calculation method is as follows:where and is the actual output of the actor neural network.
Based on the gradient descent algorithm, the weight updating mode of the actor neural network can be obtained as follows:where is the learning rate of .
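The actor side can be sketched in the same spirit. The code below assumes that the applied gain is the actor's estimate plus zero-mean Gaussian noise, that the noise standard deviation shrinks as the critic's value grows according to 1/(1 + exp(2V)), and that the weights move along δ·(K − K̂)/σ·h. The noise schedule and the update form are common choices in actor-critic tuning and are assumptions here, not the article's confirmed equations.

```python
import math
import random


def exploration_std(value):
    """Assumed schedule: less exploration as the estimated value increases."""
    return 1.0 / (1.0 + math.exp(2.0 * value))


def sample_gain(k_hat, sigma, rng):
    """Applied switching gain: actor estimate plus Gaussian exploration noise."""
    return k_hat + rng.gauss(0.0, sigma)


def actor_update(weights, features, delta, k_actual, k_hat, sigma, lr):
    """Reinforce the explored deviation when the TD error is positive."""
    scale = lr * delta * (k_actual - k_hat) / sigma
    return [w + scale * h for w, h in zip(weights, features)]


if __name__ == "__main__":
    rng = random.Random(0)              # fixed seed for repeatability
    weights, features = [1.0, 2.0], [0.6, 0.2]
    k_hat = sum(w * h for w, h in zip(weights, features))
    sigma = exploration_std(value=-0.5)
    k = sample_gain(k_hat, sigma, rng)
    print(actor_update(weights, features, 0.4, k, k_hat, sigma, lr=0.05))
```

If a perturbed gain leads to a better-than-expected outcome (positive TD error), the estimate shifts toward that gain; if the outcome is worse, it shifts away.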
4. Simulation Results
In order to analyze the performance of the above distributed cooperative SMFTC method, we select four CRH3 trains as the controlled object. The main parameters of the CRH3 train are as follows : the speed range is 0∼350 km/h, the continuous running speed is 300 km/h, the total weight of the train is 400 tons, the train length is 200 m, and the rotary mass coefficient is 0.06. The reference input is set as the actual working conditions of the train including traction, braking, and inertia. Referring to the literature , the task cycle is set as 50 ms, the sampling cycle is set as 64 ms, and the load rate is set as 45%. The change of failure factors of each train actuator is set as (49). Control parameters are shown in Table 1.
4.1. Analysis of Cooperative Control Results of MHSTs
To illustrate the effectiveness of the proposed method, we use (5)–(7) to fully calculate the length of each train and the safety redundancy distance and other factors. The tracking interval distances between two neighboring trains are set as 1000 m. The initial state of each train is set as , , , , , , , , , , , and . The speed and position tracking results of MHSTs are shown in Figure 3.
Figure 3 shows that under the same desired speed and line conditions, the actual running speed of all trains is always the same. Thus, the headway distance between two neighboring trains is always maintained at 1000 m, which ensures the coordinated operation of MHSTs with actuator faults and communication delays under the moving block condition.
4.2. Comparison of Different Fault-Tolerant Control Methods under Actual Working Conditions
In this section, both fuzzy logic system and actor-critic neural network can be used to estimate the switching gain of sliding mode controllers. Among them, the design of the fuzzy logic system is more dependent on experience. Thus, with the increase of the parameters to be optimized, the design difficulty of fuzzy logic systems will be increased. In order to verify the advantages of the proposed method, we choose the adaptive fuzzy sliding mode fault-tolerant control (AFSMFTC) method  with , , , , , , , , , , , , , , , and and the adaptive sliding mode fault-tolerant control (ASMFTC) method  with , , , , , , , , , , , , , , , and as a comparison. The initial state of each train is set the same as in Section 4.1. Figure 4 shows the control effects of the first train using different fault-tolerant control methods. Figure 5 shows the speed tracking errors of MHSTs using different fault-tolerant control methods. Figure 6 shows the displacement interval errors of MHSTs using different fault-tolerant control methods. Table 2 shows the position interval error and speed tracking error of each train under different control methods recorded by root mean square error (RMSE) and mean absolute error (MAE). The RMSE and MAE are defined as follows:where is the total running time of the train , .
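For reference, the two error metrics can be computed from a sampled error sequence with the standard definitions: RMSE is the square root of the mean squared error, and MAE is the mean of the absolute errors. The sketch assumes uniformly sampled errors; the function names and sample values are illustrative only.

```python
import math


def rmse(errors):
    """Root mean square error of a non-empty sequence of sampled errors."""
    if not errors:
        raise ValueError("errors must be non-empty")
    return math.sqrt(sum(e * e for e in errors) / len(errors))


def mae(errors):
    """Mean absolute error of a non-empty sequence of sampled errors."""
    if not errors:
        raise ValueError("errors must be non-empty")
    return sum(abs(e) for e in errors) / len(errors)


if __name__ == "__main__":
    speed_errors = [0.3, -0.1, 0.05, -0.2]   # hypothetical samples in m/s
    print(rmse(speed_errors), mae(speed_errors))
```

RMSE penalizes occasional large deviations more heavily than MAE, so a controller that chatters or overshoots briefly tends to show a larger gap between the two metrics.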
As shown in Figure 4, in the traction stage, the speed tracking result of the proposed method is close to that of the ASMFTC method, whereas the AFSMFTC method has a large speed tracking error. This indicates that, as the number of trains increases, the fuzzy logic rule base becomes harder to design, the number of learning parameters increases sharply, and the dynamic performance of the system is seriously affected. In the braking and inertia stages, compared with the ASMFTC and AFSMFTC methods, the proposed method achieves smooth switching at the steady-state working point and offers accurate tracking and fast response. In addition, the proposed method uses the actor-critic neural network to estimate the switching gain of the sliding mode controller, which effectively reduces the chattering of the system; the fluctuation range of the control force is clearly smaller than with the other methods.
It can be seen from Figure 5 that, under partial failure of the actuators, the proposed method uses the RBFNN with error compensation and the actor-critic neural network to deal with the unknown disturbances and the controller tuning of the system, respectively. Thus, the speed tracking error is quickly driven into a small range, which guarantees smooth tracking of the desired speed. In contrast, because of their poorer fault suppression, the tracking errors of the other control methods oscillate frequently, leading to poor speed tracking.
Figure 6 shows that, compared with other control methods, the proposed method always keeps the position interval error between each train within a small range, which ensures that the following train can track the front train in real time and accurately under the moving block mode, thus solving the cooperative fault-tolerant control problems of MHSTs.
The data in Table 2 show that, under actual working conditions, the proposed method significantly reduces the position interval and speed tracking errors of each train and is more robust and stable than the other control methods, so it can accommodate the complex nonlinearities and time variations of MHSTs.
With the wide application of modern communication technology in train control systems, cooperative control of multiple trains has become a promising way to improve train running efficiency and system safety. Considering that there are currently few methods for fault-tolerant control of multiple trains, we extend the idea of fault-tolerant control for a single train to multiple trains and propose a distributed SMFTC method for multiple trains. The proposed method handles unknown actuator faults and system uncertainties at the same time by designing an adaptive compensation law. Compared with reference , this method does not need a neural network fault observer and is not affected by fault diagnosis error, so it deals well with the uncertainty caused by faults and simplifies the design of the fault-tolerant controller in . It should be noted that, on the basis of existing cooperative fault-tolerant control methods for multiple trains , the proposed method further considers communication delays during signal transmission, and a distributed neural network prediction model is introduced into the control scheme to compensate for the influence of network delay on the train control system.
In this paper, we have proposed a distributed SMFTC method based on the actor-critic neural network for MHSTs with actuator faults and communication delays. An adaptive compensation control law is designed to counteract the influence of unknown actuator faults on the train control system. The distributed RBFNN with a compensation mechanism for the approximation error is designed to deal with the external disturbance and the time delay of the system. By estimating the switching gain of the distributed sliding mode controller with the actor-critic neural network, a more precise control quantity is obtained, which reduces the damage caused by chattering. Simulations indicate that the proposed method effectively reduces chattering and tracks changes of the reference input quickly and accurately, which supports the safe operation of MHSTs. In addition, under the same fault conditions and operating environment, the controller parameters can be optimized through the online adaptive tuning mechanism of the actor-critic neural network, which general RBFNNs do not provide. Future research will focus on sensor failures and the cyber security of cooperative control for MHSTs.
5. Conclusions
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
This research was funded by the Natural Science Foundation of Liaoning Province (grant no. 20180551003).
Z. Mao, G. Tao, B. Jiang, X.-G. Yan, and M. Zhong, “Adaptive position tracking compensation for high-speed trains with actuator failures,” IFAC-PapersOnLine, vol. 50, no. 1, pp. 14266–14271, 2017.
D.-Y. Li, P. Li, W.-C. Cai, X.-P. Ma, B. Liu, and H.-H. Dong, “Neural adaptive fault tolerant control for high speed trains considering actuation notches and antiskid constraints,” IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 5, pp. 1706–1718, 2019.
X. G. Guo, J. L. Wang, and F. Liao, “Adaptive fuzzy fault-tolerant control for multiple high-speed trains with PI-based sliding mode,” IET Control Theory and Applications, vol. 11, no. 8, pp. 1234–1244, 2016.
B. Chen, “Development and reference of the TCMS on power centralized EMU abroad,” Railway Locomotive & Car, vol. 39, no. 1, pp. 7–14, 2019.
T. Zhang, “Real-time control method for communication network of high-speed EMU based on T-S fuzzy model,” China Railway Science, vol. 39, no. 3, pp. 93–99, 2018.
S. Y. Song, J. B. Hu, Y. Y. Wang, and X. L. Han, “Actor-critic learning algorithm for parameter tuning of sliding mode controller,” Electronics Optics & Control, vol. 27, no. 9, pp. 24–27, 2020.
B. Jang, Y. K. Wu, N. Y. Lu, and Z. H. Mao, “Review of fault diagnosis and prognosis techniques for high-speed railway traction system,” Control and Decision, vol. 33, no. 5, pp. 841–855, 2018.
D. C. Li, “Rearch on control strategy of high-speed train safety running under crosswind condition,” Dept. Mechatronic. Eng., Lanzhou Jiao Tong Univ., Lanzhou, China, 2019, Ph.D. dissertation.
Y. Y. Min and Y. G. Liu, “Barbalat Lemma and its application in analysis of system stability,” Journal of Shandong University (Engineering Science), vol. 37, no. 1, pp. 51–55, 2007.
K. Zhang, H. G. Zhang, Y. L. Cai, and R. Su, “Parallel optimal tracking control schemes for mode-dependent control of coupled markov jump systems via integral RL method,” IEEE Transactions on Automation Science and Engineering, vol. 17, no. 3, pp. 1332–1342, 2020.
L. S. Zhong, B. Li, J. Gong, Y. X. Zhang, and Z. M. Zhu, “Maximum likelihood identification of nonlinear model for high-speed train,” Acta Automatica Sinica, vol. 40, no. 12, pp. 2950–2958, 2014.
Z. Samir, M. Hemza, B. Abderrahmen, and D. Ali, “Actuator fault tolerant control using adaptive RBFNN fuzzy sliding mode controller for coaxial octorotor UAV,” ISA Transactions, vol. 80, pp. 267–278, 2018.