Abstract

A strategy for increasing the accuracy rate of internal combustion engine (ICE) fault diagnosis based on the probabilistic graphical model is proposed. In this method, a three-layer network with inference of probability is constructed, and both the material conditions and the signals collected from different engine parts are considered as the inputs of the system. Machine signals measured by sensors were processed in order to diagnose potential faults, which were presented as probabilities based on the components in layer 1, fault categories in layer 2, and fault symptoms in layer 3. The diagnosis model was built by using nodes and arcs, and the results depended on the connections between the fault categories and symptoms. The parameters of the network represented quantitative probabilistic relationships among all layers, and the conditional probabilities of each type of fault and relevant symptoms were summarized. Fault cases were simulated on a 12-cylinder diesel engine, and three fault types that often occur on ICEs were tested based on five different fault symptoms with different loads, respectively. The diagnostic capability of the method was investigated, reporting high accuracy rates.

1. Introduction

Internal combustion engines (ICEs) play a significant role in the energy supply of basic industrial applications, often working for long periods without stopping. Any fault in the ICE would interrupt the production process, bring significant economic losses, and even harm the operators on the spot if severe damage happens with explosion. Therefore, it is important to monitor the engine status during the running process for the fault diagnosis.

ICE fault diagnosis has been studied since the invention of ICEs, and over the past several decades, different methods have been proposed by researchers. Wavelet packet [15] has been developed and applied in the field of ICE fault diagnosis. A fault diagnosis system using wavelet packet transform and artificial neural network techniques has been proposed by Wu and Liu [6]. The application of wavelet transform in machine condition monitoring and fault diagnostics has been summarized by Peng and Chu [7]. Fault feature extraction using algorithm of sparse analysis [8, 9] and size estimation method [10] has been studied. A novel fault localization method has been proposed by Cui et al. [11]. Convolutional neural network [12] and Bayesian diagnosis method [13, 14] have been proposed. Several other methods, the Monte Carlo method [15], the fuzzy diagnosis method [16], and the support vector machine (SVM) method [17, 18], have been also applied in machine fault diagnosis with good performance. Additionally, novel approaches like Switching Unscented Kalman Filter method [19], Extreme Learning Machines [20], Online Dictionary Learning [21], and the discriminative nonnegative matrix factorization (DNMF) method [22] have been proposed. As a branch of applied artificial intelligence (AI), expert systems are also introduced in the field of fault diagnosis and have been built demonstrating good performances [2325].

Although some promising results have been achieved by previous studies, there is still a need for improvement in a number of aspects. Firstly, taking into consideration only a single type of signal as the diagnostic input cannot accurately reflect the running state of the engine. Due to the fact that one type of fault may lead to several symptoms, various state signals need to be collected from the machine and be used in the diagnostic process of the system. Secondly, only one kind of fault category has been reported for each method. An advanced system should be able to cope with common faults that may occur during the running process, and multiple faults can be diagnosed at the same time. Thirdly, there is no evidence that a specific engine symptom can be exclusively used as the unique signal of a fault, and thus a probability model should be introduced in the process able to describe the equipment state in a more precise manner than just indicating the existence or not of the fault. The probability of a signal that represents a sign of potential fault should be considered.

The inference results should be associated with a probability that can describe the state of the engine precisely. The probabilistic graphical model is a combination of the graph theory with the probability theory, which is suitable for dealing with complex systems [26]. In the present study, in order to overcome the aforementioned problems, a method for three kinds of ICE faults using a probabilistic graphical model is presented. The inference engine of the system was established based on the probabilistic graphical model. Combined information on the machine vibration and rotation speeds was simultaneously collected and used as the system input. Various fault symptoms that occurred together during the running of the engine were taken into consideration. Three fault categories were set as the outputs of the system during the experiments, and eight cases for each category were studied. The results demonstrated a high accuracy rate.

2. The Probabilistic Graphical Model

2.1. The Bayesian Theory

The probabilistic graphical model, consisting of a directed acyclic graph (DAG) with nodes and arcs, can model the causal relationships between several facts represented by nodes connected to each other with directed arcs since the occurrence probability of a node can be calculated by the independent probability and the conditional probability. The Noisy-OR gate was introduced in order to improve the diagnosis accuracy and enhance the robustness of the system. The leak Noisy-OR gate, a calculation module, can solve a problem with a large number of parameters by modeling the conditional probability, which, due to the complex working environment, is often extremely time consuming in the construction of the system.

Considering that event A and event B are independent to each other, the conditional probability of event A given event B is denoted as and is defined aswhere is the joint probability.

Assume that , , where S is a certain event, and hypotheses are mutually exclusive. When the events are a set of random variables, the probability of isand the posterior probability of is

A classic probabilistic graphical model can be seen in Figure 1, and there are three necessary requirements for the construction of the Noisy-OR gate model:(1)All nodes in the network must be binary, which means that every node can have only two states, True (T) or False (F).(2)For each node S and its parent nodes must be independent of each other.(3)For each parent node there is a probability of connection: , which is the probability of node S whose state is T when is T and other states of its parent nodes are F.

The probability that event S is in true state iswhere H is the entire parent set of nodes of S, and is the subset of H nodes that are in true state. However, if is the empty set, the value of would be zero. Due to the fact that there would be no zero probability in the industrial processes, the leak Noisy-OR gate was applied for the solution.

2.2. The Leak Noisy-OR Gate

It is possible that a fault of the machine can be a result of symptoms not included in the cause nodes, and the diagnosis system cannot be built explicitly with all possible causes. Thus, the leak probability should be included in the model [2729].

The leak Noisy-OR gate is introduced in the probabilistic graphical model as shown in Figure 2. All the events that are unclear are collected in one node, denoted by , with the probability .

The leak probability can be calculated as

After the probabilities of are calculated, the conditional probability table (CPT) can be obtained based on

2.3. The Prior Probability of the Model

The prior probabilities of the whole model depend on the survival probability of the materials and can be affected by the current material properties of the engine components during running conditions. The movement of the pistons and the conrods with the crankshaft can be defined by the reciprocating linear motion of the piston and the rotary motion of the crankshaft. The inertial force can be described as the combination of the reciprocating inertial force generated by the reciprocating mass with the centrifugal force generated by the rotating mass. As such, the prior probabilities can be described by the Probabilistic-S-N curve (P-S-N curve) of the material [30] based on the Bayesian theory. According to the probability model presented by Feng et al. [31], the survival probability density function of a material parameter iswhere is the fatigue life of the component and T is the number of cycles. The prior probabilities can be obtained from that combined with data handbook [32].

3. The Fault Diagnosis System

3.1. The Three-Layer Structure

The logic system is made up of nodes and arcs. The nodes are divided into three different layers: the upper layer, the middle layer, and the lower layer [33]. The arcs are the links between nodes in neighboring layers, but there are no direct links from the upper to the lower layer. Therefore, all the arcs from upper layer nodes point to middle layer nodes, and the arcs from middle layer nodes point to lower layer nodes.

The first layer, which is called the component layer, consists of key components of the engine related to the probabilities of fault occurrence that can affect the fault prior probabilities. In this layer, the states of the nodes provide the network with prior probability in the calculation process. Any adjustments in the states of the first layer nodes will influence the fault nodes in the second layer. The second layer, which is called the fault layer, consists of different types of fault that may occur in the engine. Each type of fault is represented by one node. After the calculation, each node will be in a determined state, with a percentage indicating the probability of the correspondent fault to occur during the running. The third layer, which is called fault symptom layer, consists of several engine characteristics. The working state information of the engine can be measured and transmitted to a computer, where the engine signals, which can reflect fault symptoms, will activate the related nodes in this layer. Then, the activated nodes will turn to the fault state, which is also T state meaning there exists a kind of fault, and the calculation will proceed to the second layer. The three-layer structure is shown in Figure 3.

For the CPT construction, assuming that, can be obtained according to (4),and the connection probability of the parent nodes of S can be calculated based on (8):since can be obtained by the following equation to get .For the prior probabilities based on (7), Here obeys the lognormal distribution. Since we have standard normal distribution variable which is denoted bythe distribution function of standard normal distribution isThen the prior probabilities of engine component of fault in the ICE can be

3.2. The Establishment of the Model

For the ICE fault diagnosis, the engine performance information can be detected by sensors and transmitted to the computer that runs the system. The probability of the potential fault can be calculated by the inference engine.

Sensors are placed on key parts of the ICE in order to collect useful information. Vibration piezoelectric and eddy current sensors are placed on the surface of the engine in order to collect running state information, which will be transferred to the data collector. Before the inference procedure, all signals are processed using a denoising filter. Then the information collected in the computer storage will be processed and analyzed. The combination of the acquired data can reveal the current state of the machine.

The system was established based on diagnostic rules acquired from experts, published papers [3436], or experimental knowledge. Each signal collected by the sensors and transmitted to the computer is compared to a threshold value that determines the node state, which is set beforehand by an expert at the initialization of the system. Based on a single signal or a combination of signals, different fault probabilities can be calculated, depending on if the value is above or below the threshold. The third-layer nodes and the corresponded IDs can be seen in Table 1, while the corresponding relations between the signals and the fault types among the three layers is illustrated in Figure 4. When the input signal is transmitted into the system, the component layer and the symptom layer nodes will affect the fault layer. The direct interactions between any two nodes of adjacent layers are presented.

The upper layer contains information about the current condition of the key ICE components, which is related to the fault types in the second layer. The key engine components are the cylinder, the piston, and the bearing. As it can be seen in Figure 4, each node in first layer is connected to one or more nodes in the middle layer.

The second layer is the fault layer and contains nodes of three different fault categories that may occur: cylinder knock fault, cylinder score fault, and bearing wear fault. The fault nodes are independent of each other but they can be activated at the same time. There is a node of leak fault in this layer containing the leak probability with Noisy-OR gate. The leak node of fault is connected to every node in the third layer with directed arcs.

The third layer contains the fault characteristics that can be detected, according to the parameter thresholds, directly from the signals. Once the value of the current signal is above or below the threshold value, the state of the node is activated. The system will analyze the signals to deduce the fault possibility based on the knowledge base.

The prior probabilities of the different components are set independently according to the parameters of each component, and the leak probabilities are set to 0.001 as default. Based on previously collected fault case data and experience, a probability analysis is performed by the system, setting the conditional probabilities of the initial parameters, which can be adjusted with experiences (Table 2).

4. Experiments on a Diesel Engine

4.1. Test Procedure and Results

A 12-cylinder diesel engine (Figure 5) was used as the fault simulation testbed for the case study. Several different fault-related signals were measured by the sensors, while the machine was operated under a manually developed fault environment. A fault symptom signal could be reflected by the vibration and the transient rotation speed and could be captured by different sensors on the machine. The sensors, with a data acquisition frequency of 25600 Hz, were placed on the top of each cylinder head and on both sides of the crankshaft case. The vibration signals of the components in the running state were collected and transferred to the system and then were analyzed.

The diesel engine is one TBD234 twelve-cylinder V-shaped direct injection diesel engine, made in Henan Diesel Engine Industry Co. Ltd., Luoyang, China. The major features of the engine are summarized in Table 3.

In order to investigate the suggested approach, three different fault types were simulated that are cylinder knock fault, bearing wear fault, and cylinder score fault. For each fault simulation, the experiments were repeated eight times under the same environment but with different loads. The loads were supplied by a hydraulic dynamometer and were 0 N·m, 100 N·m, 200 N·m, 500 N·m, 600 N·m, 800 N·m, 1000 N·m, and 1200 N·m from group a to group h, respectively.

4.1.1. The Cylinder Knock Fault

In order to simulate the cylinder knock fault, the space between the piston and the cylinder head was filled with bronze ring. When the piston moved to the top dead point, the collision between the piston and the cylinder head would cause the impact damage to the engine, which could lead to the deformation of the piston and the crankshaft. Figure 6 illustrates the bronze ring before and after the impact with the piston.

In the simulation, the vibration speed and vibration acceleration signal values increased, and both the vibration speed and the vibration acceleration nodes were triggered and activated. The combination of the two different signals was used by system to diagnose the occurred fault. Comparisons between the fault and nonfault vibration speed values of each simulation group are illustrated in Figures 7(a)7(h), while comparisons between the fault and nonfault vibration acceleration values of each group are illustrated in Figures 8(a)8(h).

During the running of the engine, a combustion takes place in the combustion stroke and the resulted vibration is displayed every four strokes. In Figures 7 and 8, the waves in the areas indicated by numbers of each experiment group can be clearly observed, evaluating that vibration speed features and vibration acceleration features are able to diagnose cylinder knock fault, also proving that both features can be used for fault diagnosis. The cylinder knock fault diagnosis results of each experimental group based on the probabilistic graphical model can be seen in Table 4. It can be observed that cylinder knock fault was diagnosed with high accuracy.

4.1.2. The Bearing Wear Fault

Due to the contact and the impact between the conrod and the bent axle, force is exerted on the tile of the engine by the alternating load. In the experiment, in order to simulate the bearing wear fault, the tile was adjusted and the gap between the bearing and the tile was reduced, leading to a decrease in the transient rotation speed of the engine, an increase in kurtosis, and an increase in the vibration speed of the crankshaft case. The comparison of a new tile and a tile with wear surface after the experiment can be seen in Figure 9. Comparisons between the fault and nonfault vibration acceleration values of each simulation group are illustrated in Figures 10(a)10(h), while the transient rotation speed can be seen in Figure 11. In order to have a succinct view of kurtosis trend, the relative value of the kurtosis was given and displayed in Figure 12. It came out after the calculation that was 0 minus the kurtosis value. The value of kurtosis was getting larger while the trend of the relative kurtosis went down as shown.

In Figures 10(a)10(h), the waves of the fault simulation demonstrated high vibration speed values compared to the nonfault vibration waves, during all the four stroke running states, especially in the areas indicated by numbers. The initial setting of the transient rotation speed was at 1500 rpm, while, as it can be seen in Figure 11, the transient rotation speed during the fault simulation was lower than the setting due to the increase of the tile friction. In addition, the value of kurtosis increased for all groups while the relative value of kurtosis decreased. The bearing wear diagnosis results for each experimental group can be seen in Table 5. The features of bearing wear fault, which are reflected by the changes of vibration speed, transient rotation speed, and kurtosis, were clearly proved to be able to diagnose the bearing wear fault. It can be used for the diagnosis, and the diagnosis based on the probabilistic graphical model was achieved with high accuracy.

4.1.3. The Cylinder Score Fault

In the cylinder score fault case, the friction between the piston and the cylinder was increased for the test simulation. This led to an increase in the vibration acceleration values and an increase in the torsional vibration. In Figure 13, the piston surface after the score fault simulation is illustrated. The vibration acceleration signals of the fault for each group can be seen in Figures 14(a)14(h), while the torsional vibration values are illustrated in Figure 15.

In 8 groups of Figure 14, the waves of vibration acceleration appeared in the areas indicated by numbers. In Figure 15, the peak values of torsional vibration amplitude of all groups were found higher compared to the values of the nonfault running state. The features of cylinder score fault were proved to be able to diagnose the fault, and the diagnosis based on the probabilistic graphical model was achieved with high accuracy. The cylinder score diagnosis results for each experimental group can be seen in Table 6.

5. Discussion

The average percentages of each fault category in the simulation environment can be seen in Table 7. The results demonstrated that the diagnosis correction rates of cylinder knock fault, bearing wear, and cylinder score were 99.80%, 96.89%, and 98.44%, respectively. Basically, a type of fault can be determined based on the rule that the largest fault probability is larger than a certain threshold . The initial value of can be determined by experts, and it is suggested to be 80% at the beginning for the ICE fault diagnosis. All the faults were distinguished successfully with high accuracy.

The diagnosis results were compared to the results that are processed and calculated with incomplete information. Single symptom can not distinguish fault type since the fault of cylinder knock and the fault of cylinder score have the same symptom that is vibration acceleration, while the fault of cylinder knock and the fault of bearing wear have the same symptom that is vibration speed. For the fault of cylinder knock, the vibration speed and vibration acceleration are also characters for other two faults. The probabilities of it were 88% and 76%, respectively, when two symptoms appeared separately. For the fault of cylinder score, it could not be distinguished only with the symptom of vibration acceleration. The probability of cylinder score was 89% when the character of vibration acceleration appeared alone and was 79% when the symptom of torsional vibration appeared alone. For the fault of bearing wear, although the transient rotation speed and kurtosis can be used for diagnosis when they appeared as fault symptom separately, the probability was only 67% and 85%, respectively. The probability was 85% when only the vibration speed symptom appeared. All the percentages of each type of fault are lower than proposed results. The suggested fault diagnosis method with the work of nodes together demonstrated good performance and high accuracy.

In further work, the fault categories can be expanded in the model, and the diagnosis network can be built with more diagnosis evidences. The capacity of the ICE fault diagnosis model can be improved.

6. Conclusion

A method for ICE fault diagnosis using the probabilistic graphical model was presented. Three types of machine faults and five symptoms were connected based on a three-layer network. The combination of various engine signals was used for diagnosing faults that may occur, and the use of leak probability with Noisy-OR gate improved the diagnosis accuracy. A 12-cylinder diesel engine was manually adjusted to simulate real fault cases of cylinder knock, bearing wear, and cylinder score. The output of the diagnosis method showed high diagnosis accuracy in the results presented in percentages.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research has been supported by the National Research Program of China (863 Program) (no. 2014AA041806) and the National Key Research and Development Plan (no. 2016YFF0203305).