Abstract

In the monitoring process of wind turbines the utmost attention should be given to gearboxes. This conclusion is derived from numerous summary papers. They reveal that, on the one hand, gearboxes are one of the most fault susceptible elements in the drive-train and, on the other, the most expensive to replace. Although state-of-the-art CMS can usually provide advanced signal processing tools for extraction of diagnostic information, there are still many installations, where the diagnosis is based simply on the averaged wideband features like root-mean-square (RMS) or peak-peak (PP). Furthermore, for machinery working in highly changing operational conditions, like wind turbines, those estimators are strongly fluctuating, and this fluctuation is not linearly correlated to operation parameters. Thus, the sudden increase of a particular feature does not necessarily have to indicate the development of fault. To overcome this obstacle, it is proposed to detect a fault development with Artificial Neural Network (ANN) and further observation of linear regression parameters calculated on the estimation error between healthy and unknown condition. The proposed reasoning is presented on the real life example of ring gear fault in wind turbine’s planetary gearbox.

1. Introduction

In recent years, large power planetary gearboxes became very popular elements of wind turbine’s drive-trains due to their ability to transmit a relatively high load from blades simultaneously providing a high gear ratio, which is required for efficient production of electrical energy by the generator.

On the other hand, as it was pointed out in a number of statistical summaries [14], gearboxes (with no special distinction for the planetary ones and others) are located on top of the list of most damage susceptible drive-train components in this group of rotating machinery. This is one of the reasons why wind turbine gearboxes, and planetary ones in particular, received the utmost attention of researchers and maintenance engineers. In consequence, this initiated the pursuit for providing the methodology that could detect the damage at the early stage and therefore limit the downtime of the entire machine. It should be mentioned that, due to the sharing of load between several meshes during operation of a planetary gearbox, usually a single malfunction results in total gear damage requiring full gearbox replacement. According to [2], a 2.5 MW wind turbine planetary gearbox replacement costs over £400,000, which significantly influences a total wind turbine revenue. Figure 1 presents such high power gearbox with two parallel stages and single planetary stage.

One of the possibilities for damage assessment of planetary gearboxes is vibration analysis. Unlike other typical monitored industrial objects, vibration analysis of such planetary gearboxes is generally more challenging. This is due to few reasons. Firstly, wind turbines inherently operate under nonstationary conditions, including variable speed and load. Therefore, analysis of its data requires advanced signal selection algorithms [5, 6] as well as sophisticated signal resampling methods [7]. Secondly, in order to maximize even teeth wear, planetary gearboxes are designed with maximum length of a full cycle (i.e., exact repeatable position of planets and all teeth). Thus, faults related to planets may manifest themselves differently upon measurements not covering entire cycles. Tertiary, due to relatively low speed of wind turbine planetary gearbox elements consequent in relatively low characteristic frequencies, accurate fault identification requires extremely long records, which are difficult to store and analyze. Moreover, usually low speed results in low fault induced energy, which in turn requires high sensitivity and high quality data acquisition. Finally, commercial wind turbine manufacturers do not allow any mechanical interference into planetary gearbox casing, limiting signal sources to vibration recorded from outside a gearbox.

The first attempts to the vibration analysis of gears were performed in the early 80s [8]. Over the years, a number of methods were developed; however many of them referred to analysis of spur gears [913] covering wide range of approaches like dynamic modelling [14, 15], wavelet transform [16], Wigner-Ville distribution [17], and Principal Component Analysis [18], just to name a few. The research interest in planetary gearboxes increased along with the computational and storage capabilities of modern CMS. This allowed providing methodology for detection of damage of a different nature, that is, spalling [19, 20] and pitting [21, 22], cracked tooth [23] and carrier [24], faulty planet gears [25, 26] and ring gears [27], and so forth. Zimroz and Bartkowiak [28] analyzed application of Principal Component Analysis and Canonical Discriminant Analysis with multidimensional matrix of variables for classification of planetary gearbox condition. This work was preceded by [29], where authors developed a method that takes advantage of highly varying operational parameters by relating gearbox’s condition to susceptibility of diagnostic features to load. It has to be mentioned that the exhaustive list of references related to this topic can be found in the review papers like [30, 31].

Unfortunately, although many diagnostic methods were proposed, a lot of them suffer from shortcomings that prevent their industrial applications. In [32], the authors noticed a requirement for visual inspection of diagnostic results for comparative assessment between healthy and damaged condition. Jablonski and Barszcz [33] point that many of the approaches assume precise location of gears in time, which is impractical for the majority of commercial installation. It was also observed that for particular techniques the data analysis is performed without the inclusion of information about the unsteady conditions of work [28]. Finally, based on the experience of the authors of this paper, in many contemporary CMS, especially in SCADA-based systems, raw vibration signals are not available for the end user. This is due to the limited data transfer, like for remote WTs, where data transmission is utilized via GPRS connection and prevents transmission of data packages containing highly sampled vibration signals of long duration. In such applications, the condition monitoring process is performed entirely on the basis of simple diagnostic features, like root-mean-square or the amplitude of vibration.

In such case, these indicators are observed over time (in the so-called trends) for the potential increase that might provide information of the occurrence of a malfunction (e.g., [34]). The additional obstacle is a nonlinear correlation between the level of diagnostic indicators and the corresponding operating conditions. Therefore the assessment on the behavior of the indicators is difficult [35, 36]. This demand for stationary or quasistationary conditions is rarely satisfied for quite large group of machines like wind turbines [3739] or mining excavators [40].

The above issue may be overcome by introducing operational states of machine [41, 42]. In this approach, each trend is tracked in a precalculated range of operational conditions. Moreover, operational states are used for data classification during the acquisition procedure and consequently in data analysis process. Such modes are selected in order to most accurately describe a particular machine. Furthermore, it is desired to define a state for which calculated trends show low (or the lowest there is) level of variation [42]. Such stability is required for reliable justification of alarm threshold value. Otherwise, that is, for high variability, sudden increase of estimator would cause false positive alarms, misleading in the process of condition monitoring. Unfortunately, as mentioned in [42], the methodology for this approach is in the early stage of development.

In this paper, the authors follow the idea of the comparative study of simple vibration features in highly nonstationary operating conditions. The proposed reasoning is based on backpropagation Artificial Neural Network (ANN) and linear regression that fit the estimation error in the function of selected operation parameter. According to the literature review, so far such solution has not been proposed to the failure detection process of planetary gearboxes. Good accuracy of ANN for condition monitoring was shown for rolling element bearings [4345], spur gears [46], centrifugal pumps [47], and power transformers [48]. In [49] the author employed ANN for estimation of remaining useful life of bearings. Bahmanyar and Karami [50] applied ANN to voltage signals while Crivelli et al. [51] verified its efficiency for acoustic emission. An interesting approach was proposed by Zhang and Wang [52] where ANN is used to model the behavior of generator’s bearing, yet this method was applied to slowly varying parameter, that is, temperature.

The technique proposed in this paper takes into consideration fluctuation of vibration signal features under highly varying operating parameters of speed and load. Firstly, the ANN is trained at the normal (referential) behavior of the machine. Then, for each upcoming unclassified observation, the given value of a feature is compared against the value estimated using the taught net. In case of the regular operation, the error in assessment is relatively small, and it would increase in case of arising malfunction. In order to limit the influence of changing conditions, the linear regression is employed and parameter (also called the error term) is observed. The method is presented on the case study example of planetary gearbox stationary ring fault that was not detected using classical trend analysis.

The paper is organized as follows. Within the introductory part, the state of the art of planetary gearbox diagnostics is presented and discussed with its shortcoming related to implementation in SCADA-based systems. The authors propose to use Artificial Neural Network and discuss its hitherto applications. Next, the principles of Artificial Neural Networks are described. The proposed fault detection approach is presented in Section 3. In Section 4, the usage of the method is shown in data acquired from commercially used wind turbine. Finally, the results are discussed and paper is concluded.

2. Artificial Neural Networks

The concept of backpropagation ANN was firstly proposed in [53]. It is constructed as a multilayer structure (Figure 2), containing three essential layers: input, hidden, and output, whereas the hidden layer might consist of multiple layers. The inputs information is passed to the unseen hidden layers with initial weights, generally described as which corresponds to weight given to information between input neuron and hidden neuron . In the process of a forward phase, the outputs are calculated using different set of weights between the hidden layer and the output layer. Then, the trained outputs are compared with the target ones and the error is established. In the following step, the error backpropagation is computed, which means that the weights are calibrated in order to minimize the estimation error for the entire set of training data. The last operation can be repeated in successive iterations called epochs, until the acceptable level of estimation error is established.

3. The Proposed Algorithm

As previously stated, the authors’ intention is to present the method that takes into account process parameters, like oil temperature, wind, and rotational speed or load, and detect the increase of one or more of diagnostic features in the presence of fault development. In order to do that, the dependency on fluctuation of process parameters has to be reduced. It is possible due to the learning process of Artificial Neural Network that estimates a relationship between the input process parameters and the output diagnostic ones. The contribution of the paper is to use ANN as the model of the normal behavior of the vibration signal features, depending on the most important process variables, and later to use the trained model as the reference to the actual values of these features.

After establishing the network for the training set of inputs and outputs, the testing procedure can be performed. The net estimates values of diagnostic features based on given inputs. Next, this prediction is subtracted from the actual indication carried from CMS, resulting in estimation error. In fact, there are three reasons for the erroneous estimation.

The first one is the improper training of the neural network resulting in invalid output estimations of trained ANN for the unseen input. In this case, the estimation error changes with operational parameters, yet it stays at the similar level in time. In order to minimize this shortcoming, it is required to train the neural network using long time measurements that cover the entire range of operation parameters.

The second reason is related to the dynamics of the observed machine, that is, with unequivocal relation between operational parameters and diagnostic features. This happens when for a given combination of input parameters the output takes various values. In the training process of ANN such ambiguity is not possible to model since the taught net assigns a single value for each combination of inputs. It should be pointed out that, for the unchanged condition of machine, the estimation error related to imprecise training of ANN remains constant in time.

The third one appears when measured values do not correspond to those calculated in the referential period. This might indicate either sensor failure or, even more importantly, a damage development.

All of the aforementioned reasons might exist at the same time; therefore it is suggested to minimize its random influence and emphasize any deterministic phenomena. This can be done using the idea similar to the one presented in [29, 37]. Zimroz et al. proposed to observe diagnostic features against chosen operational parameter in several segments obtained from trend plot. For each segment, the linear regression function is fit to the cloud of observations. The obtained parameters and are used for decision making process.

In the approach proposed in this paper, the linear regression is used to fit the estimation error in the function of selected operation parameter and not to the raw diagnostic features. It is based on the conclusion that in bad condition of machine the values of diagnostic features, and simultaneously the estimation error, would increase regardless of the change of operation conditions. Thanks to use of ANN, the dependency of error on operational parameters is limited, so the diagnostic reasoning can be limited simply to parameter , also called the error term. For good condition, this parameter remains at low level (close to zero), and for bad condition it is significantly increased (Figure 3). In this case, threshold value may be established using statistical methods as presented, for example, in [54, 55], with no concern on highly changing operating conditions. Additionally, in order to improve the time domain resolution and hence accelerate the diagnostic decision, the segmentation may be performed with overlap [37]. In the proposed reasoning parameter is not taken into consideration since in general it is more susceptible to ambiguous association between inputs and output of ANN, and it is related to character of machine’s operation. As presented in Figure 3, the distribution of error might be uneven in the domain of operational parameter, so it influences fitted regression line (marked in blue). Therefore it is concluded that parameter does not carry diagnostic information.

4. Case Study

4.1. Object Description

The proposed method is presented on the data acquired using vibration-based CMS installed on commercially utilized wind turbine (WT) located in western Germany. This 1.5 (MW) WT has a standard design of a drive-train; that is, gearbox and generator are in line in the nacelle, as presented in Figure 3. The main rotor is driven by three blades and supported by the main bearing (A1). The planetary gearbox (A2) has three planets, which are impelled by the planet carrier. The planets roll over the ring and transmit the torque to the sun gear, simultaneously increasing the rotational speed. The sun shaft passes the torque from the planetary gear to the two-stage parallel gear (A3 and A4). The parallel gears have three shafts: the slow one that clutches to the sun shaft, the intermediate one, and the fast shaft, which drives the generator. The generator produces AC current of a varying frequency. It is firstly converted into DC and then into AC current of a frequency equal to the grid one. Electric transformations are performed by the controller located at the base of the WT’s tower. In general, such drive-train multiplies the rotational speed from about 25 (RPM) on the main rotor to about 1800 (RPM) at the generator.

In the typical vibration-based CMS, it takes six sensors to collect data adequate for the proper diagnostics of the entire drive-train. They are located as presented in Figure 4: on main bearing (A1), on planetary gearbox (A2), on two parallel gearboxes (A3 and A4), and on the generator bearings (driven end (A5) and nondriven end (A6)).

As listed in [56], for machine with similar construction, nearly 150 indicators are extracted for the purpose of condition monitoring. The data is collected together with the corresponding operating parameters, namely, load and rotational speed.

In the described CMS, each parameter was acquired every 15 minutes from 10 (s) measurements. The vibration signals were collected with 25 kHz of sampling frequency. Process parameters were recorded simultaneously to vibration signal at rate of once per second, so that 10 measurements were collected for each vibration signal. In the next step, these values were averaged, so that, as the output, the end user of the system received one value per observation for each of the parameters. It should be mentioned that, before the procedure of feature extraction, the validation of signals was performed according to the guideline proposed in [5]. It included partial filtration of data for low rotational speed (below 1050 (RPM)) and periods where load was below 0 (kW), that is, when wind turbine did not generate power.

In the presented case study, the wind turbine was observed for about 13 months—from the time of CMS installation until its stoppage for gearbox replacement. In total, about 38500 observations were recorded out of which about 26000 satisfied the validation requirement mentioned above. In the meantime, the extended wear of ring gear was developed. This phenomenon was not detected by CMS, yet it was found during periodical service measurements around observation 22500. In Figure 5, one can observe frequent changes of the operational state, although slight increase of wideband features was visible. The increase is only a small percent, and so typical approach for setting a threshold will not work, since it is based on the empirical distribution function [54].

The authors would like to discuss the approach based on ANN and linear regression that could reveal fault development in this gearbox. The data used for the proposed algorithm include two process parameters, namely, rotational speed of the generator’s shaft and generator’s output power, understood as load. There is a large number of features, which are calculated from raw vibration signals. Most popular ones are narrowband energies representing frequencies equal to gear mesh, shaft harmonics, and so forth. There are also broadband features, which are less susceptible to noise and windowing effects and are considered more robust ones. This is the main reason why it has been normalized and is used as an industry standard [57]. Two diagnostic features were selected, namely, root-mean-square (RMS) and peak-peak (PP).

4.2. Construction of ANN

One of the most important problems in condition monitoring of machinery in highly changing operational conditions is a nonlinear influence of such conditions, in particular rotational speed and load, on the level of vibration and thus on the fault indicators. It is often observed that the increase in amplitude related to changing conditions is much higher than the one created by a machinery fault, especially in its early stage. In the authors’ opinion, one of the possible methods to reduce this problem is the use of ANN. In this approach, the network is trained using data collected in the referential (healthy) period of machine’s operation and then used for estimation of values in the testing period, when the condition of machine is unknown. In the beginning of the investigation, a number of combinations for the ANN structure were tested. This resulted in conclusion that the least erroneous estimation is obtained using the 2-10-20-2 combination of nodes with mean square error equal to 0.0144 for PP and 0.0025 for RMS. The structure described as 2-10-20-2 means that two nodes were used as inputs (i.e., rotational speed and load); there were two hidden layers consisting of 10 and 20 nodes, respectively. Finally, the RMS and PP indicators were employed as the output (target) values.

4.2.1. Training

The proposed network was trained using about 3 months of observation (Figure 5), which amounted to about 6000 samples for each parameter. As mentioned previously, the rotational speed and load (power output) were treated as input parameters for training of the network, since it can be stated that it mostly influences the diagnostic parameters [29]. The RMS and PP were used as outputs.

The training on the given structure took place in 500 repetitions (epochs), during which the initial weights were corrected for minimizing the output error of prediction in the process of backpropagation (Figure 2). Finally the established net was used for the remaining data for possible damage detection.

4.2.2. Testing

The testing set covered about 20000 observations for each of the selected features, as presented in Figure 5. The input data were used by the established net of RMS and PP as outputs in order to achieve estimations of these parameters in the substantial time of the unknown operation. In the next step, the approximations were compared with the values calculated from the actual vibration signals recorded by CMS. Thanks to this procedure, the error of the estimation was obtained.

4.2.3. Linear Regression Fitting

In the next step, the data was divided into overlapping segments (overlap was equal to 50%), each containing 2000 observations (Figure 6).

In Figures 7 and 8 one may observe distribution of estimation error against two operating parameters for PP and RMS, respectively. It can be seen that errors increase regardless of speed and load, yet they do not separate entirely from the initial segment. This is due to the small increase of diagnostic features in the initially observed trends and it prevents using classical methodology based on thresholds.

4.3. Fault Recognition

In the next step, for each segment linear regression was fit using least squares approach and parameters and were obtained and presented in Tables 1 and 2. Additionally, it is plotted in Figures 9 and 10. It can be seen that parameter does not change significantly regardless of the diagnostic indicator and operation parameter used for regression analysis. In each case, the oscillation around zero indicates even level of estimation error.

On the other hand, for parameter the difference between initial and deteriorating condition of ring gear is significant. It is clear that the increase is higher using this parameter when compared to trend of diagnostic feature alone. Moreover it is now possible to establish the control limit.

It is important to mention that the observed increase is not related to selected operating parameter, yet it is higher when observed in speed domain. This might be due to the wind turbine’s design, which aims at establishing rotational speed at certain levels (typically low and high). The arising error in estimation cumulates in few clouds around these speeds and influence the regression analysis. In the authors’ opinion, this should not be considered as a drawback of a technique since it emphasizes arising malfunction.

5. Conclusions

Within the research paper, the application of backpropagation Artificial Neural Networks was investigated for detection of fault in wind turbine’s planetary gearbox. ANN modelled the selected features depending on operational parameters and linear regression was later used to distinguish between the healthy and damaged state.

The analysis was focused on two commonly used diagnostic features, namely, root-mean-square (RMS) and peak-peak (PP). In the presented case study, it was shown that these indicators did not reveal sufficient symptoms of failure occurrence. To the contrary, the presented methodology exposed early stage of damage several months before gearbox replacement. This was possible by trending the erroneous estimation of diagnostic features and further fitting of linear regression in function of operation parameters. Thanks to the ability of ANN to model nonlinear dependencies, it was possible to reduce difficulties related to constantly and highly varying operating parameters. Parameter of linear regression provided information about general increase of error for the entire range of operating parameter and hence provided information of the development of damage in gearbox.

The additional benefit of the proposed approach is that it can be used with various input parameters and fault indicators. The method does not require preceding filtration of data, so there is no need for operational state setup. Clear distinction between initial and deteriorating condition makes it possible to establish a control limit and therefore automatize the diagnostic process. In the authors’ opinion, it shows great potential for application in condition monitoring of various rotating machinery, such as wind turbines (as presented), mining excavators, or gas turbines.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The authors would like to acknowledge the financial support of AGH University of Science and Technology within the program of Dean’s Scholarships (no. 15.11.130 987).