Abstract

In the present study, artificial neural network is used to model the relationship between NOx emissions and operating parameters of a direct injection diesel engine. To provide data for training and testing the network, a 6-inline-cylinder, four-stroke, diesel test engine is used and tested for various engine speeds, mass fuel injection rates, and intake air temperatures. 80% of a total of 144 obtained experimental data is employed for training process. In addition, 10% of the data (randomly selected) is used for network validation and the remaining data is employed for testing the accuracy of the network. The mean square error function is used for evaluating the performance of the network. The results show that the artificial neural network can efficiently be used to predict NOx emissions from the tested engine with about 10% error.

1. Introduction

Direct injection diesel engines are used as propulsion systems with low fuel consumption and very high efficiency for automotive applications. Any attempts to use their privileges require considering emissions stringent disciplines which enforce engine manufacturers to tender their productions with lower emissions [1]. Recently, Lenz and Cozzarini [2] have presented statistics showing that the worldwide passenger car and commercial vehicle traffic contribute 20% of the total anthropogenic emissions of nitrogen oxides (NOx). These emissions have damaging effects upon the environment and people. Therefore, how to control the exhaust emissions especially NOx from diesel engines has become an essential subject for researchers of the automotive field in the world.

The diesel engine industry has undergone a great technical development in the last few years, creating a number of new strategies such as electronic control units (ECUs) and/or engineering management systems (EMSs) as well as new injection systems [37]. They all use some kinds of artificial intelligence (AI) techniques such as artificial neural network (ANN) to process the engine operating conditions and prognosticate the fairly best values of the controlling parameters with the aim of optimizing the engine characteristics.

Digital computers have provided a rapid means of performing many calculations involving the ANN methods. Along with the development of high-speed digital computers, the application of the ANN approach could be outspread in a very impressive rate in several fields. One of the major applications of ANN is industrial pollutants control. Kalogirou [8] presented an elaborated review on the recent applications of AI in environmental pollutants control.

Neural networks are powerful modeling techniques with the ability of identifying cryptic nonlinear highly complex relationships between their input and output data [9]. ANN describes such relations by updating network weights using a trial-and-error-based arithmetic method and a training algorithm such as Levenberg-Marquardt (LM).

A number of studies have been conducted to predict the characteristics of internal combustion engines (ICE) by using ANN approach. This approach has been used by Xu et al. [10] to predict engine systems reliability. The injection characteristics of direct injection (DI) diesel engines have been investigated by Yang et al. [11]. In [12], the effects of NOx and soot level in the case of high-pressure fuel injection have been investigated in a single-cylinder DI diesel engine. ANN has been used to predict the exhaust emissions and performance of a diesel engine taking into account several operating conditions such as the percentage of throttle opening, injection time, engine speed, and fuel compositions as the network inputs [1315]. However, there is no literature that reports the application of neural network to predict and model NOx emissions in terms of engine speed, intake air temperature, and mass fuel injection (MFI) rate. In this study, NOx emissions from a diesel engine are investigated using ANN. For this purpose, experimental tests have been conducted for 144 engine speeds ranging from 591 to 2308 rpm.

2. Experimental Setup

The test engine used to conduct the experiments is a heavy duty (HD) six-cylinder, direct-injection, four-stroke diesel engine. The technical specifications of the engine are given in Table 1. Standard laboratory procedures are used to measure the engine operating parameters and its tailpipe emissions (see Figure 1). The engine is connected to the data acquisition systems, so that several operating parameters could be simultaneously measured and precisely controlled. The ST10 fuel controller sensor is used to measure the mass fuel injection rate in the range of 0.39–10.31 g/sec. An electrical dynamometer is assembled on the engine and used to measure the speed, brake power, and torque of the engine. The engine speeds are recorded between 591 and 2308 rpm. Simultaneously, other engine properties such as exhaust emissions (NOx, soot, HC, CO, CO2), air-fuel ratio (AFR), and intake air temperature are measured by various connected instruments. The range of variations of the operating parameters and the corresponding values of NOx emissions have been listed in Tables 2 and 3, respectively [16, 17].

3. ANN Approach

The building unit of an ANN is a simplified model of the much more complex one known as organic neuron. This model was introduced by the neurophysiologist McCluch and the logician Pitts in 1943 [18], but its learning behavior was first treated extensively in a book by Rosenblatt in 1962 [19].

One of the main advantages of ANN is its ability to model complex nonlinear relationships between multiple input variables and the required outputs. Another important advantage of the ANN approach is its fast response, which allows one to use it in more complex procedures including optimization applications. Therefore, it offers the advantage of being fast, accurate, reliable, and powerful in dealing with multivariate problems as well as in the prediction or approximation affairs, especially when numerical and mathematical methods fail [20, 21].

To get the best prediction by the network, many parameters should be adjusted such as biases, weights, number of hidden layers, number of hidden layer neurons, and type of transfer function. The biases and weights must be modified in every epoch by using training algorithms such as LM algorithm. The performance of the network is evaluated by comparing the error obtained from converged neural network runs and the measured data. The error of the network is calculated at the end of training, validation, and testing processes based on the differences between the targeted and calculated outputs. The back propagation algorithm is used to minimize the error function, which relates the outputs of each neuron in the output layer and the corresponding desired output. The error function used here is the so-called mean square error (MSE) function given by 1𝐸=𝑄𝑝𝑘𝑦𝑝𝑘𝑓𝑝𝑘2,(1) where 𝑄 represents training pairs of vectors, 𝑘 is the index of elements in the output vector, 𝑦𝑝𝑘 is the 𝑘th element of the 𝑝th desired pattern (target value) vector, and 𝑓𝑝𝑘 is the 𝑘th element of the output vector when pattern 𝑝 is introduced as input to the network. Investigations have proved the accuracy and rapid convergence of LM algorithm for training in engineering applications with limited number of experimental data [22, 23]. In the present work, the LM training algorithm is employed, which uses Hessian matrix approximation. In what follows, a detailed description of this algorithm is presented.

4. LM Algorithm

The LM algorithm is a virtual standard in nonlinear optimization which significantly outperforms gradient descent and conjugate gradient methods for medium-sized problems. It is a pseudo-second-order method which means that it works with only function evaluations and gradient information but it estimates the Hessian matrix using the sum of outer products of the gradients (for more details, see [23]). It also fits a curve on a given dataset by finding the optimum parameters 𝛽 based on a user specified model 𝑓 such that the final parameters can characterize the target function by minimizing the errors. This is analogous to solving the least squares problem, where we want to minimize the sum of squares between the target values 𝑦𝑖 and the output of the user model 𝑓 with the input values 𝑥𝑖 and the estimated parameter vector 𝛽 with 𝑚 number of input data points [24]: 𝑆(𝛽)=𝑚𝑖=1𝑦𝑖𝑥𝑓𝑖,𝛽2.(2)

Here, 𝑆(𝛽) is the function to be minimized. To start a minimization, the user has to provide an initial guess for the parameter vector, 𝛽, which may be critical in convergence if the user model 𝑓 is high dimensional or nonlinear with a large number of parameters. In many cases, an uninformed standard guess like 𝛽𝑇=(1,1,,1) will work fine; in other cases, the algorithm converges only if the initial guess is already somewhat close to the final solution. At each step, the initial parameters are updated by a small amount in the optimum direction by adding the update delta values, 𝛿, such that 𝛽𝑡+1=𝛽𝑡+𝛿.(3)

To find the update delta value, 𝛿, we need to solve for the approximation of the sum of squares function by setting the gradient equal to zero as follows: 𝐽𝑇𝑓𝑥𝑦𝑖,𝛽+𝐽𝑖𝛿=0.(4)

This relation can readily be reduced to 𝐽𝑇𝐽𝑖𝛿=𝐽𝑇𝑥𝑦𝑓𝑖,𝛽,(5) where 𝐽 is the Jacobian matrix or gradient of the function that describes the user model and the superscript 𝑇 represents its transpose form. Equation (5) yields a set of linear equations in the form 𝐴𝑥=𝑏, where 𝐴 is a square matrix, 𝑥 represents the vector of unknown delta values, and 𝑏 is a known vector of the same length as 𝑥. At each step, the delta values can be obtained by solving this set of linear equations. However, the LM algorithm also adds a regularization parameter 𝜆 that helps as the dampening factor to produce the final LM equation: 𝐽𝑇𝐽𝑖𝐽+𝜆diag𝑇𝐽𝑖𝛿=𝐽𝑇𝑥𝑦𝑓𝑖,𝛽.(6)

Here, diag(𝐽𝑇𝐽𝑖) stands for the principal diagonal elements of the matrix 𝐽𝑇𝐽𝑖. This equation is still a system of linear equations of the form 𝐴𝑥=𝑏 and one can use direct solvers to solve for delta values at each step.

5. Implementation of the ANN to Predict NOx Emissions

The neural network toolbox of MATLAB 7.8 is used to form the ANN. Simple and detailed structures of the employed ANN have been shown in Figures 2 and 3, respectively. According to the Kolmogorov theory, multilayer perceptron algorithms can approximate any complex and nonlinear relation between input and output data, among which the three-layer algorithm is the simplest but efficient one. The three layers include the input layer, the hidden layer, and the output layer. Each layer involves some neurons which should be properly determined. The number of input and output parameters of the system determines the number of neurons in the input and output layers of the network, respectively. Thus, the input layer has three neurons while the output layer has only one neuron. It should be noted that in the present work, nominal 19 neurons (determined by trial and error) are used in the hidden layer.

The number of data patterns required for training the network should be chosen in such a way that the network is properly trained and in the meantime adequate data is remained for testing the network. In addition, it is essential to set aside some data patterns for validating the network during the process. About 80% of the total 144 experimental data (i.e., 116 data) is used for training the network and 10% (i.e., 14 data) of the data is used for validation. The remainder data is left for testing the network. Neural network requires that the range of both the input and output values lies between 0 and 1. For this purpose the following formula is used to normalize these values [25]: 𝑉norm=𝑉𝑎𝑉min𝑉max𝑉min×𝑉𝑉𝑙+𝑉𝑙,(7) where 𝑉norm is the normalized value and 𝑉𝑎 is the actual value of the data. 𝑉min and 𝑉max are the minimum and maximum values of the experimental data, respectively. Also, the values of 𝑉 and 𝑉𝑙 are set to be 0 and 1, respectively.

There are various types of transfer functions such as logsig, tansig, purelin, among others. In the present work, the logsig transfer function is used in both the hidden and output layers. This function is defined as 1logsig(𝑛)=1+𝑒𝑛,(8) where 𝑛 is the weighted sum of the input.

6. Results and Discussions

The artificial neural network is used to predict the NOx emissions from a direct injection diesel engine using LM training algorithm. Technical specifications of the test engine are given in Table 1. The input data of the network are the measured operating parameters of the engine such as mass fuel injection rate, intake air temperature, and speed of engine whose range of variations is given in Table 2.

A MATLAB program has been developed to first obtain the desired correlations for training, validation, and testing stages of the network. Then, the accuracy of the network is evaluated through the comparison of the predicted values of NOx emissions with the experimentally measured ones. The total 144 measured engine’s operating parameters and the corresponding values of NOx emissions are listed in Table 3.

Figures 2 and 3 show the simple and detailed structures of the ANN employed, respectively. These figures demonstrate the three layers of the network, namely, input layer, hidden layer, and output layer. The operating parameters of the engine are fed into the network as inputs and NOx emissions leave the network as outputs. Note that 𝑊 in Figure 3 represents the weight of the layer.

Figure 4 shows a regression analysis between the network response (outputs) and the corresponding targets. According to this figure, the training process has been properly performed, where the correlation factor between outputs and targets is 0.91972. Figures 5 and 6 show the validation and testing results of the network, respectively. It is observed from these figures that the ANN represents the best accuracy in modeling the NOx emissions with correlation factors of 0.98222 and 0.89123, respectively, for the network validation and testing.

The results show that the ANN with LM training algorithm is an appropriate technique, which can accurately predict NOx emissions for different engine operating parameters including engine speed, intake air temperature, and mass fuel rate. A comparison between the predicted and the measured values of NOx emissions are depicted in Figure 7. There is a good agreement between the predicted values using the neural network model and the measured values obtained from experimental tests. It may be noted that, for medium engine speeds, the agreement is more considerable than that for the medium speeds. For medium speeds the MSE is less than 8%.

7. Conclusions

The operating parameters involving speed, intake air temperature, and mass fuel rate of a DI diesel engine have been used to train the ANN to predict NOx emissions from the engine. The results of this research reveal that a three-layer neural network along with LM training algorithm leads to a desirable mapping between the inputs and outputs of the network case. The proposed ANN model for prediction of the NOx emissions gives the correlation factors of 0.92, 0.98, and 0.89 for training, validating, and testing the network, respectively. It is concluded that, ANN model is a potentially feasible tool for prediction of NOx emissions from a diesel engine with respect to the engine operating parameters, especially in medium engine speeds.

Nomenclature

𝑓:ANN output value
𝑀𝑓: Mass fuel injection (g/sec)
𝑁: Engine speed (rpm)
𝑄: Number of pairs
𝑅: Correlation factor
𝑇𝑖: Intake air temperature (C)
𝑥: Input value
𝑦: Target value.

Greek  Symbols

𝛽:Estimated parameter
𝛿:Updated delta value.

Abbreviations

AFR:Air-fuel ratio
AI:Artificial intelligence
ANN:Artificial neural network
DI:Direct injection
ECU:Electronic control unit
EMS:Engineering management system
GD:Gradient descent
HD: Heavy duty
LM:Levenberg-Marquardt
MF:Mass fuel injection.

Acknowledgment

This research has been supported by the Iranian Diesel Engine Manufacturing (IDEM) Company.