Abstract

Ozone (O3) flux-based indices are considered better than O3 concentration-based indices in assessing the effects of ground O3 on ecosystem and crop yields. However, O3 flux (Fo) measurements are often lacking due to technical reasons and environmental conditions. This hampers the calculation of flux-based indices. In this paper, an artificial neural network (ANN) method was attempted to simulate the relationships between Fo and environmental factors measured over a wheat field in Yucheng, China. The results show that the ANN-modeled Fo values were in good agreement with the measured Fo values. The R2 of an ANN model with 6 routine independent environmental variables exceeded 0.8 for training datasets, and the RMSE and MAE were 3.074 nmol·m−2·s and 2.276 nmol·m−2·s for test dataset, respectively. CO2 flux and water vapor flux have strong correlations with Fo and could improve the fitness of ANN models. Besides the combinations of included variables and selection of training data, the number of neurons is also a source of uncertainties in an ANN model. The fitness of the modeled Fo was sensitive to the neuron number when it ranged from 1 to 10. The ANN model consists of complex arithmetic expressions between Fo and independent variables, and the response analysis shows that the model can reflect their basic physical relationships and importance. O3 concentration, global radiation, and wind speed are the important factors affecting O3 deposition. ANN methods exhibit significant value for filling the gaps of Fo measured with micrometeorological methods.

1. Introduction

In an atmospheric column, approximately 90% of total ozone (O3) exists in the stratospheric O3 layer and protects Earth’s surface from excessive UV radiation. This category of O3 is therefore referred to as “good” ozone [1]. However, the troposphere O3 can negatively affect vegetation tissues, photosynthesis, and crop yields [2]. The tropospheric O3 mainly results from the in situ photochemical production, and a small portion is transported from the stratosphere [3]. O3 destruction involves chemical decomposition and deposition on Earth’s surface. The total O3 deposition can be quantitatively reflected by measured O3 flux (Fo), and its magnitude and variations can facilitate understanding of the O3 deposition processes.

The quantitative assessment of the effects of ground O3 on ecosystem and crop yield loss in a region requires selection of an assessment model and calculation of the corresponding assessment index, and the calculation of assessment indices requires continuous measurement of O3 concentration or flux [4, 5]. The concentration-based indices do not consider the status of vegetation and ecosystems (e.g., stomatal conductance and leaf area index). Many studies have shown that O3-reduced yield loss is more closely related to the stomatal O3 flux [57].

Gerosa et al. [8] presented a method of estimating O3 stomatal flux by partitioning total Fo. Briefly, O3 stomatal resistance was obtained by converting vapor stomatal resistance (Penman–Monteith approach), and the O3 concentration near the canopy was calculated using the resistance model, which needs the total O3 flux. So, measuring total Fo is the first step in estimating stomatal O3 flux [9]. Fo is usually measured by micrometeorological methods, and the eddy covariance (EC) method is considered the most direct flux-measuring method [8, 1012]. However, due to a lack of high performance fast-response O3 analyzers (e.g., it needs to change O3-sensitive dye often and its sensitivity is unstable), EC-measured O3 flux has many temporal gaps in the associated datasets [8, 13, 14]. This leads to difficulties in calculating the value of a O3 stomatal flux-based index using the EC O3 flux since the effect of O3 on crop yields results from the accumulation of O3 damage. Therefore, it is necessary to find a method of filling the gaps in O3 flux datasets.

The gap-filling methods for ecosystem CO2 flux have been summarized in the literature [15, 16]. However, there have not been investigations into methods for filling gaps in Fo data. As the controlling factors for O3 flux are more complex, it is difficult to produce physically based equations to reflect the Fo and environmental variables under any conditions. Although some mechanism-based models (typically a resistance-in-series) have been developed, many parameters must be empirically estimated based on the specific site or type of vegetation [17, 18].

Artificial neural networks (ANN), as an artificial intelligence using machine learning techniques, have been applied for approximation, prediction, classification, modeling, and data gap-filling in many scientific disciplines [16, 1921]. ANN is able to learn from training data and to represent the nonlinearity between the dependent and independent variables. It is suitable for simulating empirical datasets with complex and nonlinear relationships between variables and dependent variable(s). It is particularly useful when it is difficult to express these relationships using one or more simple equations. Compared to physically based models, ANNs have more success in simulating complex processes with a high precision without any analytical models [2224]. These attributes make ANNs a good candidate for attempts at filling gaps in O3 flux data. In addition, a sensitivity analysis can be conducted by analyzing the response curves of the different input variables that are created by fixing all other input variables at a certain value [25, 26]. For gap filling of Fo, ANN modeling is also a practical and simple method [16].

ANN methods have been used widely to model and predict O3 concentration [27, 28] and CO2 fluxes [21, 29, 30]. However, the ANN approach has not been used to simulate O3 flux variation with other factor fluxes or meteorological factors. The objectives of this study are to (1) develop a series of ANN models to simulate the relationship between Fo and environmental variables; (2) determine the optimal ANN model to simulate Fo for interpolation of measured Fo; and (3) analyze the responses and relative importance of different environmental variables to O3 flux.

2. Materials and Methods

2.1. Site and Observations

The observations were carried out over a winter wheat field at the Yucheng Comprehensive Experiment Station of the Chinese Academy of Sciences (36°50′N, 116°34′E; 28 m a.s.l.; Shandong Province, China). The site is located in the Northwest-Shandong plain, characterized by loamy soil texture and semiarid and warm temperate climate. The experimental site is very flat, and the fetch requirements for EC measurements are met. The study period of the field experiment covered entire growing season of wheat (from 2 March to 6 June 2012).

The ambient O3 concentration was measured with a slow-response UV absorption-based O3 analyzer (Model 205, 2B Technologies Inc., USA). The O3 flux was measured with the eddy covariance method in combination with observations of CO2, H2O, and sensible heat fluxes. These variables were measured with a 3D sonic anemometer (CSAT3, Campbell Scientific Instrument, USA) and an open-path CO2/H2O gas analyzer (LI-7500, LI-COR Biosciences, USA). The O3 fluctuation was measured with a fast-response O3 analyzer developed by Enviscope GmbH [31] (hereafter referred to as ENVI). Micrometeorological and radiation variables measurements include air temperature and relative humidity (HMP45C; Vaisala Co., Finland), wind speed (A100R; Vector Instruments, UK), net radiation (CNR1; Kipp and Zonen, the Netherlands), and photosynthetically active radiation (LI-190SB; LI-COR, USA). All sensors were installed at 2.2 m height. Because of the continuous consumption of organic dye, the sensitivity of the ENVI slowly decreased with time. To maintain high sensitivity in the ENVI, we replaced the organic dye disc every 3 to 4 days. Raw data from the EC system were recorded at 10 Hz, and 30-minute mean data were recorded by a data logger (CR3000, Campbell Sci., USA).

2.2. O3 Flux Calculation

Since the fast-response O3 analyzer’s sensitivity is not constant [14], the sensitivity must be calibrated by a slow-response O3 concentration analyzer. However, it has been shown that the sensitivity can be treated as constant when using a 30 min averaging interval [32]. This means that the ENVI’s output X (in mV) is proportional to the absolute ambient O3 concentration over a 30 min period. Based on this assumption, the O3 deposition velocity (Vd), defined as the O3 flux divided by O3 concentration, can be calculated by the following equation [33]:where is the O3 mass density (or concentration; μg·m−3), is the vertical wind speed (m·s−1), and VG is the output signal (mV) of the fast-response O3 analyzer. The overbar denotes the time average, and the primes indicate the temporal fluctuation of each variable. The O3 vertical turbulent flux Fo (μg·s−1·m−2) between atmosphere and surface is calculated by the following equation:where xo is the mean O3 mixing ratio (ppb) measured by the slow-response O3 analyzer, P is the barometric pressure (N·m−2), T is the air temperature (K), R is the universal gas constant (8.314 J·mol−1·K−1), and Mo3 is the molar mass of O3 (48.0 g·mol−1).

In practice, all fluxes were calculated by EddyPro® software with a series of corrections. Double rotation was utilized to correct tilt errors [34]. O3 flux loss caused by time delay was corrected by the maximum covariance method [35]. Only the effect of density variation caused by water vapor on O3 flux was considered [14, 36]. The more detailed description of observations, calculations, and corrections can be found in Zhu et al. [14].

2.3. Artificial Neural Network Modeling

In a general model, variable(s) and the independent variable can be expressed as one or more analytical expression(s). In contrast, artificial neural networks (ANNs) are capable of recognizing and simulating complex relations without any analytical expression(s) of variables and outputs. The relation is produced by the ANN that has been trained. Although there are many types of ANN (e.g., Hopfield networks and Kohonen networks) and many algorithms to be used to train ANN, the feedforward network (FFN) with backpropagation (BP) algorithm is the most popular and has been used in more than other types of neural networks for a wide variety of problems [26, 3740]. The feedforward BP networks have a simple structure for simulating complex systems, and its structure is sufficiently robust to simulate many nonlinear systems [27, 41]. Therefore, the BP network was used in the study. Figure 1 shows a schematic of the three-layer BP network. The nonlinear elements or neurons are arranged in successive layers, and the information flows from input layer to output layer through the hidden layers. The number of inputs depends on the model performance, and theoretically, more inputs increase model performance. The hidden layers are another important parameter in a BP network.

Because of the complexity of the ANN calculation process, the ANN core algorithms used in this study are adopted from the MATLAB software package. However, the ANN requires a prepared dataset and the determination of parameters. The general steps for the setup of the BP network and parameters are described as follows:(1)The determination of independent variables (Xi, i = 1, 2, …, n). Although ANNs do not require the specification of dependent variables and independent variables, variables should have a certain relationship in terms of mechanism(s) or variation patterns. Based on the previous studies and observed items, we attempt to build a series of ANN models. These models have different combinations of input variables to determine the most relevant input variables for modeling O3 flux. The input variables include micrometeorological variables, radiation values, and fluxes.(2)The determination of the number of nodes or “neurons” in the hidden layer. This is related to experience or examinations. In general, this number should be larger than the number of Xi.(3)The generation of a network by training. The training dataset and parameters are the keys to generate a good BP network. The training dataset must be representative and of high quality, and the parameters should be suitable. The training process will determine a set of optimal weights and biases. Although there are different transfer functions, such as linear function and a threshold function [38], sigmoid transfer function is the most popular choice for many studies [40]. This transfer function was also used for the neurons in the hidden layer. Using the MATLAB software package, the weights and biases in the ANN are obtained in an iterative calibration process based on the Levenberg-Marquardt (LM) algorithm. The root-mean-squared error is used to monitor the performance of modeling. Normally, the validation error decreases during the initial phase of training and begins to rise when the network begins to overfit the data. To avoid overfitting, the weights and biases of the network can be automatically saved when the validation error reaches a minimum.(4)The evaluation of the ANN model using training and test datasets. In this study, three statistical parameters, i.e., root-mean-squared error (RMSE), mean absolute error (MAE), and coefficient of determination (R2), were used to evaluate the performance of models. RMSE and MAE present information about the degree of accuracy of an ANN model, and R2 measures the degree of the linear relationship between measured and modeled O3 fluxes.

The three parameters were calculated with the following equations:where Yi and Fi are the simulated and measured O3 flux, respectively; the overbar denotes the average of each variables, and N is the sample number.

2.4. Dataset Preparation

To create a satisfactory ANN model, a high-quality dataset is very important. Due to different reasons, all variables, especially the measured O3 flux, presented many low quality or erroneous data. In this study, the selected data for training and testing satisfy the conditions as follows. Approximately 58.5% of the total data points in the set are low quality or erroneous, and they were filtered using the following metrics.(1)Only daytime data were adopted. During nighttime, O3 concentration is generally low and wheat stomata are closed, and the effect of O3 on wheat can be ignored. Nighttime fluxes also exhibit a significant amount of inaccuracies because of weak turbulence [42].(2)Ozone fluxes are between −40 nmol·m−2·s−1 and 0 nmol m−2·s−1. As O3 is always deposited downward on the surface, the positive O3 fluxes represent errors. The initial threshold value of −40 nmol·m−2·s−1 was empirically determined via the time series of O3 flux. Moreover, the threshold was variable based on the wheat status. For example, the threshold was set to −20 nmol·m−2·s−1 in the first stage of wheat. In addition, some rapid spikes of O3 flux were deemed unlikely based on the underlying mechanisms. These data points were deleted even though they did not exceed the threshold value.(3)The data of all inputs at a time are completed. Some input variables, especially the turbulent fluxes, have errors or gaps due to various reasons. To keep training data consistent, these data were also excluded.

Finally, 1402 data points (41.5% of total O3 flux data) were used to train and validate all ANN models. Half of the high-quality dataset (701 data points) was selected for ANN training, and the other half of-high quality data were used for the test dataset. According to the MATLAB software manual, the training dataset is randomly divided into three subsets, i.e., the training set, the validation set, and the test set. Their ratios are 70%, 15%, and 15%, and the corresponding data points are 491, 105, and 105, respectively. The training set is used for computing the gradient as well as updating the network weights and biases. The validation set is used to validate the ANN performance, and the test set is used to compare different models.

3. Results and Discussion

3.1. Input Variables Choice and Its Performances

Although an ANN model is only dependent on the mathematical relations of inputs and outputs, variables that are irrelevant or lack a physical basis should be excluded. Based on equation (1), O3 flux is the product of O3 concentration and deposition velocity (Vd). For O3 concentration, the production and decomposition result from a series of complex chemical and physical processes. The local O3 concentration variation mainly depends on its precursors (e.g., VOCs and NOx), long-range transport and a subset of environmental factors, such as radiation, temperature, and humidity [4345]. Vd is affected to a certain extent by atmospheric turbulence intensity and underlying surface conditions, such as wind speed, friction speed, soil moisture, O3-reactive chemicals, and vegetation status [4648].

Due to a lack of observations for O3-reactive chemicals, only some micrometeorological and radiation variables were selected to model Fo variation in this study. The initial input variables for attempting to model Fo include O3 concentration (), global radiation (Q), air temperature (Ta), relative humidity (RH), wind speed (U), and soil water content (SW). In addition, O3 flux access to stomata is coupled with the CO2 flux (Fc) and water vapor fluxes (LE). These two fluxes are also deployed in our attempts to model Fo. Because of the high correlation between the mean diurnal pattern of many variables and time [30], the effect of time (Tm) on the ANN model was also investigated in this study.

To test the dependency of each variable on O3 flux, the O3 flux with one variable was first simulated, and then, O3 flux with different variable combinations were simulated. Table 1 presents the statistics for each variable and variable combinations. For any single micrometeorological variable, the RMSE values exceed 5 ng·m−2·s−1 and all R2 values are less than 0.4. This means that any single variable cannot accurately model O3 flux. Among these variables, O3 concentration and radiation have relatively strong correlations. For other turbulent fluxes (LE, Fc), ANN-modeled Fo values were strongly correlated with Fc and LE, but modeled Fo values were weakly correlated with sensible heat flux (H). This highlights the fact that O3 flux is mainly affected by the stomata flux, as stomata are the main pathway of LE and Fc. Although these fluxes have similar diurnal patterns, Fc and LE cannot be considered as the driving factors of O3 deposition. The O3 deposition process is complex, and changes in the process are mainly driven by many meteorological or environmental factors.

To obtain an optimal ANN model, we attempted to build more ANN models with different combinations of input variables. The combinations can be divided into two classes. Class 1 combinations only consist of routine micrometeorological and radiation factors (Ta, RH, U, Q, SW, and ), i.e., models T11 to T15 in Table 1. Meanwhile, we also tested the effect of time (T16). The variables in Class 1 are easy to observe and possess sufficiently high accuracy. The combinations can increase the fitness of observed and ANN-modeled Fo, even if the combination only uses meteorological variables (T12). Class 2 combinations add the turbulent fluxes (H, Fc, and LE) in addition to the variables in Class 1, i.e., models T17 to T20 in Table 1. Although the correlation levels increase slightly in Class 2 models when compared to the models of Class 1, the effect of these fluxes on Fo gap filling is very limited. When there is a gap in Fo data, other fluxes (Fc and LE) often exhibit simultaneous gaps in their observational datasets. Meanwhile, the flux data have lower accuracies than the micrometeorological and radiation variables. Among the two classes of combinations, we select T15 and T20 as examples to analyze their performances as well as the contributions and responses of each variable to O3 flux.

Figure 2 shows a scatter plot of measured and modeled O3 fluxes using ANN models T15 and T20 for the training dataset and test dataset. The modeled O3 fluxes in these two models match the measured O3 flux to a high degree. There are not obvious deviations from the 1 : 1 line. Although the first model contains only six routine variables (no turbulent fluxes), the model produced an R2 of 0.826 and 0.767 for the training dataset and test dataset, respectively. The fitness of model T20 is marginally superior to the fitness of T15. The R2 values of T20 are 0.881 and 0.839 for the training dataset and test dataset, respectively. The R2 can be interpreted as the ratio of the variances for the modeled flux and the real flux.

3.2. Effect of ANN Schema on the Model Performance

In addition to the selection of variable combinations, the training data quality and architecture parameters also affect the performance of an ANN model. The ANN model can be considered as a black-box, so the performance is strongly dependent on the training dataset quality. Network architecture consists of numbers of layers, number of nodes, and weight calculation method. Generally, greater numbers of layers and nodes produce better performance. However, to avoid overtraining, the ANN training process requires an appropriate learning rate (weight adjustment steps) and a stopping criterion [16].

Compared to other types of statistical models, ANN models are difficult to replicate. Even if the same dataset and parameters are used, there are slight differences in the modeled results between different runs. The results after training vary slightly because the neural network is initialized with random weights. As an example, Figure 3 shows the daily variations of O3 flux measured on May 12 with EC method and ANN modeled with T15. It is clear that the ANN modeled Fo each run are different, and they are around the measured Fo. To mitigate these uncertainties, we calculated the average of 10 runs with the same variables combinations and parameters to generate the modeled O3 flux. Table 2 compares the statistics of ANN-modeled Fo with T15. As shown, statistical indicators of 10-run-averaged are better than separate indicators. The RMSE and MAE are the smallest, and the R2 is the largest.

Another problem that arises in ANN modeling is so-called overtraining or overfitting. An overtrained network cannot learn the general traits that exist in the training set, and the network will lose the capacity to generalize. Three important parameters can be used to avoid this phenomenon: the number of hidden layers, the number of epochs, and the number of neurons in each hidden layer. Although these parameters are crucial to a model, there are no definite rules on how to determine them. In the ANN package of the MATLAB software, the number of hidden layers and the number of neurons in each hidden layer can be assigned. To decrease the number of ANN parameters, a feedforward network with one hidden layer and a sufficient number of neurons was used in this study. This satisfies any finite input-output statistical modeling [41]. The optimal number of epochs (iterations) can be determined automatically by the software based on the best performances. Therefore, only the optimum neurons should be determined by testing the performance of an ANN model with different neurons.

Figure 4 shows the variations of MAE, RMSE, and R2 for the models T15 and T20 as the number of neurons change. It can be seen that the MAE and RMSE of the two ANN models rapidly decrease when the number of neurons changes from 1 to 10, and then the variations are slower and more stable. The minima for MAE and RMSE appeared when there were 17 neurons. The performance of R2 is similar (Figures 4(b) and 4(d)). Therefore, the number 17 was determined to be the optimal neuron number for the hidden layer in our analysis.

3.3. Response of Fo to Variables Based on ANN Models

All parameters of an ANN model are fixed once training is finished. The responses of an ANN model can be evaluated by varying a single variable within its measured range and keeping other inputs fixed at their mean values [30, 37]. Figure 5 demonstrates the details of how O3 flux varies across the range of all variables within models T15 and T20. The ranges (minimum and maximum) and mean values of each input variables in the training dataset are given in Table 3.

By excluding the effects of turbulent fluxes, the effect of each environmental variable on Fo can be seen more clearly (Figure 5(a)). Positive responses between O3 concentration, temperature, solar radiation, and relative humidity are shown clearly. After considering the effects of turbulent fluxes (Figure 5(b)), the responses of Fc and LE are very obvious, whereas the responses of Ta, RH, and SW are very weak. Moreover, O3 flux shows a unidirectional variation with most variables, such as O3 concentration, wind speed, temperature, and humidity. Nevertheless, the variation of Fo with soil water content (SW) shows a different pattern (Figure 5(a)). O3 deposition decreased when soil water content was too high or too low. The results might be unreliable and should be explained carefully. One possible reason for this pattern is that the change in soil water content is slow over the course of a day and there is no clear diurnal variation. The response of SW and Fo in the ANN models mainly reflects the long-term variation of O3 flux. For other variables, the diurnal variation amplitude is larger than that of long-term variations. A simple expression that relates Fo and the various variables could not be given by the ANNs, as the response of Fo to a single variable may depend on the values of the other inputs. However, the response curves of variables can basically reflect the effect of each variable on O3 flux.

The response curve can also reveal the importance of each variable [41]. This method is called the “profile method” [25]. In this study, the importance of each variable is described by comparing the variation ranges of modeled values when a variable is varied from its minimum to maximum and other variables are fixed at their mean values given in Table 3. A larger variation in Fo indicates a higher parameter importance. Excluding the effects of fluxes, the order of variable importance in model T15 is as follows:  > U > Q> Ta > RH > SW (Figure 5(a)). O3 concentration, wind speed, and radiation are the important factors affecting ozone deposition over wheat fields. In addition, to quantitatively reflect the effect of each variable, the relative importance (RI) of each variable was calculated [39, 49, 50]. The RI of , U, Q, Ta, RH, and SW are 26.7, 21.6, 20.6, 12.4, 11.3 and 7.4, respectively.

Using equation (2), it is easy to understand that the ozone concentration is the most important factor affecting ozone flux. Other factors controlling ozone deposition, such as solar radiation, temperature, humidity, and soil water content, have also been studied [47, 51]. Wind speed is another important factor affecting ozone deposition in this study. Although wind speed does not directly result in the increase of ozone deposition, it does enhance atmospheric turbulence. This causes more ozone in the air to enter crop stomata. In fact, radiation indirectly influences ozone deposition by influencing other model inputs. It results in the production of local ozone and leads to the changes in temperature, humidity, wind speed, and other environmental factors.

When considering the effects of fluxes, the order of variable importance in model T20 is as follows: Fc >  > U > LE > Q > Ta (Figure 5(b)). As most CO2 and water vapor are exchanged via the same pathway of stomata [52, 53], there are strong corrections between Fo and other gas fluxes (Fc and LE). Stomatal uptake might play an important role in ozone deposition. In model T20, the effect of Q on ozone flux becomes small. This might be because the effects of radiation are replaced by these fluxes. In other words, when the effects of CO2/H2O fluxes are taken into account, the correlations between Fo and these fluxes are more significant than those between Fo and radiation.

4. Conclusions

Based on the EC-measured Fo and other environmental factors over a wheat field, an artificial neural network method was attempted to simulate the relationships between Fo and environmental factors. Our conclusions can be summarized as follows:(1)Generally, more input variables can increase the performance of a model. By only using the routine micrometeorological variables of radiation and O3 concentration, we can create an ANN model to simulate the variation of Fo. The R2 of a model with 6 routine independent variables can exceed 0.8 for the training dataset and test dataset. CO2 flux and water vapor flux have strong corrections to Fo, and they can improve the fitness of the ANN models when they are considered as independent variables.(2)In addition to variable combinations and training data, the model parameters (e.g., the number of neurons) are also a source of uncertainties in an ANN model. The effect of neuron number on our model fitness (MAE, RMSE, and R2) is obvious when it changes from 1 to 10. Empirically, one hidden layer with 17 neurons produced the best results.(3)The ANN model is composed of complex arithmetic expressions between Fo and independent variables, an optimal ANN model can roughly reflect their inner relationships and relative importance. Global radiation, O3 concentration, and wind speed are the important factors that affect O3 deposition. There are strong correlations between Fo and gas fluxes (Fc and LE). The ANN method represents a valuable route for filling the gaps of a time series of Fo during the experimental period with micrometeorological methods.

Data Availability

The MS-Excel data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (grant number 41675147) and the National Key R&D program of China (grant number 2017YFC0503801). The author would also like to thank Accdon for providing linguistic assistance during the preparation and revision of this manuscript.