Abstract

Accurate interpretation of geological structures inverted from gravity data is highly dependent on the coverage of the recorded gravity data. In this work, Artificial Neural Networks (ANNs) are implemented using Levenberg-Marquardt algorithm (LMA) to construct a background density model for predicting gravity data across Northern Cameroon and its surroundings. This approach yields statistical predictions of gravity values (low values of errors) with 97.48%, 0.10, and 0.89, respectively, for correlation, Mean Bias Error, and Root Mean Square Error for two inputs (latitude, longitude) and 97.08%, 0.13, and 1.14 for three inputs (latitude, longitude, and elevation) for a set of anomalies as output. The model validation is obtained by comparing the results to other classical approaches and to the computed Bouguer, lineaments, and Euler maps obtained from measured gravity data. The depth of most of the deep faults and their orientation are in agreement with those obtained from other studies. The results achieved in this study establish the possibility of enhancing the quality of the analysis, interpretation, and modeling of gravity data collected on sparse grid of recording stations.

1. Introduction

The Northern Cameroon and its surroundings, the subject of this study, have prompted many researchers and prospectors to identify superficial and deep structures and to indicate their geodynamics and tectonic implications [15]. The vast majority of these studies used the data from surveys carried out by French Research Institute for Development (IRD) and other private and public institutions. Unfortunately, these data are scattered and unevenly distributed over the Cameroonian territory since they were mostly collected along available roads. Researchers have used interpolation techniques to extract relevant information from uncovered areas, such as Generic Mapping Tool (GMT) [6], minimum curvature [7], kriging [8, 9], the least-squares method [10], and finite element approach through cubic B-spline function [11]. These methods have yielded interesting results and allowed in specific cases extracting relevant information from uncovered areas. However, the reliability and robustness of these conventional interpolation techniques are hampered by a limited number of input variables and a small sample size. These limit the validity of the reconstructed density models to narrow areas. In order to construct models that cover large areas more robustly, we propose to use ANN technique through Levenberg-Marquardt algorithm.

Haykin [12] defined Neural Network as a massively parallel distributed processor made up of simple processing units that has a natural propensity for storing experiential knowledge and making it available for use like the brain. They have recently gained in popularity in geophysics and have been applied to a variety of problems. Geophysical problems such as seismic waveform recognition [13], first-break picking and trace editing [14], earthquake prediction [15, 16], electromagnetic [17], magnetotelluric [18], seismic inversion [1921], shear-wave splitting [22], well-log analysis [23], seismic deconvolution [24, 25], seismic discrimination [26], and seismic signals detection and classification [27] used ANNs to forecast the unknown. Grêt et al. [28] and Ghalambaz et al. [29] performed gravity interpretations in two dimensions and event classification, respectively, to discriminate bodies of a similar kind of anomaly and approximate shape parameters like depth, vertical extension, and radius. They use a Hybrid Neural Network and Gravitational Search Algorithm (HNGSA) to solve Wessinger’s equation [29]. In this paper, we develop a background density model using ANN to extract gravity anomalies with their geographical variables (latitude, longitude, and elevation). The model integrates input and output data of an area covering North Cameroon and its surroundings (Figure 1) measured by IRD. In addition, a model validation is carried out by establishing Bouguer and residual maps of the studied area and then comparing them to those obtained by ANN. With this approach, we expect the estimation of gravity values in uncovered areas during data acquisition. The results show that ANNs can be used to interpolate data of uncovered area and increase the resolution of geological structures in poorly covered areas.

2. Geology of the Area

The data used in this study cover three main domains: northern extension of Benue trough in the East of Nigeria, Northern Cameroon, and West of Chad (Doba basin). Fairhead and Okereke [30] listed four important events in West Central Africa Rift System: extensional, compressional, tectonics, and subsidence. The surface geology (Figure 2) is mainly composed of the Precambrian basement made up of migmatites and anatectites of the Mokolo unit [31] and formations of the Pan African Mobile Belt. Old sediments (Early Cretaceous in age) of 4 km thickness cover the Garoua sedimentary basin [4] while young sediments (lower cretaceous in age) cover the Precambrian basement [32].

3. Material and Method

3.1. Data Collection and Packaging

The measurement of gravity fields can be used to calculate relative and absolute values regarding variations of the fields across earth surface. It is linked to frameworks like Global Positioning System and Digital Terrain Model. The data are treated with respect to the equipment used (for example, Scintrex CG3/CG5 relative gravity meters or Micro-g Lacoste and Romberg as shown in Figure 3).

The data is processed to remove undesirable influences from the surroundings in order to isolate Bouguer anomaly (BA). It represents the difference between the measured and calculated gravity (formula (1)). BA is then modeled to derive geological features for characterizing, quantifying, and interpreting the mass or density distributions in the soil.where represent the latitude, longitude, and elevation, respectively, measures the gravity, represents the theoretical gravity, , , and stand, respectively, for free air, slab/curvature, and topography corrections.

We aim to develop a connectionist model (Neural Network) to correlate Bouguer anomalies with the geographical variables. In general, an array of Artificial Neural Networks is a juxtaposition of unitary, functional, and interconnected elements [33, 34]. There are a multitude of possible arrangements [35]. We choose to use the Multilayer Perceptron (MLP). It is mostly used in time series prediction because of its general property of being universal parsimonious approximator [36]. In its architecture, the neurons are organized in layers as shown in Figure 4.

The inputs , k=1,...,K are multiplied by weights and summed up together with the constant bias term . The resulting is the input to the activation functions g and f. The activation function is originally chosen to be a relay function, but for mathematical convenience a hyperbolic tangent (tanh) or a sigmoid function is most commonly used.

The output y of the MLP network is given by equation below:

From (2), the MLP network is a nonlinear parameterized map from input space to output space y (here m=3). The parameters are the weights and the biases. f and g are activation functions defined in advance. In our study, we have used tansig for the hidden layer and purelin for output layer. Given input-output data, (, y), 1,..., N, finding the best MLP network is formulated as a data fitting problem. The parameters to be determined are (, ).

From an arbitrary weight (random value) defined at the beginning, the weights are adjusted by backpropagating the error according to the expression:wherewhere the local gradient is defined in In (5), is the difference between the output y and target d values, as shown in E(n) is the sum of the quadratic errors observed on the set of output neurons written as

3.2. Shape of MLP

The data set is composed of 2922 samples which comprise latitude, longitude, elevation, and the corresponding Bouguer anomalies. These data are extracted from a database computed for the whole Cameroon by Collignon [37] and Poudjom-Djomani [38]. They cover the study area, the Northern Cameroon, and its surroundings located between longitude of 12° and 16°E and latitude of 8° and 12°N (Figure 5).

60% of these data (1754 samples) are used for training, 20% of the data (584 samples) are used for the validation, and 20% (584 samples) for testing the models. Data conditioning processes are conducted to speed up the training ANNs, which includes interpolating missing data, normalizing the data, and then randomizing them. Usually, the missing data are calculated imprecisely by averaging the neighboring values. In this study, the missing values are forecasted by ANN.

3.3. Training Algorithm

The diagram (Figure 6) stresses the steps to follow when implementing this scheme of ANN using Levenberg-Marquardt algorithm scheme:The algorithm adjusts the weights according to (8) where J is the Jacobian, μ is positive and called combination coefficient, I is the identity matrix, and e the error vector. This algorithm takes more memory but less time. Training automatically stops when generalization stop improving, as indicated by an increase in the mean square error of the validation sample.

3.4. Results and Discussion

To quantitatively evaluate the ANN and verify its trend, we conduct statistical analysis involving the coefficient of determination (R2), the Root Mean Square Error (RMSE), and the Mean Bias Error (MBE). The network structure identification is 2-190-1 and 3-370-1, respectively, for 2 and 3 inputs where the first number indicates number of neurons in the input layer, the last number represents neurons in the output layer, and the numbers in between represent neurons in the hidden layers. We present the best achieved results for the MLP ANN models (Figures 7 and 8).

As shown in Figures 8 and 9, we observe a good match on the plot for regression for all data in both networks, where R2 has values of 0.95027 and 0.94254, respectively, for two and three inputs; only a few data are not too close to the fitting line. There is a slight difference between the two models; that with two inputs yields a suitable correlation of 97.48% with less neuron in the hidden layer whereas the model with three inputs yields a 97.08%, coefficient of determination.

MBE and RMSE yield very low values as shown below:(i)For model 1 (model inputs L and l) the network structure is 2-190-1 for 0.95027, 0.10, and 0.89 representing, respectively, R2, RMSE and MBE.(ii)For model 2 (model input L-l-h), the network structure is 3-370-1 and the values of statistical errors R2, RMSE, and MBE are, respectively, 0.94254, 0.13, and 1.14.

The results indicate that, for the test base, there is a very good correlation (Figures 9 and 10). This signifies the possibility of having values of anomalies for areas where they do not exist by giving the geographical location.

Comparing observed and simulated data, we can see as shown in Figures 9 and 10 a good match for two and three entries.

Model Validation. We now compare results obtained using ANNs to classical approaches and, in addition, Bouguer, Euler, and lineaments maps for two and three entries.

Comparing ANNs Results to Other Methods. For two entries, we compare Neural Networks with classical methods based on multiple linear regression with specific approaches developed in most software used in geophysics such as Surfer and Oasis Montaj (Table 1). Z, the anomaly, is given bywhere , , and are constants to be determined; and are geographical coordinates of a given point.

For three inputs, we use a classical multiple linear regression implemented through a matrix approach programmed [39] in Matlab or Excel to solvewhere , , , and are constants to be determined; , , and , respectively, are geographical coordinates and elevation of a given point.

For Multiple Linear Regression Analysis, the following constants were obtained:(i)Multiple Linear Regression Analysis with 2 inputs (MLRA2): ; ; (ii)Multiple Linear Regression Analysis with 3 inputs (MLRA3): ; ; ; .

In Figure 11, we represent Bouguer anomalies versus the outputs inferred from MLRA3. In addition, in Table 2 and Figure 12, we analyze the evolution of errors and Root Mean Square Error (RMSE) against the number of iterations and neurons in the hidden layer (NI and NNHL, respectively).

Plotting errors against iterations to us did not have any mathematical explanation; instead we plot errors against number of neurons in the hidden layer where it is obvious that one has low value of RMSE with increasing number of neurons in the hidden layer, unless the situation (above 190) where there is an overfitting (increasing RMSE) exists. The same conclusion arises for three inputs (Figure 13).

The results obtained show very good precision for Neural Network compared to classical approaches. Though ANNs can approximate any function, regardless of its linearity, they have some limitations such as their “black box” nature, greater computational burden, increasing accuracy by a few percent which can bump up the scale by several magnitudes (proneness to overfitting), and the empirical nature of model development (needs a lot of data for the training and cases for validation and test).

Comparing Measured and Calculated Bouguer, Euler, and Lineaments Maps. We generate and discuss the data obtained through ANN by establishing Bouguer, Euler (Reid et al. [40]; (formula (11)), and lineament maps from Bouguer data obtained after prediction through inversion using Oasis Montaj 6.3 software.where T is the total field of magnetic or gravity source detected at (x, y, z), B is the regional gravity or magnetic field, and N is the structural index value that needs to be chosen according to a prior knowledge of the source geometry.

Bouguer, Bouguer 2 and 3 entries maps (Figures 14(a), 15(a), and 16(a)) present structurally the same geological entities. A strict analysis makes it possible to distinguish between these maps: the positive anomaly structures (Waza, Maroua, South of Tcheboa, and Yagoua); the negative anomaly structures (Mogobé, south boundary of the area); and the gradient zones ensuring the transition between these anomalies of different signatures. These different features are the signature of the Precambrian, Early Cretaceous, and Tertiary-Quaternary rocks in the studied area. Transitions between structures require better materialization.

By using and comparing also Bouguer, Bouguer 2 entries, and Bouguer 3 entries, there is also a similarity between the lineament maps (Figures 14(b), 15(b), and 16(b)), thus highlighting tectonics in the area. Between Euler maps (Figures 14(c), 15(c), and 16(c)), the depths of the source structures of anomalies are described.

The lineaments obtained are later compared with existing results from other works. From Figures 1416, we have a network of faults (above forty) with similar strikes. The depths range from about 2.6 km (at Tcheboa, Garoua sedimentary basin) to about 18.7 km (north of Waza). The results obtained by inversion of gravity data match and complete those obtained by Mouzong et al. [41], Eyike et al. (2010), and Kamguia et al. [4].

The ANN based model for gravity anomalies is accurate for the prediction of these anomalies in Northern Cameroon and its surroundings.

4. Conclusion

In this paper, an Artificial Neural Network (ANN) model was estimated for the prediction of gravity anomalies using, respectively, two (longitude and latitude) and three (longitude, latitude, and elevation) inputs in Northern Cameroon and its surroundings along with the corresponding anomaly. Existing gravity data were used for training, validation, and testing of the Neural Network. With each of these inputs, we obtained a good correlation on the plot for regression for all data in both networks, where R2 has values of 0.95027 and 0.94254, respectively. In order to validate the model, results were compared to those from classical interpolation approaches; in addition Bouguer, Euler, and lineaments maps were compared to our prediction. Low values of MBE and RMSE indicate the effectiveness of the approach. We provide in this work new deep faults for the studied area. The model is promising for evaluating the gravity anomaly at a specific point where there is no measured value. This method can therefore be recommended in geophysics to improve the resolution of geological features for uneven coverage of recorded gravity data and also to reduce the cost of geophysical surveys.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors are indebted to IRD (Institut de Recherche pour le Developpement) for providing them with the data used in this work.