Journal of Electrical and Computer Engineering

Volume 2017, Article ID 3824086, 7 pages

https://doi.org/10.1155/2017/3824086

## Acoustic Log Prediction on the Basis of Kernel Extreme Learning Machine for Wells in GJH Survey, Erdos Basin

College of Computer Science and Information Engineering, Tianjin University of Science and Technology, Tianjin, China

Correspondence should be addressed to Jianhua Cao; nc.ude.tsut@hjoac

Received 24 November 2016; Accepted 22 January 2017; Published 22 February 2017

Academic Editor: Hui Cheng

Copyright © 2017 Jianhua Cao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

In petroleum exploration, the acoustic log (DT) is popularly used as an estimator to calculate formation porosity, to carry out petrophysical studies, or to participate in geological analysis and research (e.g., to map abnormal pore-fluid pressure). But sometime it does not exist in those old wells drilled 20 years ago, either because of data loss or because of just being not recorded at that time. Thus synthesizing the DT log becomes the necessary task for the researchers. In this paper we propose using kernel extreme learning machine (KELM) to predict missing sonic (DT) logs when only common logs (e.g., natural gamma ray: GR, deep resistivity: REID, and bulk density: DEN) are available. The common logs are set as predictors and the DT log is the target. By using KELM, a prediction model is firstly created based on the experimental data and then confirmed and validated by blind-testing the results in wells containing both the predictors and the target (DT) values used in the supervised training. Finally the optimal model is set up as a predictor. A case study for wells in GJH survey from the Erdos Basin, about velocity inversion using the KELM-estimated DT values, is presented. The results are promising and encouraging.

#### 1. Introduction

Oil and gas exploration in sedimentary basins is very complicated, since all the targets are buried underground and they cannot be viewed or touched directly. So all the properties for the buried targets have to be predicted or estimated by using modern electrical or magnetic tools. The physical properties of the geologic formations include pore-fluid pressure, rock lithology, porosity, permeability, and oil or water saturation. Nowadays the conventional tool for characterizing these geophysical properties is well logging, and some logs such as gamma ray (GR), dual induction log, formation density (DEN) compensated, deep resistivity (REID), self-potential (SP), and sonic log (DT) are usually recorded. Among them, the sonic log (DT) has largely been used to predict rock porosity, to perform petrophysical analysis, or to carry out well-to-seismic inversion.

Owing to historical operation mistakes or recording loss, the sonic log may not be available in well logging suites. The traditional way solving this problem is to transform the DEN or REID log to DT log based on some experimental formula built between these logs. It might be feasible for some area, but sometimes the errors are unacceptable.

Artificial intelligence techniques have the advantage in connecting unrelated parameters and solving nonlinear problems. Such techniques, including BP neural network, fuzzy reasoning, or evolutionary computing for data analysis and interpretation have become effective tools in the workflow for well drilling and reservoir characterization [1–10]. However, traditional neural networks have many known drawbacks in the learning process, such as multiple local minima, slow learning speed, and poor generalization performances [11].

Extreme learning machine (ELM) is a single-hidden layer feed-forward neural network (SLFN) proposed by Huang et al. [12, 13]. The ELM approach to training SLFN consists in the random generation of the hidden layer weights, followed by solving a linear system of equations by least-squares for the estimation of the output layer weights. This learning strategy is very fast and gives good prediction accuracy. Theoretically and practically, this algorithm can produce good generalization performance in most cases and can learn thousands of times faster than conventional popular learning algorithms for feed-forward neural networks [14]. A lot of real-life applications [15–18] have already demonstrated advantages of using basic ELM. A kernel-based ELM (KELM) has also been developed lately [19], where the hidden layer feature mapping is determined by the kernel matrix. In this version, only the kernel function and its parameters are needed to be defined; the number of hidden nodes is not required. With the use of kernel function, KELM is expected to achieve better generalization performance than basic ELM. Furthermore, as randomness does not occur in KELM, the chance of result variations could be reduced [20].

In this paper, kernel-based extreme learning machine is used to predict missing sonic (DT) logs when only common logs (e.g., natural gamma ray—GR, bulk density—DEN, or deep resistivity—REID) are available. By using KELM, we first create and train a supervised network model based on experimental data and then confirm and validate the model by blind-testing the results. The optimal model is at last applied to wells containing the predictor data but with lack of DT log. We use this workflow in GJH survey from Erdos Basin and the KELM-estimated DT logs are then integrated in the seismic inversion to identify the sandstone reservoir.

The rest of this paper proceeds as follows. Section 2 gives a short review of ELM and KELM. Section 3 describes the experiments using KELM, including the data preparation, parameter selection, and model validation. Section 4 gives the prediction application in GJH survey. Finally, Section 5 gives the conclusion of this work.

#### 2. Methodology

In this study, the kernel extreme learning machine (KELM) is employed to predict the DT logs for the wells in GJH survey. So we present an overview of the ELM and kernel-based ELM as follows.

##### 2.1. ELM

The classical ELM was proposed for SLFNs by Huang et al. [12, 13]. Different from BP network, the input weights and biases of ELM are randomly assigned and need not be fine-tuned within the training phase, and the output weights can be determined analytically by finding the least-square solution. The prediction of ELM is given by where is the weight vector connecting the hidden node and the output nodes and is the output of the hidden layer with respect to the sample . Since the weights and biases are initially assigned for the hidden layer, when the activation function is set, is determined and need not be tuned. And the only unknown parameter is , which can be solved as constrained optimization problem:where is control parameter for a tradeoff between structural risk and empirical risk, is the target output for the network. And when and , a popular and efficient closed-form solution for is

##### 2.2. KELM

As proposed in Huang et al. [19], if is unknown, that is, an implicit function, one can apply Mercer’s conditions on ELM and define a kernel matrix for ELM that takes the formwhere is a kernel function. Many kernel functions can be used in kernel-based ELM, such as linear, polynomial, and radial basis function, so that we can obtain the kernel form of the output function as follows:

Similar to the SVM, need not be known; instead, its kernel can be provided (e.g., Gaussian kernel . The optimal penalty parameter and kernel width are determined by try and error way. Node number of the hidden layer need not be available beforehand either. The experimental and theoretical analysis of Huang et al. showed that KELM produces improved generalization performance over the SVM/LS-SVM [21].

For the given type of the kernel function, the training dataset, and the initial parameters of the network, the following steps are considered.

*Step 1. *Initiate the population based on the kernel function.

*Step 2. *Evaluate the fitness function of each parameter.

*Step 3. *The optimal parameters of kernel function can be determined. Then, based on the optimized parameters, the hidden layer kernel matrix is computed.

*Step 4. *Determine the final output weights.

#### 3. Experimental Study

##### 3.1. Problem Description and Related Work

Well logging is the practice of making a detailed record of the geologic formations penetrated by a borehole. Normally the log is based on the physical measurements made by instruments lowered into the borehole. According to the geophysical properties of the rocks, the logs are always classified as follows: electrical logs, porosity logs, lithology logs, and miscellaneous logs. Sonic log (DT) belongs to the porosity logs, and it provides a formation interval transit time, which typically varies lithology and rock texture, especially porosity for the rocks. Gamma ray log is a log of the natural radioactivity of the formation along the borehole, measured in API units, particularly useful for distinguishing between sands and shales in a siliciclastic environment. This is because sandstones are usually nonradioactive quartz, whereas shales are naturally radioactive due to potassium isotopes in clays and adsorbed uranium and thorium.

The main datasets used in this study include acoustic log (DT), the gamma ray (GR), the resistivity log (REID), which represents the variation of the electric resistivity, the density (DEN), which records the density variation with depth in the borehole, and the self-Potential (SP), a measurement of natural electric potential. These geophysical parameters DT, GR, REID, DEN, and SP are intrinsically linked, since each of them reflects some physical property of the same rock layer. Take sandstone as an example. Pores are sure to exist at the sandstone interval, and if the pores are not filled with other types of tight materials, fluid is the only also important stuffing. There might be oil or gas and water as well. Since the fluid has different physical parameters than the surrounding sandstone, obvious differences will be recorded on the measuring logs: lower GR, lower DT, higher REID, lower DEN, and abnormal change on SP. Thus just observing the characters of the logs, especially those abnormal changes, the experienced researchers have confidence to tell the geological information along the borehole. And then some researchers try to build theoretical relationships between the logs. Thousands of experiments result in empirical equations. For example, DEN could be transformed using DT log when DEN is missing and the relation is defined as Gardener formula [6]:where, *α*, *β* are the coefficients and their values are up to the core tests for the studied area.

In this study, the key we focus on is the DT log, and we want to find the optimal way to get the DT log when it is missing.

The sonic log (DT) is very important in petroleum exploration phase. One way for using DT is to estimate rock porosity, which is the critical parameter for the reservoir evaluation, and identify the fluid information along the borehole. Additionally, since DT log has both time and velocity information, it becomes the reliable key for the time-depth conversion when using seismic data to interpret structures and geological mapping. In one word, the DT log is indispensable for the geophysical and geological study.

But there has always been imperfection, and sometimes, owing to operation mistake or recording loss, DT log may not be available in some wells. One solution for obtaining the DT log is to carry out empirical transformation from other logs, and the model is built by experiment analysis. The formula is just for specific field condition, and it can not be used for all the formation conditions. For instance, Faust formula is just for DT calculation using REID log, and cases [7] have shown that the formula is not suitable when fluid exits in the formation. So another study to synthesize missing DT is to use soft-computing methods, such as artificial neural network, gene expressing programming, and fuzzy reasoning. ANN (artificial neural network) has been frequently used in petrophysical properties estimation, and results show satisfied performances when choosing proper models and parameters [8, 9, 16]. The most important property of ANNs is their ability to approximate virtually any function in a stable and efficient way. By using ANNs, it is possible to create a platform on which different models can be constructed. Baziar et al. [22] tested coactive neurofuzzy inference system which combines fuzzy model and neural network in permeability prediction in a tight gas reservoir and gained convincing results.

Since DT has intrinsic links with the other geophysical logs, researchers often use logs like GR, REID, and so forth as the original inputs and the DT as outputs. Linear and nonlinear relationships have been set up using the soft-computing methods. But the results are not always satisfied. Thus our purpose is to build an optimal and reliable relationship between those geophysical logs and DT log.

In this paper, we investigate the capability of a kernel extreme learning machine in building the nonlinear mathematical model that best explains DT (target) as a function of GR, REID, DEN, and SP (predictors).

##### 3.2. Data Preparation

In order to validate the use of KELM in the context of log data recorded in oil and gas wells, we employed datasets obtained from seven wells drilled in the GJH survey in Erdos Basin.

The study involves the following well logging parameters: gamma ray (GR), deep resistivity (REID), self-potential (SP), formation density (DEN), and sonic log (DT). Among the wells, wells of YQ2, Y209, S211, S212, and S215 have full suites of well logs, while DT log is not available in the other two wells (S219 and S205). According to the evaluation conclusion for the logging process, we choose the farther four wells as training dataset sources and well S215 as the testing dataset. Shanxi group of the Permian formation is set as the analysis interval. Logs of GR, REID, SP, DEN, and DT in the interval from the mentioned four wells are collected and grouped as training dataset, while logs of well S215 as the validation target.

Figure 1 is the example of logs showing of well YQ2 in the Shanxi group of Permian formation ranging from 2700 to 2798 meters. The lithology includes sandstone, mudstone, and thin coal layer, and it is easy to differentiate them from the GR log. Coal layer has very low GR and DEN response and abnormal high DT and REID. Thus, for the same type of rocks, these logs have close geophysical link, which is the foundation for DT prediction using these logs.