Abstract

The prediction of continued profile for static Poisson’s ratio is quite expensive and requires huge experimental works, and the discontinuity in the measurement and the limited applicability and accuracy of the present empirical correlations necessitated the utilization of artificial intelligence with its prosperous application in oil and gas industry. This work aims to construct different artificial intelligence models for predicting static Poisson’s ratio of complex lithology at real time during drilling. The functional networks (FN) and random forest (RF) approaches were utilized using the mechanical drilling parameters as inputs. This study uses a vertical well with 1775 records from complex lithology containing shale, sand, and carbonate for model building. Besides, a different dataset from another well was used to check the models’ validity. The results demonstrated that both FN- and RF-based models predicted static Poisson’s ratio with significant matching accuracy. The FN technique results’ correlation coefficient (R) value of 0.89 and average absolute percentage error (AAPE) values of 10.23% and 10.28% in training and testing processes. While the RF technique is outperformed, as illustrated by the highest R values of 0.99 and 0.94 and the lowest AAPE values of 1.89% and 5.19% for training and testing processes, the robustness and reliability of the developed models were confirmed in the validation process with R values of 0.94 and 0.86 and AAPE values of 11.23% and 5.12% for FN- and RF-based models, respectively. The constructed models developed a basis for inexpensive static Poisson’s ratio prediction in real time with significant accuracy.

1. Introduction

Poisson’s ratio (ν) is an elastic geomechanical characteristic defined as the ratio between transversal and axial strains. It measures the rock’s ability to recuperate from a deformation resulted by external strains and defines the relationship between these strains and the caused deformation [1]. Knowing Poisson’s ratio with the other rock elastic properties helps design hydraulic fracturing and interpret the in situ stresses, drilling performance, and wellbore stability [25], which help in optimizing the drilling operations, well planning, and development of completion and production strategies.

In practical, Poisson’s ratio varies from 0.0 to 0.5 and has two types: static (νst) and dynamic (νdyn). The static is determined experimentally from compressional tests, while the dynamic derived from the shear and compressional wave velocities from well logs. The high expenses and discontinuity in the νst measurement necessitated its prediction using empirical relationships. Different correlations were developed to predict the νst using νdyn or compressional and/or shear wave velocities (Vp and Vs) with different empirical coefficients, as summarized in Table 1.

The abovementioned empirical correlations have reliability limitations and depend on the availability of VP and VS. Since the artificial intelligence (AI) can be applied for inexpensively analysing and interpreting tremendous data and trends to obtain outputs in least time [9], it can be utilized to predict a continued profile of Poisson’s ratio.

The AI techniques became a hotspot in petroleum industry since the early nineties and applied in different aspects [10, 11]. The AI was applied in exploration to predict the feature recognition [12], travel time calculation [13, 14], and clarity and resolution of seismic data enhancement [15]. Several AI models were applied in drilling aspects such as operations monitoring, data correction, and prediction of drilling parameters and mud properties [1625]. Also, in reservoir and production engineering, the AI was applied in prediction of reservoir characteristics, inflow performance, production rate, and others [2638].

Different AI models were developed to estimate the νst, using the artificial neural network (ANN), support vector machine (SVM), alternating conditional expectation (ACE), fuzzy logic (FL), functional networks (FN), and adaptive neuro-fuzzy inference system (ANFIS), as listed in Table 2.

From a practical point of view, the drilling parameters, such as weight on bit (WOB), standpipe pressure (SPP), torque (T), rotation speed (RPM), pumping rate (Q), and rate of penetration (ROP) are linked to the formation features and measured at real time [4850], Consequently, these mechanical parameters can be correlated to νst as a geomechanical rock property.

This work aims to construct different models to predict real-time static Poisson’s ratio (νst) of complex lithology during drilling. The functional networks (FN) and random forest (RF) approaches were applied, and the drilling parameters including WOB, T, SPP, RPM, ROP, and Q were used as inputs. In the following sections, the datasets are described, analysed, and preprocessed. Then, the outputs of the developed models are presented and validated using different datasets.

2. Methodology

2.1. Data Description

Different datasets were obtained from an intermediate section with a 12.25” hole diameter of two vertical wells having penetrating formations with complex lithology of sand, shale, and carbonate. These datasets contained the real-time drilling parameters, including the WOB, T, SPP, RPM, ROP, and Q, which were measured at the surface while drilling and the corresponding νst from core experiments and conventional logs. The drilling parameters were used as inputs in the models, and the output is νst. The dataset from the first well containing more than 1775 records is used to construct the models, while the other well’s dataset with 762 records is only used to validate the developed models.

2.2. Data Analysis and Processing

The obtained data was studied and processed before using the AI approaches to examine the data quality, reliability, and representation. Statistics of the drilling parameters and νst are outlined in Table 3. An extensive range of mechanical drilling parameters and νst with good data distribution coverage were indicated from the statistics.

Moreover, the linear relationship strength between the drilling parameters and νst was addressed by defining the correlation coefficient (R) to measure the relative significance of the drilling parameters for νst, as illustrated in Figure 1. Although the correlation coefficient between st and SPP indicated almost no linear relation, the SPP varied significantly with changes of st. The same behaviour depicted the other parameters; nonlinear relation may exist. Accordingly, all drilling parameters will be considered as inputs.

Data filtration and cleaning were conducted to remove impractical values, noises, and outliers, which improve the data quality and confidence. The tools’ limitations, feasible ranges of drilling parameters, and statistical analyses were accounted in these preprocesses.

2.3. Models’ Development

The cleaned data was divided into 70/30% (1243/532 datapoints) to train/test the models on a random basis. The FN- and RF-supervised learning AI techniques were utilized using Python software with the Scikit-Learn® framework. Sensitivity analyses and tuning process were performed to determine the optimum parameters for each model. Then, the constructed models were evaluated using the data records from the second well.

The FN and RF techniques were nominated for using since they were recently applied in oil and gas industry [34, 44, 51] with encouraging outputs and simplicity.

Multiargument functional models are used in the FN approach to learn and process from the data. This technique relies on both data knowledge and domain. The model’s accuracy depends on selecting the best method and type in the FN algorithm. It included five methods (i.e., forward selection (FS), exhaustive search (ES), backward-forward (BF), forward-backward (FB), and backward elimination (BE)) that were tested with the two types of linear and nonlinear to build the optimal model.

On the contrary, the RF technique depends on bagging and bootstrapping procedures. Numerous decision trees are combined and decorrelated, and these processes are performed by randomly choosing parameters at each candidate split. The optimal model parameters of decision trees’ number (n_estimators), depth of each tree (max_depth), minimum number of samples desired to divide an internal node (min_samples_split), minimum sample leaf size required to be at the end node (min_sample_leaf), and maximum number of features (max_features) were defined using the tuning process.

The optimality of each model was examined for the highest R and lowest AAPE values. The equations for R and AAPE calculations are presented in Appendix A.

The flowchart for the methodology of developing the FN/RF is shown in Figure 2.

3. Results and Discussion

3.1. FN Model

The backward-forward (BF) method with the nonlinear type resulted the highest matching accuracy specified by the R value of 0.89 and AAPE values of 10.23% and 10.28% in training and testing processes. Figures 3 and 4 describe the matching in the cross plots and graphical representations of predicted and actual νst, respectively.

3.2. RF Model

The parameters’ tuning process indicated that the superior RF model was acquired using the optimized parameters mentioned in Table 4.

The developed RF model presented high matching accuracy with R values of 0.99 and 0.94 and AAPE values of 1.89% and 5.19% for processes of training and testing. The significant fitting was indicated by the cross plots and graphical representations of predicted and actual νst, as presented in Figures 5 and 6.

3.3. Models’ Validation

The reliability of the built models was evaluated using the second well’s dataset. This set included 762 records with drilling parameters and corresponding st.

The obtained results with the validation dataset indicated good matching accuracy with R values of 0.86 and 0.94 and AAPE values of 11.23% and 5.12% for FN- and RF-based models, respectively. These results assured the robustness of the developed models and the superiority of the RF model. The cross plots and graphical representations of the predicted and actual st are depicted in Figures 7 and 8.

4. Discussion

This paper treated the interest of using the AI approaches for st prediction, to reduce the costs of experiments and overcome the missed data issue for providing a continued profile of st. A real field data was used for this scientific and professional research.

An iterative approach was applied to find the optimal parameters of each model. From that, the highest fitting in the RF model was acquired with 100 decision trees, depth of each tree as 15, square root option for maximum number of features, min_samples_split as 2, and one minimum sample leaf size, while the backward-forward method with the nonlinear type was the optimum parameters for the FN approach.

The optimality of the two models was evaluated according to the R, AAPE, cross plots, and graphical representations. As a result, both models predicted the real-time st with outperformance in the RF-based model.

As it is impractical to predict a continued profile of st from experiments because of the high expenses, the constructed models are able to estimate st in real time with the least errors. However, the shortness of the developed models was represented by the inputs and outputs’ range, as only the intermediate hole section was investigated in these models, so wider lithologies should be examined. Also, the application of the hybrid model to improve the selection of models’ parameters instead of iterative methods is recommended for further works as well as the use of other AI techniques.

Comparing the two models indicated that the RF approach had an outperformance in predicting the real-time νst with better R and AAPE values, as presented in Figure 9. The fitting indices for the two models were outlined in Table 5.

5. Conclusions

Two AI models using FN and RF techniques were provided for real-time prediction of static Poisson’s ratio from the drilling parameters (WOB, RPM, SPP, T, ROP, and Q). A vertical well containing complex lithology with carbonate, shale, and sand was used to build the models utilizing the real field measurements. The main deliverables of this work are summarized as follows:(i)The FN model resulted acceptable fitting with the R value of 0.89 and AAPE values of 10.23% and 10.28% in training and testing processes(ii)The RF-based model outperformed in both training and testing processes with R values of 0.99 and 0.94 and AAPE values of 1.89% and 5.19%(iii)The models were validated with a different dataset from another well confirming the models’ applicability as it resulted in good prediction fitting with R values of 0.86 and 0.94 and AAPE values of 11.23% and 5.12% for FN and RF models, respectively(iv)Since the experimental measurements of static Poisson’s ratio is highly expensive and not always available, the constructed models developed a basis for predicting a continuous profile at real time in inexpensive methods with the least errors

Appendix

The R and AAPE values used to evaluate the optimality of each model are calculated using the following equations:where st_Actual and st _Predicted are the measured and the model-based predicted static Poisson’s ratio, respectively and N represents the number of datapoints.

Abbreviations

AAPE:Average absolute percentage error
ACE:Alternating conditional expectation
AI:Artificial intelligence
ANFIS:Adaptive neuro-fuzzy inference system
ANN:Artificial neural network
BE:Backward elimination
BF:Backward-forward
DE:Differential evolution algorithm
ES:Exhaustive search
FB:Forward backward
FL:Fuzzy logic
FN:Functional networks
FS:Forward selection
max_depth:Depth of decision tree
max_features:Maximum number of features
min_sample_leaf:Minimum sample leaf size required to be at the end node
min_samples_split:Minimum number of samples required to split an internal node
n_estimators:Number of decision trees
ROP:Rate of penetration, ft/hr
RPM:Rotation speed, rotation per minute
SPP:Standpipe pressure, psi
SVM:Support vector machines
T:Torque, klbf.ft
WOB:Weight on bit, klbm

Symbols

a, b, c, and d:Different empirical constants
N:Number of datapoints
Q:Pumping rate, gal/minute
R:Correlation coefficient
VP:Compressional-wave velocity, km/s
VS:Shear-wave velocity, km/s
VSh:Shale volume factor
dyn:Dynamic Poisson’s ratio
st:Static Poisson’s ratio.

Data Availability

All data used to support the findings of the study are available within the article and can be obtained from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors would like to thank King Fahd University of Petroleum & Minerals (KFUPM) for employing its resources in performing this work.