Abstract

The naturally fractured reservoirs are one of the most challenging due to the tectonic movements that are caused to increase the permeability and conductivity of the fractures. The instability of the permeability and conductivity effects on the fluid flow path causes problems during the transfer of the fluids from the matrix to the fractures and fluid losses during production. In addition, these complications made it difficult for engineers to estimate fluid flow during production. The fracture properties’ study is important to model the fluid flow paths such as the fracture porosity, permeability, and the shape factor, which are considered essential in the stability of fluid flow. To examine this, this research introduced new models including decision tree (DT), random forest (RF), K-nearest regression (KNR), ridge regression (RR), and LASSO regression model,. The research studied the fracture properties in naturally fractured reservoirs like the fracture porosity (FP) and the shape factor (SF). The datasets used in this study were collected from previous studies “i.e., Texas oil and gas fields” to build an intelligence-based predictive model for fluid flow characteristics. The prediction process was conducted based on interporosity flow coefficient, storativity ratio, wellbore radius, matrix permeability, and fracture permeability as input data. This study revealed a positive finding for the adopted machine learning (ML) models and was superior in using statistical accuracy metrics. Overall, the research emphasized the implementation of computer-aided models for naturally fractured reservoir analysis, giving more details on the extensive execution techniques, such as injection or the creation of artificial cracks, to minimize hydrocarbon losses or leakage.

1. Introduction

1.1. Background

Naturally fractured reservoirs are the result of natural processes that present the diastrophism and volume shrinkage that lead to fractures that have dispersed as a consistently linked network across the reservoir [1]. The tectonic processes have evolved in reservoirs, fractured reservoirs are frequently found in weak reservoir rocks with poor porosity [2]. Due to that, the fracture is extended and large, where it is often referred to as the large fracture [3]. If the granular porosity is high but the rocks are fragile, the fracture is relatively small and limited in quantity, often referred to as microfractures [1]. The naturally fractured reservoirs are different from the conventional reservoirs [4, 5]. In addition, the tectonic movements affected by the behavior of the fracture during transfer and production of the fluids flow due to the high conductivity and permeability of the natural fractures [6]. The conductivity and permeability of the fractures factor minimize the fracture porosity of the fluids that cause low storage capacity [6]. In contrast, the conductivity of the matrix increases storage capacity with low permeability, which causes an increase in the matrix porosity [7]. According to previous studies, matrix porosity is higher than fracture porosity in the naturally fractured reservoirs, which the fluids store in the matrix [8].

Multiphase flow modeling in naturally fractured reservoirs has been considered an issue for petroleum reservoir engineers [9, 10]. Warren and Root have shown a dual-porosity model which presents multiphase behavior in fractured reservoirs [11]. In the dual-porosity model, there are two characteristic regions: matrix and fracture [12]. The naturally fractured reservoirs present one of the most difficult reservoirs in the petroleum industry due to the fracture properties like permeability and porosity, where the natural fracture permeability is higher than the matrix permeability in a dual-porosity system [12]. The hydrocarbons flow from the matrix to the fractures and from these to the wellbore where the fractures cannot store these fluids, which cause fluid flow losses. An obstacle of the fluid flow can be seen in Figure 1 [13].

The motivation is always focused on developing a reliable mathematical model for modeling naturally fractured reservoir properties such as fracture porosity (FP) and the shape factor (SF). In the literature, ML models are one of the technologies that contribute to the systematization of fluid flow by forming speculative models based on hypotheses and equations modeling the properties of petroleum fluids [14]. Hence, the focus of the current investigation is to test different versions of ML models for the prediction of FP and SF. The models are built based on the characteristics of oil and gas, such as matrix porosity, permeability, pressure, and temperature.

1.2. Literature Review

Modeling of the naturally fractured reservoirs is one of the challenges due to the generation of the complex fractures and the conductivity factor of fractures that affect fluids paths [15]. Although fractures covered 20% of world reserves, naturally fractured reservoirs represented the risk reservoirs in drilling, production, and modeling processes due to fracture pressure that sometimes-caused loss in hydrocarbons production [16]. Two studies were conducted to investigate the naturally fractured reservoirs functionally [11, 17]. Both studies relied on the modeling of the naturally fractured, double-connected system, which was divided into two areas, the matrix and the fracture. The authors concluded that the fracture is central to the permeability of hydrocarbons and that the fluid flows from the matrix to the fracture depending on the geometric parameters of the fracture such as permeability and porosity [18]. Li et al. [19] extended the study of fluid leakage in areas of tectonic stress. To examine this, the tectonic movements affected the fracture behavior, which causes losses in the fluids of low-permeability formations. This examined some of the factors affecting the leakage of fluids by making a model for analyzing the leakage of fluids in naturally fractured gas fields. Turn to Warren and Root [11], the authors have provided a suitable solution to this problem and were able to arrive at these parameters, the actual shape, dimensions, and fluid flow properties of the reservoir, where the same scenario had been adopted later in [20, 21].

Two quadruple porosity models (QPM) that include a triple-fracture network with a single matrix system were presented by Dreier et al. [22] for naturally fractured reservoirs (NFR). The pressure-transient features of QPM are analysed and evaluated using these models: Warren and Root theory with various forms of matrix-to-fracture flow regimes, wellbore storage, and skin are commonly employed in well-test analysis, and the parameters storativity ratio and interporosity flow coefficient are significant in describing such reservoirs. Perez Garcia [23] improved the earlier simulation-based investigations, relying mainly on the Warren and Root equations as well as the Gilman model. The author came to the conclusion that the well-test data was reliable but more investigations and development assumptions were required for fracture reservoirs.

In order to boost the productivity of naturally fractured reservoirs, acid fracturing procedures are used. The effectiveness of the therapy is influenced by a number of factors, including treatment circumstances and reservoir characteristics. It is possible to measure the effectiveness of acid fracturing stimulations using a variety of methods [24]. Only a few models, however, took into account the natural fractures (NFs) present in the hydrocarbon reservoirs [25], using an ML model. Hence, the goal of this work is to develop an effective model to calculate the efficacy of acid fracturing therapy in naturally fractured reservoirs. This study estimates the increase in hydrocarbon production brought on by the use of acid fracturing treatments and takes into account the interactions between naturally occurring and artificially generated cracks. The reservoir features and treatment parameters of more than 3000 scenarios were utilized to create and validate the artificial neural network (ANN) model [26]. The created model takes into account the formation permeability, injection rate, natural fracture spacing, and treatment volume as reservoir and treatment characteristics [27]. To evaluate the effectiveness of the model’s prediction, the percentage error and correlation coefficient were also calculated. The performance of acid fracturing treatments can be predicted quite accurately using the proposed model. The testing datasets yielded a correlation coefficient of 0.94 and a percentage error of 6.3%.

A novel ML model based on an improved learning process was established to offer a precise and timely forecast for increased productivity [28]. In order to assess the validity of the new equation, validation data were employed. A 6.8% average absolute error and a 0.93 correlation coefficient were obtained, demonstrating the excellent dependability of the suggested correlation. The originality of this work is in creating a solid and trustworthy model to forecast productivity gains from acid fracturing in naturally fractured reservoirs. By offering quick and accurate calculations, the novel correlation can be used to enhance the treatment design for naturally fractured reservoirs [29]. In order to estimate fracture permeability under complex physics, an integrated workflow based on the ML model was proposed (e.g., inertial effect) [30]. The methodology is being used for the first time to scale up rocks [31]. The suggested model provides a practical and accurate replacement for conventional upscaling techniques that can be quickly integrated into workflows for reservoir characterization and modeling.

Analysis of pressure-transient well tests is a crucial technique for figuring out reservoir properties [32]. Due to the analysts’ inexperience, the validity of the data from the well-test analysis could be questioned [33]. A one-dimensional convolutional neural network (1D CNN) was tested to create an autonomous model for well-tested data interpretation [34]. Both the associated parameters and the type of curve can be automatically identified by the model [35]. Without adjusting the model architecture or using hyperparameters, a combined automatic interpretation model with four conventional well-test models were conducted. According to the findings, the 1D CNN model outperformed the ANN model. Three field cases are used to further validate the automatic interpretation model. Based on the reported literature review, the current research was inspired to develop a new methodology based on a soft computing model that is associated with theoretical study in the conditions of simulation data as well as being specialized based on the assumptions [36]. Our paper focuses to solve the fluids losses problem by modeling the fluids flow in the fractured reservoirs through using machine learning technique. This paper depends on the two parameters which are interporosity flow coefficient and storativity ratio to determine the fluid flow behavior.

1.3. The Objective of This Study

The previous studies used empirical methodologies and modern models such as ML models for diverse oil and gas datasets. This leads to the continuous issue in fluid flow paths due to the changes of the fracture’s characteristic such as high conductivity and permeability of fractures. In conjunction with modern technology, the aim of this research is to make use of the applicability of ML models for predicting the shape factor and the fracture porosity. The models evaluated the fluid flow in the naturally fractured reservoirs. Predictive models were established using five different related input parameters. The assumption of the characteristics of the oil and gas datasets allows the ML models to build more than one model for the same datasets and compare them to get the best two models. The expected research outcome is to develop a reliable alternative technology for the petroleum industry where it can participate in sustainability and management.

2. Data Descriptions

This study specializes in analyzing the fluid flow behavior in naturally fractured reservoirs during production. The actual data consist of oil and gas data were obtained from Texas field. The actual data take account of the matrix permeability, fracture permeability, wellbore radius, interporosity flow coefficient, and the storativity ratio as inputs while the fracture porosity as outputs. To accomplish that, the study changed the input and output of Texas field data such that storativity ratio, interporosity flow coefficient, matrix permeability, fracture permeability, and wellbore radius are input data, while the shape factor and fracture porosity are output data. The conditions of input and output data were put as temperature 150 Fahrenheit, pressure 3,626 psi, and matrix porosity 22% for oil and 15% for gas, radius 1,000 ft, where these conditions are the same as Texas field’s conditions with some changes in the assumptions [23]. These data are implemented on the basis that the flow is radial, taking into account that there is no Darcy’s law and no skin factor [37].

3. Machine Learning Methods

This section presented the utilized ML models adopted for fluid flow data simulation. ML models are built to describe the effect of the shape factor and the fracture on the treatment of the hydrocarbon’s leakage. Five different types of ML models were developed, including decision tree regression, random forest regression, LASSO regression, K-N neighbors’ regression, and ridge regression models. Figure 2 presented the flowchart of the conducted methodological mechanism for this study.

3.1. Decision Trees Model

Decision tree regression (DT) was developed by [38] as a powerful ML model for both classification and regression tasks [39, 40]. In the DT algorithm, features (extracted from a specific dataset) are arranged in a symbolic tree-shaped manner, with terminal and internal nodes representing leaves and splits, respectively [41]. A tree is shaped by following a set of fundamental principles. Multiple trees are combined to form a set of rules that can be used in the prediction step. The technique first builds a tree from the training dataset, after which it splits the original data into two branches using a binary split procedure. The new growth branches are subjected to the separation process, and this is continued when a new branch becomes inseparable, and the accompanying node achieves the minimum size and evolves into a terminal node [42]. DTR’s principles are easy to understand and follow a logical pattern that can be described as a tree; this is a significant advantage of the DT over other models. However, despite being quicker than other AI models, DTR frequently is not the right choice for time-series problems [43] because it frequently does not generate accurate results when there are issues of nonlinearity or noisy datasets.

3.2. Random Forest Model

Random forest model was first developed by [44], and since its introduction, it has been used widely in many fields of science and engineering for prediction purposes [4547]. The RF model is strongly advised for scenarios with numerous input variables. Regarding a random vector, RF is an individual uniform distribution to each tree in the forest. Although RF is preferred when the trees, eta, and mtry are of an appropriate size, the maximal depth can be adjusted according to the complexity of the data [48]. The RF model in this study was built using library (random forest). The significant hyperparameters, such as ntree, eta, max depth, and mtry, were set to 140, 4, 6, and 2 accordingly in order to reduce overfitting issues. When regression trees are built using distinct bootstraps for each tree, the potential features in those trees can be modeled using such algorithms [49]. For the purposes of predicting the targeted parameters, every tree in the forest was treated equally. The first set was utilized for the growth of the trees and then for the assessment of each tree’s classification error [50]. The RF prediction’s output is represented thus,where y = the average prediction output from the overall number of trees and yi (x) = the trees’ discrete prediction for output vector x. Model overfitting was avoided by using the ten-fold cross-validation procedure thrice. The tree was initially built for each predictor and was then followed-up with its growth to ensure optimal weight and minimal computed error. The significance of the predictors, as well as the self-adjusted growth of the selected trees, must be ranked using the RF approach. The RF model performed well across all the response predictors based on the employed performance indicators.

3.3. K-Nearest Neighbor Regression Model

The KNR classification was developed as a relatively new technique for the parametric estimation of unknown probabilities [51]. The KNR was mainly built for classification patterns with an understanding of the K-nearest neighbor rule [52]. The KNR concept relies on the distance between the distributions to categorize each piece of data that contains the majority of nearest neighbors [53]. The prediction process of KNR depends on the use of classifiers and regression, wherein the regression aspect uses previously processed data to predict future data. Statistical methods, such as linear regression, are typically used to process the regression; however, the use of the linear regression method is limited only to some databases. This study implemented regression for the prediction of the fracture porosity and the shape factor based on gas and oil filled data observations. The problem with regression is predicting the result of a preprocessed parameter using a certain collection of independent variables. The result can be expressed as G = Gn if the KNR is performed using n nearest neighbors. Then, the average of the results is used to determine the outcome. The solution will then be given as

At that point, KNR prediction initiates the outcome of the neighbor, and prediction can be done by determining the Euclidean distance (ED) between the case point and the query based on the existing dataset.where z represents the query point and q represents the case point from the existing dataset.

3.4. Ridge Regression Model

In conditions whilst linearly impartial variables are closely correlated, RR model is a way of calculating the coefficients of a couple of regression fashions [54]. It has been applied in different engineering and science domains and approved for its capacity. Amongst different areas and patterns of the RR model, equations are presented as follows [55]:where is the transpose of X. By contrast, the ridge regression estimator to evaluate () the fracture porosity.where Ip is the p × p identity matrix, and K > 0 indicates a large number. The form along the diagonal of I is known as a ridge.

3.5. Lasso Model

Lasso is a commonly used sparse regression technique that relies on the sparse assumption for parameter regularization [56]. It is an innovative method for variable selection during regression tasks that operates by minimizing the residual sum of squares under the condition that the sum of absolute values of the coefficients is less than a constant [57]. It was first discussed in relation to least squares. The following is a summary of the basic Lasso framework. Assume a sample of N cases with p variables and a single outcome for each of the N instances. Consider that yi is the response variable while xi = (x1, x2, x3, …, xnp). T represents the covariate vector for the ith case, β = (β1, β2, …, βp)T. Hence, Lasso is aimed to solve the regression problem using nonlinearity functional properties.

4. Results and Discussion

In this section, the results of the adopted ML models were presented based on the radial flow in the fluid modeling of fracture reservoirs. Radial flow was concerned with the analysis of the flow of fluids along a radius. The volume for the wellbore radius is not large enough to cover the modeling of an entire well by ML models. The present results provide a slight improvement over the previous results of the data provided by Perez [23].

One of the most popular graphical presentation on the predictability evaluation is the scatter plot between the actual observations and the predictive models results, which was adopted here for assessment. All models were developed based on the predictability and interoperability of fluid flow losses. The modeling results for the DT model for the gas flow were reported in Figure 3 (Figure 3(a): fracture porosity, and Figure 3(b): shape factor). The DT model attained a determination coefficient (R20.80) for fracture porosity and R20.94 for the shape factor. This was formed in harmony with the established previous research of [23]. The modeling of Perez’s data is assumed to be actual if our results build on the predicted data. On the other hand, the results for the oil flow characteristic are given in (Figure 4(a): fracture porosity (R20.93) and Figure 4(b): shape factor (R2 = 0.94)).

In the same manner, the scatter plots for the other ML models are (i.e., RF “Figure 5: gas flow and Figure 6: oil flow,” KNR “Figure 7: gas flow and Figure 8: oil flow,” RR “Figure 9: gas flow and Figure 10: oil flow” and Lasso “Figure 11: gas flow and Figure 12: oil flow”). The superior prediction accuracies were observed for the gas flow fracture porosity using the Lasso model over the testing phase with (R20.97). However, gas flow shape factor prediction achieved using KNR (R20.96). It is true that all models relative accomplished their results over (R2 = 0.85) as an indicator for acceptable results. However, the motivation here is to target the more accurate model which is relatively near to the factual field observations with R21. For the oil flow, the fracture porosity and shape factor were predicted accurately using the Lasso model.

It is even better to explore more logical and scientifically acknowledged graphical presentation of the developed ML models [58]. A graphical based on root mean square error (RMSE), correlation, and standard deviation was generated in Figures 13(a) and 13(b) “gas flow presented in the form of fracture porosity and shape factor” and Figures 14(a) and 14(b) “oil flow presented in the form of fracture porosity and shape factor.” Based on Figure 13(a), the KNR model presented the closest coordinate to the actual observation of gas flow fracture porosity. However, the prediction of gas flow shape factor was visualized using DT and KNR “relatively close coordinates” (Figure 13(b)). In Figure 14(a), Lasso models indicated nearer location for the fracture porosity; on the other hand, both models Lasso and RF showed relatively closer prediction performance for the shape factor of the oil flow (Figure 14(b)).

The modeling results further assessed using some statistical metrics based on the perfect fit of goodness (i.e., determination coefficient (R2) and Kling–Gupta efficiency (KGE)) and absolute error indicators (i.e., root mean square error (RMSE) and mean absolute percentage error (MAPE)) [59]. Tables 1 and 2 reported the modeling results for the gas flow fracture porosity and shape factor, respectively. By examining all the performance metrics of the Table 1, it clearly appears that performing more than two or more statistical indicators can give more informative results of the adopted ML models. Although the Lasso model revealed the superior correlation value, the KNR model gave the minimal value for the MAPE metric in which by validating with KGE metric, it is clearly a valid model for prediction superiority for the fracture porosity of gas flow. On the other hand, Tables 3 and 4 presented the modeling results of the oil flow fracture porosity and shape factor, respectively. Over the testing phase, the KNR model reported the best modeling results for the fracture porosity and shape factor prediction based on the multiple metrics evaluation.

Based on the performance metrics in Tables 14, modeling superiority for the attained prediction results distributed between RF, KNR and Lasso models. Indeed, this is a factual thing for the ML models behavior through the learning process in which depending on the how much capacity can be attained during the learning mechanisms of the models. This can be elaborated also due to the distribution of the fracture points in one line which prove that the fracture conductivity factor has a big impact oil than gas [60]. Generally, the radial flow is distributed as parabolic that points of fluids spread systemically due to the shape factor in gas flow [61].

5. Conclusions

Reliably predicting distributed fluids flow in the naturally fractured reservoirs is achievable using ML models applied to a group of oil and gas datasets of Texas field calibrated with the fracture porosity and shape factor data. The fracture porosity and shape factor respond to the distinct characteristics of fractures such as conductivity and permeability factor in different ways. The results of this study lead to the following conclusions:(i)When the fracture characteristics variables were used collectively in the trained models, it was definitively determining the suitable type of flow for oil and gas (pseudosteady state flow or radial flow).(ii)Based on the five available input variables, the following two output variables are shown to be most effective when used in combination to predict the movement of the fluids in the naturally fractured reservoirs. These input variables are matrix permeability, fracture permeability, wellbore radius, interporosity flow coefficient, and storativity ratio, while the output variables are the fracture porosity and shape factor.(iii)ML models confirmed their potential in predicting the ail/gas flow fracture porosity and shape factor.(iv)Some limitations were observed that sometimes ML models cannot work well in the development of the behavior of the naturally fractured reservoir characteristics due to their instability in one pattern as same as conventional reservoirs and this causes it difficult to predict.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they do not have any conflicts of interest.