#### Abstract

Natural gas hydrate has been widely of concern due to its great potential in application to address problems including gas storage, transmission, separation techniques, and also as energy resource. Accurate prediction of hydrate formation phase equilibrium conditions is essential for the optimized design during natural gas production, processing, and transportation. In this study, a novel graphical alternating conditional expectation (ACE) algorithm was proposed to predict hydrate formation phase equilibrium conditions for sweet and sour natural gases. The accuracy and performance of the presented ACE model were evaluated using 1055 data points (688, 249, and 118 data points for sweet natural gas, CO_{2}-CH_{4}, and H_{2}S-CO_{2}-CH_{4} systems, respectively) collected from literature. Meanwhile, a comparative study was conducted between the ACE model and commonly used correlations, including thirteen models for sweet natural gases, three models for CO_{2}-CH_{4} binary system, and seven thermodynamic models for H_{2}S-CO_{2}-CH_{4} ternary system. The obtained results indicated that the proposed ACE model produces the best results in prediction of hydrate phase equilibrium temperature for sweet natural gases and pressure for CO_{2}-CH_{4} system with average absolute relative deviation (AARD) of 0.134% and 2.75%, respectively. The proposed quick and explicit ACE model also provides a better performance in prediction of hydrate phase equilibrium pressure for H_{2}S-CO_{2}-CH_{4} ternary systems with AARD=5.20% compared with seven thermodynamic methods considered in this work, except for CPA/Electrolyte/Chen–Guo combined model (AARD=4.45%).

#### 1. Introduction

Gas hydrates are regarded as the representatives of a class of compounds widely known as clathrates or inclusion compounds, which occur in appearance of a crystalline solid like water ice or dry ice. Generally, hydrates concomitantly consist of some guest molecules trapped in a cage structure built by host molecules. The most leading host molecule is water and the most common guest molecules include light hydrocarbons (CH_{4}), hydrogen sulfide (H_{2}S), carbon dioxide (CO_{2}), hydrogen (H_{2}), and nitrogen (N_{2}). Accurate calculation of natural gas hydrate formation phase equilibrium conditions not only is essential for an optimized design scheme of natural gas production, processing, and transportation (flow assurance), but also is of great significance for preventing [1, 2] subsea/land oil and gas transport pipeline blockage, promoting the development of gas transmission and separation techniques [3–5].

For the natural gas hydrate formation phase equilibrium conditions, the primary problem is how to rapidly and precisely predict the hydrate phase equilibrium pressure or temperature as gas hydrates form. Up to now, various approaches have been presented to determine hydrate phase equilibrium pressure or temperature. These approaches can be mainly divided into five types: laboratory tests [6, 7], graphical methods [8, 9], empirical correlations [10–18], thermodynamic models [19–26], and artificial intelligence algorithms [27–30]. It is well known that the experimental determination of the hydrate formation phase equilibrium conditions is expensive and time-consuming, which limit the application of this method. The empirical correlations and thermodynamic models were widely used for predicting the hydrate formation phase equilibrium conditions. The commonly applied empirical correlations to predict hydrate formation conditions are listed in the Appendix. These empirical correlations have two significant advantages: (1) simplicity and (2) high accuracy within the range of its application. However, as for thermodynamic models, in the early 1959, Van der Waals and Platteuw (vdW-P) [19] firstly provided a basic thermodynamic model to calculate hydrate phase equilibrium conditions, which could be seen as the beginning of intensive enthusiasm for following researches on hydrate formation thermodynamic models. Afterwards, the emerging thermodynamic models (e.g., Parrish-Prausnitz [20], Ng-Robinson [21], John et al. [22], and Eslamimanesh et al. [23]) for calculating hydrate phase equilibrium conditions were mainly derived from the various amendments to the classical vdW-P model. Currently, the most accepted method is based on the vdW-P model integrated with an equation of state to estimate hydrate phase equilibrium conditions [24–26, 31, 32].

Recently, with the increase of global energy demand and the decrease of conventional gas, more attentions have been paid to explore and exploit new sour natural gas fields. Natural gas containing H_{2}S and/or CO_{2} is defined as sour gas, and also H_{2}S or/and CO_{2} hydrates are more easily formed under lower pressure and higher temperature conditions in the presence of water. Therefore, it is more critical to calculate the hydrate phase equilibrium conditions of sour natural gas mixtures to avoid the subsea or land oil/gas pipeline hydrate plugging problems. Thus, several methods (K-factor [33], Baillie-Wichert [34], and Mann et al. [35]) and some software (CSMHYD, HYSIS, EQUI-PHASE Hydrate, and PROSIM) were applied to determine hydrate phase equilibrium conditions of sour natural gas. In early 1941, Wilcox et al. [33] proposed a semiempirical formula based on the K-factor to forecast the hydrate phase equilibrium temperature. PROSIM has poorer performance and larger errors compared with those from the other software-based thermodynamic models when it computes the hydrate phase equilibrium temperature for sour gas streams. Similarly, the Baillie-Wichert method should be also cautiously applied for gas mixtures with only CO_{2} but without H_{2}S. In addition, the CO_{2} mole concentration should be limited to no more than 5%. Note that when the temperature is below 55°F(12.8°C), all methods mentioned above fails to obtain error range of ±3°F(1.7°C) in the process of hydrate phase equilibrium temperature prediction. In 1996, a classical hydrate thermodynamic model was presented by Chen and Guo [36, 37] on the basis of the two-step hydrate formation mechanism, which supposed that water activity coefficient was set as a unity. However, CO_{2} and/or H_{2}S dissolution and reactions with water indicated that the water activity coefficient cannot be assumed to be unity. Afterward, according to Chen–Guo model, Sun and Chen [38] eliminated the assumption that the activity coefficient was unity and obtained the satisfied calculation results. In 2013, a novel thermodynamic model [39] was proposed to determine the pure and mixed sour gas hydrate formation pressures. The model took into account the impact of hydrogen bonding association in fluid phases with great potential in application for accurately predicting hydrate phase equilibrium pressures of sour gases.

Recently, ZareNezhad and Aminian utilized an adaptive neurofuzzy inference system (ANFIS) [40] to predict the sour gas hydrate equilibrium dissociation pressure. It was interpreted from predicted results that the ANFIS model outperforms the existing thermodynamic models when H_{2}S mole fraction is higher than 10%. Similarly, an improved artificial neural network (ANN) was also provided by Soroush et al. [41] to predict the hydrate formation temperature over a wide range of gas mixtures. Different from those empirical correlations and thermodynamic models, such neural networks can only result in a “black box” instead of an explicit formula. In 2016, a least square support vector machine (LSSVM) model [42] was provided to predict natural gas hydrate formation temperature. The results revealed the superiority of LSSVM model in the form of high prediction accuracy.

However, there is still few reports of an empirical correlation to predict hydrate phase equilibrium conditions especially for sour natural gases. The purpose of this study is to develop a novel correlation of hydrate phase equilibrium conditions for sweet and sour natural gases using a graphical alternating conditional expectation (ACE) algorithm. The ACE algorithm and data acquisition are introduced in Section 2. The model of gas hydrate formation condition is developed in Section 3. Results and discussions are indicated in Section 4. Finally, in Section 5, the relevant conclusions are presented.

#### 2. Methodology

##### 2.1. ACE Algorithm

The ACE algorithm is a multivariate nonparametric regression method originally proposed by Breiman and Friedman [43] in 1985 and later perfected by Xue et al. [44] in 1997. The most fundamental idea behind ACE is to estimate the optimal transformations for multiple regressions, which matches the correlation between a random dependent (output) variable and multiple independent (input) variables. In this section, a detailed explanation about ACE is given and the general expressing formula of the nonparametric ACE algorithm is given as follows:where* θ *and are defined as a function of the dependent output variable and the independent input variables , respectively. , , and represent the dimensions of variables, the regression coefficient, and an error term, respectively.

Through converting the original linear function problem of an -dimensional variable into the problem of independent one-dimensional functions, , the constructed ACE model achieves optimal transformations in logarithmic or actual space. Thus, the eventual aim of ACE model is to minimize the error variance () of the transformed dependent output variable on the sum of transformed independent input variables. The is defined as follows:Particularly with the constrain,

Then a series of single functions is employed to minimize the error variance () with respect to , ,…, and , which was obtained from the following equations:

With respect to equations mentioned above, the alternating conditional expectation algorithm developed by (4) and (5) involved two stage operation processes, including (i) estimation of conditional expectations and (ii) iterative minimization of error variance (). Though iterative process of minimizing the error variance (), the real-valued measurable zero-mean functions , , and are translated into optimal transformations , , and . In the transformed space, the relationship of the dependent output and independent input is expressed as follows:where denotes the error that is not obtained by the application of the ACE transformations and is defined to obey a normal distribution with zero mean. It should be noted that was calculated as a correlation coefficient between the transformed dependent output variable and the sum of transformed independent variables (with the constrains: , as follows:

In the process of ACE transformation, the minimum regression error, , is replaced by calculating maximum multiple correlation coefficient, as (8).

Assuming that the transformation function is strictly monotonic and reversible, the dependent output variable for any data point is finally given as follows:

In consequence, the following criteria may indicate most of the advantages of the newly proposed ACE algorithm over the conventional multivariate parametric regression methods and artificial intelligence algorithms (e.g., ANN and SVM):(1)No requirement for a prior assumption of a function form and the optimal transformations are merely relying on the data set.(2)Normalization and stabilization of error variance distribution.(3)Applicable for the dual cases of bivariate and multivariate regression.(4)Different from ANN and SVM, such transformations can result in an explicit formula instead of “black box”.(5)It can effectively avoid probability for convergence to the local optimum, and it leads to fewer adjustable parameters.(6)A graphical interface program has been efficiently developed to compute ACE simulation results (GRACE) [44].(7)Acceptable generalization performance.

##### 2.2. Data Acquisition

Adequate data on sweet and sour natural gases hydrate phase equilibrium conditions are crucial for developing an empirical correlation since the comprehensiveness, diversity, and validity of the collected database have a dominating influence on the accuracy and reliability of the correlation. Meanwhile, due to the toxicity of H_{2}S and the corrosion of H_{2}S/CO_{2} to the device, the experimental data on the hydrate phase equilibrium conditions of sour gases with H_{2}S/CO_{2} content are scarce in the open literature, which has been also identified as an important and challenging task. In this work, 1055 experimental data sets of hydrate formation conditions for sweet (688 datasets) and sour (367 datasets) natural gases were collected from published literature [7, 8, 18, 45–48]. The ranges of all the experimental gas hydrate formation conditions considered in this work are given in Table 1. Note that we do not employ the water content as a variable although the water content plays an important role in the hydrate phase equilibrium temperature and pressure. This is because there is no connection between the deviation in the predicted hydrate phase equilibrium conditions and the water content.

#### 3. Model Development of Gas Hydrate Formation Condition

The superiority of ACE algorithm to other conventional multiple regressions is that it can flexibly employ actual and logarithmic space of the variables (e.g., ln(CH_{4}), ln(*P*), ln(*T*)... etc.) to reveal the intrinsic regularity in input and output variables when the functional form between them is implicit. Moreover, it is necessary to determine the independent and dependent variables which should be considered as a high priority. So, the trial-and-error approach could be applied in comparing and verifying all possible combinations of the output and input variables in either logarithmic or actual space. To evaluate the performance of ACE model, its accuracy was compared with some well-known empirical correlations or thermodynamic models that are commonly adopted to determine the hydrate phase equilibrium conditions. The flowchart of the proposed ACE model is briefly described in Figure 1.

##### 3.1. Sweet Natural Gases

For sweet natural gases hydrate formation condition, temperature is set as dependent output variable. Among the 688 data points, 563 points were selected to establish the ACE model and the remaining 125 points were used to validate the model. Meanwhile, all possible combinations of the output and input variables in either logarithmic or actual space were tested. After a series of tests, the best relationship is acquired as follows:

Figure 2 shows the corresponding optimal transformations of , , and for sweet natural gases determined by ACE algorithm. The optimal transformation of* T* versus the sum of the optimal transformation of* P*, and is illustrated in Figure 3. Consequently, a series of simple polynomials were used to match every optimal transform of the dependent and independent variables. Considering the relationship between and the optimal transformation of as shown in Figure 2(d), the prediction model of sweet natural gases hydrate formation temperature is yielded from the inverse of the optimal transformation of sweet natural gases hydrate formation temperature:Herewhere the model coefficients are listed in Table 2.

**(a)**

**(b)**

**(c)**

**(d)**

##### 3.2. Sour Natural Gases

###### 3.2.1. CO_{2}-CH_{4} Binary System

CO_{2} hydrate has attracted increasing researchers’ attention due to its great application potentials in many fields including CO_{2} capture and sequestration from flue gases, steam reforming processes, hydrogen (H_{2}) storage, water desalination, etc. These promising applications are held back for the problems related to the CO_{2} hydrate formation condition. In this section, hydrate formation condition data for CH_{4}-CO_{2} binary system was sorted and collected from different literature [45, 46]. For CO_{2}-CH_{4} system, hydrate phase equilibrium pressure is set as an output variable. Similarly, working with CO_{2}-CH_{4} streams, 189 out of the 249 data points were chosen to construct the ACE model and the remaining 60 points were employed to validate the model. The best relationship is given as follows:

Figure 4 shows the corresponding optimal transformations of ,* y*(CO_{2}),* y*(CH_{4}) for CH_{4}-CO_{2} binary system. Figure 5 illustrates the optimal transformation of versus the sum of the optimal transformation of ,* y*(CO_{2}), and* y*(CH_{4}).

**(a)**

**(b)**

**(c)**

**(d)**

According to the same principles above, the prediction model of CO_{2}-CH_{4} system hydrate formation pressure can be obtained from the inversion of the optimal transformation of CO_{2}-CH_{4} hydrate formation pressure:where the model coefficients are listed in Table 3.

###### 3.2.2. H_{2}S-CO_{2}-CH_{4} Ternary System

In recent years, due to the decrease of conventional natural gas, sour natural gases have been becoming subsequent candidates for economical and clean energy supply. Kashagan field, Shah field, Limestone field [41], and Puguang gas field from China are typical sour gas reservoirs with high H_{2}S content. The precise knowledge of hydrate phase equilibrium conditions for sour natural gas with high H_{2}S content is an essential prerequisite for any sour gas processing and pipeline engineering activity. To the best of our knowledge, there are few reports of an empirical correlation for predicting sour natural gases hydrate equilibrium conditions. Thus, the ACE algorithm was proposed for predicting the sour gas hydrate phase equilibrium conditions. For H_{2}S-CO_{2}-CH_{4} ternary system, hydrate formation pressure is set as an output variable. In view of the data for H_{2}S-CO_{2}-CH_{4} ternary system, the 118 data points were used for training data sets and 59 points were used for validating the feasibility and effectiveness of the model. The best expression form is shown as follows:

Figure 6 shows the corresponding optimal transformations of Ln(CH_{4}),* y*(H_{2}S),* y*(CO_{2}),* T* for H2S-CH_{4}-CO_{2} ternary system. Figure 7 illustrates the optimal transformation of* P* versus the sum of the optimal transformation of* T*,* y*(H_{2}S),* y*(CO_{2}), and* y*(Ln(CH_{4})). It can be observed from Figure 6(d) that the prediction model of hydrate formation pressure for H_{2}S-CO_{2}-CH_{4} ternary system can be obtained using the inverse of the optimal transformation of H_{2}S-CO_{2}-CH_{4} system hydrate formation pressure by means of the relationship between* P* and the optimal transformation of :where the model coefficients are listed in Table 4.

**(a)**

**(b)**

**(c)**

**(d)**

**(e)**

#### 4. Results and Discussions

To evaluate the performance of the presented ACE model and other commonly used models considered in this work, statistical analyses such as average absolute relative deviation (AARD), root mean square error (RMSE), standard deviation (SD), and coefficient of determination (R^{2}) were performed between the experimental and predicted values as follows.

where* N* represents the number of experimental data points, and stand for the experimental and predicted value corresponding to experiment , respectively, and is the average of all the experimental values.

##### 4.1. Sweet Natural Gases

The scatter diagram (cross-plot) of a comparison between the experimental results and calculated temperature using ACE model is presented in Figure 8. It is clearly shown in Figure 8 that a tight cloud of data points about 45° line for testing data sets illustrates an excellent fitting between experimental data and calculated values by the proposed ACE model.

Moreover, the predicted hydrate formation temperature of sweet natural gases for testing data sets from the proposed ACE model was compared to 13 commonly used correlations listed in Figure 9.

It can be concluded from Figure 9 that the proposed ACE model produces the lowest global AARD (0.134%). In the descending order of exactness Ghayyem et al., Ghiasi, Amin et al. (2016), Amin et al. (2015, first), Amin et al. (2015, second), Bahadori and Vuthaluru, Zahedi et al., Towler and Mokhatab, Kobayashi et al., Motiee, Berge, Hammerschmidt, and Makogon correlation come in sequence order, respectively. And the corresponding mean AARDs are 0.164%, 0.371%, 0.399%, 0.492%, 0.504%, 0.757%, 0.848%, 1.114%, 1.456%, 1.645%, 1.735%, 2.076%, and 3.093%, respectively.

##### 4.2. Sour Natural Gases

###### 4.2.1. CO_{2}-CH_{4} System

The cross-plot (Figure 10) compares experimental pressure with calculated data using ACE model for CO_{2}-CH_{4} system. The excellent agreement between ACE model prediction and the experimental values validate our ACE model for the CO_{2}-CH_{4} system.

Moreover, a comparison between predictions proposed by Adisasmito et al. [18] and the ACE model is conducted for the same training and testing data sets. The results are listed in Table 5.

In Table 5, the predictions of the proposed ACE model, of which AARD for total data is 2.75% including AARD=3.43% for testing data, are slightly better than those of Adisasmito’s model, of which AARD is 2.95% including AARD=4.53% for testing data.

In addition, a comparison between the proposed ACE model and two commonly software-based thermodynamic models, including CSMGem [49] and HWHYD [50], was also made for 9 data points collected in the open literature [48]. The predicted phase equilibrium pressures and AARDs are listed in Table 6. As can be seen in Table 6, the AARD of the predicted phase equilibrium pressure by ACE, CSMGem, and HWHYD models are 3.50%, 5.06%, and 9.67%, respectively, indicating that the presented ACE model owns a better accuracy than the two commonly thermodynamic models considered in present work in prediction of hydrate dissociation conditions for CO_{2}-CH_{4} binary system.

###### 4.2.2. H_{2}S-CO_{2}-CH_{4} System

In Figure 11, the cross-plot compares experimental pressure with calculated data by ACE model for H_{2}S-CO_{2}-CH_{4} system. The overall AARD, RMSE, SD, and R^{2} are 7.20%, 453.29, 0.10, and 0.97, respectively. The predicted results show a good agreement with the experimental values. Careful examination of Figure 11 reveals that ACE model can perfectly reproduce the results at lower hydrate formation pressure. Meanwhile, at higher hydrate formation pressure, the discrepancy between the experimental values and predicted values by ACE model is somewhat larger for H_{2}S-CO_{2}-CH_{4} ternary systems.

In order to evaluate the accuracy of ACE model, the prediction results are also compared with values calculated using other thermodynamic models from ZareNezhad and Ziaee [41] for 59 experimental data on sour nature gas mixtures taken from the literature [47]. The comparison results were presented in Table 7.

It can be observed in Table 7 that the predictions by the proposed ACE model are in good consistency with the experimental data with mean AARD=5.20% and better than the seven thermodynamic methods, except for CPA/Electrolyte/Chen–Guo combined model (AARD=4.45%).

In addition, compared to the thermodynamic models, the ACE model has one enormous advantage: it provides quick and explicit calculations of the hydrate phase equilibrium pressure for H_{2}S-CO_{2}-CH_{4} ternary system over a wide range of phase equilibrium temperature and H_{2}S content, which avoids complex calculations by the thermodynamic models. Careful examination of Table 7 shows that the ACE model gives the best results with overall AARD=4.09% for H_{2}S content range from 9.93% to 26.62%, which covers the mostly of natural gas fields with high H_{2}S content. Note that the result of (17) is less accurate in prediction of CO_{2}-CH_{4} hydrate formation pressure by setting the H_{2}S content as zero, indicating that (17) cannot be reduced to (14). The reason may be that the (17) was proposed on the basis of experimental data containing H_{2}S component, while H_{2}S content has an important influence on the results of hydrate phase equilibrium conditions for H_{2}S-CO_{2}-CH_{4} ternary system.

#### 5. Conclusions

In this work, the ACE model was proposed to predict hydrate formation phase equilibrium conditions for both sweet and sour nature gases. Among the 1055 experimental data from literature, three data groups (688, 249, and 118 data points) were used to train, test, and evaluate the performances of the ACE model for sweet and sour natural gases. The presented ACE model gives the best matching result among 13 commonly used correlations with mean AARD of 0.134% for sweet natural gases. The ACE model produces better results than Adisasmito et al., CSMGem and HWHYD models with AARD less than 3.5% for CO_{2}-CH_{4} binary system. The proposed ACE model has better performance in prediction of hydrate formation pressure for H_{2}S-CO_{2}-CH_{4} ternary systems with mean AARD=5.20% with respect to seven thermodynamic methods considered in this work, except for CPA/Electrolyte/Chen–Guo combined model (AARD=4.45%). Moreover, the most advantage of the ACE model is that it provides quick and explicit calculations of the hydrate formation conditions.

#### Appendix

Here are the empirical correlations for hydrate formation condition representation.

Hammerschmidt (1934)

Makogon (1981)where , .

Berge (1986)where .where .

Kobayashi et al. (1987)Motiee (1991)Towler and Mokhatab (2005)Bahadori and Vuthaluru (2009)whereZahedi et al. (2009)Ghiasi (2012)Hosseini-Nasab (2012)where .

Riazi (2014)Ghayyem et al. (2014)Amin et al. (2015)whereAmin et al. (2016)For For Adisasmito et al. (1991)The coefficients of these correlations are indicated by A-H, a-d, and y. The detailed values of model coefficients refer to [10–18].

#### Data Availability

The data used to support the findings of this study are included within the supplementary information file.

#### Conflicts of Interest

The authors declare that they have no conflicts of interest.

#### Acknowledgments

This work was supported by National Science and Technology Major Project (2016ZX05048) and National Key Basic Research Development Plan (973 program) “Well completion and test optimization methods for deepwater oil and gas wells” (2015CB251205).

#### Supplementary Materials

Supplementary Materials folder is the supplementary information, unzip the folder, it contains the data files for all the figures (Figure 2-Figure 11) in the manuscript. The name of file in the folder corresponds to the figure title in the manuscript.* (Supplementary Materials)*