Advances in Materials Science and Engineering

Advances in Materials Science and Engineering / 2020 / Article
Special Issue

Innovation, Sustainability and Resilience of Asphalt Pavement Materials

View this Special Issue

Research Article | Open Access

Volume 2020 |Article ID 2025072 | https://doi.org/10.1155/2020/2025072

Ruibo Ren, Wenmiao Fan, Pinhui Zhao, Hao Zhou, Weikun Meng, Ping Ji, "Crude Oil Source Identification of Asphalt via ATR-FTIR Approach Combined with Multivariate Statistical Analysis", Advances in Materials Science and Engineering, vol. 2020, Article ID 2025072, 13 pages, 2020. https://doi.org/10.1155/2020/2025072

Crude Oil Source Identification of Asphalt via ATR-FTIR Approach Combined with Multivariate Statistical Analysis

Guest Editor: Meng Guo
Received14 Feb 2020
Revised15 May 2020
Accepted03 Jun 2020
Published06 Jul 2020

Abstract

The types of crude oil for producing asphalt have a decisive influence on various performance measures (including aging resistance and durability) of asphalt. To discriminate and predict the crude oil source of different asphalt samples, a discrimination model was established using 12 greatly different infrared (IR) characteristic absorption peaks (CAPs) as predictive variables. The model was established based on diverse fingerprint recognition technologies (such as principal component analysis (PCA) and multivariate logistic regression analysis) by using attenuated total reflectance-Fourier transform infrared spectroscopy (ATR-FTIR). In this way, the crude oil source of different asphalt samples can be effectively discriminated. At first, by using PCA, the 12 CAPs in the IR spectra of asphalt samples were subjected to dimension reduction processing to control the variables of key factors. Moreover, the scores of various principal components in asphalt samples were calculated. Afterwards, the scores of principal components were analysed through modelling based on multivariate logistic regression analysis to discriminate and predict the crude oil source of different asphalt samples. The result showed that the logistic regression model shows a favourable goodness of fit, with the prediction accuracy reaching 93.9% for the crude oil source of asphalt samples. The method exhibits some outstanding advantages (including ease of operation and high accuracy), which is important when controlling the source and quality and improving the performance of asphalt.

1. Introduction

Asphalt pavements are widely used: as a black binding material produced from oil, asphalt is widely used as the binder in asphaltic mixtures [13]. Due to the differences in origins and production modes of crude oil for producing asphalt, the properties of crude oil exert important influences on the performance of asphalt mixtures, which also lead to significant differences in the performance of the various asphalt produced therewith [48].

The conventional performance of the same grade of asphalt is very similar; however, different asphalt exhibit large differences in various aspects, including high- and low-temperature performance, durability, and fatigue properties, which are considered as external expressions of chemical composition, molecular structure, and transformation of asphalt [911]. Furthermore, the study shows that the differences in the composition and structure of asphalt mainly depend on the source of crude oil and refining process of asphalt production. Due to the differences in the geological structure, oil generation conditions, and age, the nature and composition of crude oil in different regions are very different. However, crude oil with similar properties and composition in the same region has similar processing, storage, and transportation options. At the same time, most of the petroleum asphalt is produced by distillation currently, and the molecules in the asphalt retain their original state in the crude oil. Therefore, most of the composition and structure of asphalt are inherited from crude oil; that is to say, the structural performance of asphalt mainly depends on the source of crude oil. Because the asphalt is produced by different types of crude oil, the physical and chemical composition information about asphalt is unique. Just like the human fingerprint information, these components which can express the unique structure of asphalt can be called the “oil fingerprint” of asphalt. It is because of the uniqueness of “oil fingerprint” information of asphalt that it is feasible to discriminate the oil fingerprints of asphalt from different crude oil sources [1216].

At present, as the composition and structure of asphalt are extremely complex, the characterization of its structure requires more high-resolution and high-throughput analysis means and equipment, so there are few reports on the identification and analysis of asphalt oil fingerprints [17]. However, the identification and analysis of marine oil spill fingerprints has always been an issue of widespread concern. Similar to the method and purpose of identifying “oil fingerprints” of oil spills at sea, the purpose of recognising oil fingerprints of asphalt is to attain oil fingerprint information of asphalt through different methods such as physical, chemical, and biological methods [18]. Moreover, by applying multivariate statistical methods (including principal component analysis (PCA) and regression analysis), the chemical composition variables of oil fingerprints are summarised, classified, and discriminated [19, 20]. On this basis, qualitative and quantitative relationships between data are obtained to distinguish the crude oil source of asphalt, thus effectively controlling their qualities. Meanwhile, some testing methods used in the “oil fingerprint” identification of marine oil spills have been successfully used to analyse the composition and structure of asphalt [2123]. For example, a gas chromatograph-mass spectrometer (GC-MS) was used to explore the chemical compositions of smoke released by asphalt materials during heating [24, 25]. Gel permeation chromatography (GPC) and thin-layer chromatography (TLC) were used to measure the molecular weights and the composition distributions of asphalt [2628]. Nuclear magnetic resonance (NMR) and Fourier transform infrared spectroscopy (FTIR) were used to investigate the compositions, structures, and functional groups of asphalt [29, 30]. In all analytical techniques, compared with other methods (including GC-MS and NMR), which generally show some disadvantages (including high cost, damage to samples, and being laborious and time consuming during analysis), infrared (IR) spectroscopy is the most widely used technique in investigating asphalt materials. The reason is that IR spectroscopy shows many outstanding advantages, including being label-free, rapid, nondestructive, and low-cost, with simple sample preparation [3133]; however, in the above analysis, the chemical structures of asphalt are qualitatively analysed, mainly aiming at those of a certain or multiple specific asphalt samples while lacking quantitative research into the types of asphalt. The research into discrimination of the types of asphalt, tracing of the production area, and quality control of asphalt has not yet been reported.

Therefore, by utilising attenuated total reflectance-Fourier transform infrared spectroscopy (ATR-FTIR), the characteristic functional groups of asphalt from different crude oil source were discriminated and quantitatively analysed. Based on multivariate statistics, PCA and logistic regression analysis were conducted on IR spectral data to establish a discriminant function. An accurate, nondestructive, stable method of discriminating the crude oil source of asphalt samples was explored, which provides a scientific basis for realising reasonable selection, supervision quality, and guaranteed origins of asphalt.

2. Experimental Raw Materials and Methods

2.1. Experimental Materials

During the experiment, 33 asphalt samples were purchased from factories in China for producing asphalt. Before being applied, the asphalt samples were sealed in original oxygen-free containers at 5°C to prevent the samples from being oxidised. Additionally, all asphalt samples were unprocessed before use. As mentioned in Section 1, the differences in the “oil fingerprint” of asphalt are determined by the crude oil from which it is produced. Due to the same geological structure, oil generation conditions, and age in the same region, the composition and chemical structure of crude oil are also very similar. Therefore, the “oil fingerprints” of asphalt produced by crude oil from the same region are very similar, such as crude oil from the Middle East Gulf region, including Saudi Arabia, Iran, Kuwait, Iraq, and United Arab Emirates, crude oil from South America, including Marry, Poscan, Maya, and Castilla, and crude oil from the Bohai Rim region of China, such as Bohai Bay, Huanxiling, and Caofeidian. The crude oil of 33 asphalt samples came from the above three regions. According to the names of the three regions, the crude oil source of asphalt is divided into three categories: Middle East, South America, and the Bohai Rim region of China. The basic performance measures (penetration ratio (ASTM D5), ductility ratio (ASTM D113), and softening point (ASTM D36)) of asphalt and the crude oil source of asphalt are listed in Table 1. It is worth noting that the last digit of the asphalt number listed in Table 1 represents different sampling batches of the same asphalt.


Asphalt numberAsphalt name1Penetration (0.1 mm)Ductility (cm, 10°C)Softening point (°C)SourceCategory

1MM-179.665.747.3Middle East (Saudi Arabia)1
2Q-175.4>10046.5South America (Marry & Poscan)2
3SK (BY)-169.752.246.9Middle East (Saudi Arabia)1
4Q-261.539.048.5South America (Marry & Poscan)2
5SL-163.447.547.0Middle East (Saudi Arabia)1
6CMR-167.5>10047.2South America (Marry)2
7LH-176.6>10045.6South America (Marry & Poscan)2
8QPK-161.8>10046.8Middle East (Iran)1
9MM-261.116.049.6Middle East (Saudi Arabia)1
10QP-168.933.248.7Middle East (Kuwait)1
11JB-167.374.047.5South America (Marry & Poscan)2
12SK (XY)-171.547.947.2Middle east (Saudi Arabia)1
13ZH-167.413.548.1South America (Maya & Castilla)2
14JL-164.235.648.2Middle East (Kuwait)1
15HR-163.285.248.6South America (Marry & Poscan)2
16AS-162.184.347.4Middle East (Saudi Arabia)1
17XT-162.723.948.8Middle East (Saudi Arabia)1
18ZH-264.0>10050.1South America (Maya& Castilla)2
19LH-269.0>10047.0South America (Marry & Poscan)2
20QL-164.021.350.0Middle east (Kuwait)1
21KL-179.8>10047.2Bohai Rim region of China (Bohai Bay)3
22KL-261.057.849.5Bohai Rim region of China (Bohai Bay)3
23SL-265.030.049.2Middle East (Saudi Arabia)1
24SK (NB)-171.047.948.4Middle East (Saudi Arabia)1
25Q-375.0>10047.3South America (Marry & Poscan)2
26DSZ-169.0>10050.0Bohai Rim region of China (Huanxiling)3
27CMR-268.2>10047.9South America (Marry)2
28JB-264.674.046.3South America (Marry & Poscan)2
29ZH-363.5>10051.2South America (Maya & Castilla)2
30LH-368.7>10047.9South America (Marry & Poscan)2
31DSZ-278.4>10048.6Bohai Rim region of China (Huanxiling)3
32SZ-161.957.849.3Bohai Rim region of China (Bohai Bay)3
33SZ-268.0>10050.9Bohai Rim region of China (Bohai Bay)3

1In this column, 1, 2, and 3 indicate the different sampling batches of the same asphalt.
2.2. FTIR Analysis

Through ATR-FTIR (using a Cary 630 FTIR microscope), the IR spectra of asphalt samples were explored. Within the range of 400–4,000 cm−1, 64 scans were conducted, each at a resolution of 1 cm−1. The samples were placed on the horizontal ATR crystal made of zinc selenide, being subjected to multiple reflections. After each operation, the ATR crystal was cleaned using acetone.

The original spectrum data were first subjected to baseline correction by applying the OMNIC software to eliminate baseline effects. Afterwards, based on the standardised variation diagram of preprocessed spectrum data, the difference in masses of different samples was eliminated.

2.3. Multivariate Statistical Analysis

Through the combination of principal component analysis (PCA) and multiple logistic regression analysis, the infrared spectrum data are analysed to establish the discrimination model of the crude oil source of asphalt. Logistic regression analysis is a multivariate analysis method to analyse and predict attribute-dependent variables based on single or multiple continuous or attribute-independent variables. Furthermore, each variable is required to be independent of each other in variable screening and parameter estimation. In many studies, there is a certain degree of linear dependence between their variables, which is called multicollinearity. This multiple collinear relationship may increase the mean square error and standard error of the estimated parameters, which leads to the instability of the analysis results of the logistic regression model. The main reason for the problem of multicollinearity is the overlap of information. However, PCA can reduce the repeatability of information and achieve the purpose of eliminating multicollinearity by extracting independent principal components from explanatory variables.

For this reason, this study used a multinomial logistic regression model based on PCA to improve the discrimination accuracy of the model. First of all, the PCA was used to reduce the dimension of the CAPs variables of the infrared spectrum, so that the variables with strong correlation were integrated into the same principal components. The principal components were independent of each other; thus, the multiple collinear relationship between variables was eliminated. Then, by using these principal components as independent variables, the discriminant model of crude oil source of asphalt was obtained by logistic regression analysis.

2.3.1. PCA Analysis

PCA refers to a simplification of multidimensional data to several relevant variables (principal components) through a dimension reduction approach. Each principal component reflects most of the information of original variables, and the contained information is not repeated. PCA can compress countless information and simplify complex problems [34]. The modelling process of PCA is as follows:(1)Calculation of the correlation coefficient matrix:where refers to the correlation coefficient of original variables and , , which can be calculated by using the following formula:(2)Calculating eigenvalues and eigenvectors:The characteristic equation was solved. Generally, the eigenvalues were calculated by using the Jacobi method and, in descending order are . The eigenvectors corresponding to eigenvalue were separately calculated, satisfying , that is, where denotes the jth component of vector .(3)Calculating contribution and cumulative contribution of principal components:In general, the eigenvalues with the cumulative contribution not lower than 70% are taken. are the corresponding first, second, …, mth principal components.(4)Calculating the loads of principal components:(5)Scores of various principal components:

2.3.2. Logistic Regression Analysis

Logistic regression is a multivariate analysis method for investigating the relationship between binominal or multinomial observation results (dependent variable) and influencing factors (independent variable), belonging to probabilistic nonlinear regression methods. The logistic regression when the dependent variable only shows two or more states belongs to binomial logistic regression and multinomial logistic regression, respectively [35, 36]. For discriminating and classifying the crude oil of asphalt, multinomial logistic regression is applied to conduct data analysis, owing to the crude oil of asphalt being sourced from the Bohai Rim region of China, South America, and the Middle East.(1)Model fitting:For multinomial logistic regression, a certain level of dependent variables is defined as the reference level herein. Compared with the other levels, i-1 (i refers to the number of dependent variables) generalised logistic regression models were fitted. By taking three-level dependent variables as an example, it is supposed that the values of dependent variables are 1, 2, and 3: the probabilities corresponding to the values are , , and , respectively. Based on m-independent variables, two models are fitted as follows:(2)Meaning of regression parameters:For multinomial logistic regression, each independent variable contains parameters. The parameter represents an independent variable that changes one unit on the premise that other independent variables remain unchanged, and it reflects the variation of the log-odds ratio (OR) of class i. The OR is subjected to logarithmic transformation to obtain the linear mode of the logistic regression model.

3. Results and Discussion

3.1. Establishment of Discrimination Indices for Crude Oil Source of Asphalt

FTIR is an important means of identifying organic compounds. When irradiating organics using the IR light, the molecules absorb the IR light leading to vibrational energy level transition, and different chemical bonds or functional groups show diverse absorption frequencies. The contents of various materials are reflected in their IR absorption spectra, which can be quantitatively analysed according to peak location and absorption intensity. The structural composition of asphalt is complex, and asphalt shows significant differences in behaviour. For these reasons, it fails to effectively characterize the difference of behaviours of asphalt from different crude oil only by quantitatively comparing the peak areas of IR spectrograms. Therefore, by observing the shapes and locations of IR spectrograms, 12 significant characteristic absorption peaks (CAPs) were selected to analyse the transmittances of absorption peaks.

The IR absorption spectra of 33 asphalt samples are similar. By using the mean value method, the mutual mode of the IR spectrogram of all asphalt samples was constructed (Figure 1): the assignments of 12 characteristics peaks are as follows: the strong absorption peaks around 2850 cm−1 and 2920 cm−1 are triggered by the stretching vibration of , and a very weak absorption peak around 1700 cm−1 is induced by the stretching vibration of C=O. Moreover, the vibration of the benzene ring leads to the absorption peak in the vicinity of 1600 cm−1, and the absorption peaks at 1380 cm−1 and 1460 cm−1 are caused by the bending vibration of . The fingerprint region appears below 1300 cm−1, in which the absorption peaks at 1166 cm−1 and 1032 cm−1 are triggered by the stretching vibrations of C=S and S=O, respectively. The stretching vibration of CH results in a weak absorption peak around 969 cm−1, while the absorption peaks at 872 cm−1 and 812 cm−1 are induced by vibrations of an isolated hydrogen and two adjacent hydrogen atoms on the benzene ring, respectively. Additionally, the absorption peak at 723 cm−1 is also caused by the stretching vibration of .

3.2. Analysis of Predictive Variables Based on Descriptive Statistics

The IR spectra of all asphalt samples are similar, and it is difficult to distinguish the differences among asphalt samples by comparing spectrograms alone. Hence, 12 significantly different CAPs were selected from the spectrograms to describe the transmittances of absorption peaks based on descriptive statistics. From two aspects of centralised location (including indices such as average and median) and degree of dispersion (including indices such as extreme value), the samples are described so as to reflect spectrographic data (Table 2).


Statistical indicatorsOil sourceAverage valueMedian valueMaximum valueMinimum value

2920–1Middle East52.1051.6255.4649.83
South America55.8554.2061.0253.29
Bohai Rim region of China54.3953.2956.6453.23

2850–1Middle East59.8259.5163.6058.43
South America63.2662.3967.3860.74
Bohai Rim region of China62.9762.3664.8861.69

1700–1Middle East91.0791.1491.9689.28
South America91.4691.5092.2990.83
Bohai Rim region of China92.9292.4194.0692.29

1600–1Middle East87.3187.3788.2585.10
South America87.6787.4988.7487.20
Bohai Rim region of China90.6690.2691.6390.08

1460–1Middle East68.7168.6470.1067.64
South America69.8169.8171.6868.27
Bohai Rim region of China72.0372.1872.2471.67

1380–1Middle East75.5175.4876.9574.08
South America76.1176.1277.2674.77
Bohai Rim region of China78.5178.5078.9678.07

1166–1Middle East85.2685.4186.3383.43
South America85.8885.9186.8484.65
Bohai Rim region of China89.4889.1090.2589.08

1032–1Middle East84.3784.4986.3582.29
South America85.7886.3086.8082.57
Bohai Rim region of China90.1089.8190.6889.81

969–1Middle East86.5386.7287.7384.97
South America87.2887.2288.1085.98
Bohai Rim region of China90.3390.1590.6990.14

872–1Middle East83.7483.9384.6481.00
South America84.8684.6786.0983.79
Bohai Rim region of China88.9788.5389.8688.50

812–1Middle East80.7381.1281.8577.23
South America82.7082.7983.8880.40
Bohai Rim region of China88.6388.2489.4388.21

723–1Middle East78.1878.1480.7476.17
South America81.3281.3882.7078.25
Bohai Rim region of China85.6184.9087.1584.78

In Table 2, according to the analysis result of descriptive statistics on the transmittances of 12 CAPs, it can be seen that the asphalt produced by crude oil from the Bohai Rim region of China showed a larger transmittance. By contrast, the transmittances of asphalt produced by crude oil from the Middle East and South America were consistently low. However, it is impossible to distinguish the oil source of asphalt based on the descriptive statistics of infrared spectral transmittance of asphalt. Therefore, it is necessary to introduce multivariate statistical analysis methods, such as multinomial logistic regression analysis based on PCA described in Section 2.3.

3.3. Correlation Analysis of Predictive Variables

Correlation analysis aims to explore the correlation among multiple variables, which is also an important parameter for evaluating the fingerprint variables of asphalt [37]. In order to further evaluate whether the selected 12 variables were of sufficient significance to the prediction model, a correlation analysis of the 12 CAP variables was required. Generally, correlation analysis is conducted by applying Pearson and Spearman correlation coefficients. The Pearson correlation coefficient is generally applicable to data satisfying a normal distribution, and the Spearman correlation coefficient is employed for data that do not satisfy a normal distribution. Therefore, before the correlation test, it is necessary to test the normal distribution of 12 variables to determine the appropriate correlation test method.

By using the skewness-kurtosis test method, whether the transmittances of the 12 CAPs of 33 asphalt samples conform to a normal distribution was assessed, and through the K-S test as an auxiliary analysis method, the accuracy of the test results was ensured [38, 39]. The 12 variables were processed by importing them into SPSS19 (Tables 3 and 4).


Statistical indicators (cm−1)SkewnessKurtosis

29201.1231.279
28500.7970.130
17000.7131.241
16001.1201.379
14600.509−1.149
13800.509−0.728
11661.1290.600
10320.737−0.213
9690.9260.074
8721.0520.735
8120.9750.245
7230.6510.459


Statistical indicators2920 cm−12850 cm−11700 cm−11600 cm−11460 cm−11380 cm−11166 cm−11032 cm−1969 cm−1872 cm−1812 cm−1723 cm−1

Number of samples333333333333333333333333
Kolmogorov–Smirnov1.140.770.681.330.990.751.300.981.081.351.140.86
Asymptotically significant (bilateral)0.150.590.730.060.270.630.070.280.190.050.140.43

It can be seen from Table 3 that the values of skewness and kurtosis of transmittances of the 12 CAPs of all asphalt samples produced by three origins of oil fluctuate within a certain small positive and negative range around zero. It can be further seen from Table 4 that the asymptotic significances of the 12 variables all exceed 0.05. Moreover, based on the result of the skewness-kurtosis test, it can be considered that the 12 variables of 33 asphalt samples all conform to a normal distribution, which provides a basis for determining the method for testing correlation among variables. Therefore, the Pearson correlation coefficient is used to analyse the correlation between variables (Table 5).


Correlation coefficient (cm–1)2920 cm−12850 cm−11700 cm−11600 cm−11460 cm−11380 cm−11166 cm−11032 cm−1969 cm−1872 cm−1812 cm−1723 cm−1

29201.00
28500.941.00
17000.150.351.00
16000.220.410.911.00
14600.490.610.710.841.00
13800.300.400.720.850.961.00
11660.130.320.890.960.840.881.00
10320.070.280.850.880.750.790.971.00
9690.120.300.860.920.830.880.990.981.00
8720.250.460.890.970.860.870.980.940.961.00
8120.230.430.840.930.860.870.980.960.980.991.00
7230.420.630.790.860.830.770.900.910.900.940.951.00

As shown in Table 5, the IR CAP at 2850 cm−1 showed a significant correlation with those at 2920, 1460, and 723 cm−1, respectively. Additionally, there are significant correlations between each IR CAP at 1700, 1600, 1460, 1380, 1166, 1032, 969, 872, 812, and 723 cm−1. Moreover, multiple CAPs exhibited a high correlation. The aforementioned CAPs with high correlation covered all 12 CAPs. This showed that the 12 selected CAPs contained most of the fingerprint information about the asphalt, thus providing a basis for selecting variables capable of discriminating the different crude oil sources of asphalt.

3.4. Establishment of Logistic Regression and Discriminant Model Based on PCA
3.4.1. PCA on All Variables

According to the correlation analysis of variables, it can be seen that the information contained in the 12 CAPs shows a certain repeatability. PCA not only can remove repeated information but can retain key information, thus realising dimension reduction. Furthermore, it makes the modelling for logistic regression and discrimination more reliable due to reducing the disturbance caused by accidental factors.

The transmittances of the 12 CAPs of 33 asphalt samples are input into the SPSS19 software for PCA. The results are displayed in Table 6 and Figure 2. As shown in Table 6, there are three principal components whose eigenvalues exceed one. The first, second, and third principal components explain 77.658%, 15.498%, and 3.508% of the nature of the original variables, respectively. The cumulative variance contribution of the three principal components is 96.664% (research shows that there is a high explanation rate when the cumulative contribution is higher than 70%). It can be seen from the scree plot (Figure 2) that the broken lines of the first three principal components are steep while later tending to become shallower. This further indicates that it is appropriate to extract the three principal components (PCA1, PCA2, and PCA3). According to the correlation coefficients between the principal component and the original variables, the principal components Y1, Y2, and Y3 are separately expressed as follows:where represent the transmittances of CAPs at 2920, 2850, 1700, 1600, 1460, 1380, 1166, 1032, 969, 872, 812, and 723 cm−1, respectively.


Number of principal componentsEigenvalueVariance contribution (%)Cumulative variance contribution (%)

19.31977.65877.658
21.86015.49893.156
31.0173.50896.664
40.2552.12398.787
50.0810.67799.464
60.0310.25699.720
70.0190.15999.879
80.0060.05199.931
90.0040.03399.964
100.0030.02199.985
110.0020.01399.998
120.0000.002100.000

3.4.2. The Process and Result of Multinomial Logistic Analysis

By substituting the transmittances of the 12 CAPs of 33 asphalt samples into formulae (911), the scores of the three principal components can be calculated (Table 7). Moreover, the scores of the principal components are taken as factors, and three kinds of origins of asphalt are considered as dependent variables. Among them, the crude oil from the Bohai Rim region of China is regarded as a reference group to establish a multinomial logistic regression model based on principal components. On this basis, the parameters of the three principal components used for the logistic regression model are obtained.


Asphalt numberAsphalt name2PCA1PCA2PCA3

1MM-1−0.446−0.840−0.024
2Q-1−0.5370.8761.484
3SK (BY)-1−0.870−0.225−0.392
4Q-2−0.3830.118−1.233
5SL-1−0.9860.240−0.471
6CMR-1−1.7960.963−0.386
7LH-1−0.865−1.1510.438
8QPK-1−0.123−0.995−0.555
9MM-2−1.042−0.455−0.089
10QP-1−0.593−1.212−0.046
11JB-1−0.742−1.037−0.711
12SK (XY)-1−1.152−0.3350.338
13ZH-1−0.470−0.919−0.641
14JL-1−0.113−0.717−1.306
15HR-1−0.523−0.7570.423
16AS-1−0.316−0.0981.739
17XT-1−0.2680.1562.187
18ZH-20.4182.666−0.827
19LH-2−0.4342.149−1.062
20QL-1−0.4510.3360.688
21KL-10.175−0.225−0.545
22KL-2−0.1530.2210.312
23SL-20.6271.0810.147
24SK (NB)-1−0.2680.1562.187
25Q-30.4182.666−0.827
26DSZ-10.175−0.225−0.545
27CMR-2−0.1530.2210.312
28JB-21.536−0.789−0.679
29ZH-31.626−0.611−1.122
30LH-32.2730.0721.503
31DSZ-21.536−0.789−0.679
32SZ-11.626−0.611−1.122
33SZ-22.2730.0721.503

2In this column, 1, 2, and 3 indicate the different sampling batches of the same asphalt.

Based on the parameter from regression, the logistic regression model can be obtained as follows:where refer to the probabilities of crude oil sources (the Middle East, South America, and Bohai Rim region of China) of asphalt and Y1, Y2, and Y3 denote the first, second, and third principal components, respectively.

By substituting expressions (9), (10), and (11) into expression (12), the expression (formula (13)) for characterising the relationship between the logistic regression model and the 12 variables can be acquired. During discrimination and prediction, the probabilities of crude oil sources of asphalt can be separately acquired by substituting the transmittances of the 12 CAPs of the asphalt. The maximum probability corresponds to the predicted origin:

Additionally, to validate whether the model shows adequate practical meaning, it is necessary to test the goodness of fit, pseudo R-squared, and likelihood ratio of the model. The tests (including the Pearson chi-square test and the deviance chi-square test) of goodness of fit can test whether the model fits the original data, or not. If the significance level exceeds 0.05, the fitting effect is favourable. The pseudo R-squared value can verify the degree of explanation offered by the model for information contained in its original variables, which is shown in Cox, Nagelkerke and McFadden pseudo R-squared values. The closer the result is to 1, the better the explanation. The likelihood ratio test measures the contribution of original variables to the model. If the significance level is lower than 0.05, the contribution of original variables is high.

According to the test result (Table 8) obtained through use of the logistic regression model, the goodness of fit, pseudo R-squared, and likelihood ratio of the model all satisfy test requirements. This indicates that the extracted principal components PCA1, PCA2, and PCA3 also retain key information about the data while effectively realising dimension reduction, which makes a significant contribution to the construction of the logistic regression model. The final result obtained through model regression is also meaningful.


Test of goodness of fitCoefficientChi-squareSignificance level
Pearson5.5021
Deviance6.0781

Test of pseudo R-squaredCox and SnellNagelkerkeMcFadden
0.8490.9710.971

Test of likelihood ratioParameterChi-squareSignificance level
B13.5210.001
F147.3350.000
F28.5540.014
F33.3890.042

3.4.3. Validation of Discriminatory Effect of the Model

By taking IR CAPs of 33 original asphalt samples as verification samples, the discrimination effect obtained through the multinomial logistic regression model in multivariate statistical analysis was evaluated by applying formula (13). The discrimination result of multinomial logistic regression in multivariate statistical analysis is shown in Table 9.


Observed valuePredicted value
Middle EastSouth AmericaBohai Rim region of ChinaDiscrimination accuracy (%)

Middle East141093.3
South America111091.7
Bohai Rim region of China006100.0
Percentage (%)45.536.418.293.9

As shown in Table 9, discrimination accuracies of 15, 12, and six asphalt samples separately produced by crude oil sourced from the Middle East, South America, and Bohai Rim region of China are 93.3%, 91.7%, and 100%, respectively. The comprehensive discrimination accuracy is 93.9%. The above result showed that multivariate logistic regression analysis based on PCA can rapidly discriminate the origins of asphalt.

4. Conclusions

Based on ATR-FTIR technology, the infrared spectra of 33 kinds of asphalt produced by crude oil from the Middle East, South America, and Bohai Rim region of China were collected. Furthermore, the 12 selected CAPs of infrared spectra were analysed by multivariate statistics. The comprehensive accuracy of the logistic regression model based on PCA in discriminating asphalt, which were produced by crude oil from three different regions reached 93.9%. The results indicated that the combination of ATR-IR spectral analysis and multivariate statistics can accurately and nondestructively discriminate between different crude oil source of asphalt. Moreover, the method shows some remarkable advantages, including ease of operation, rapidity, and high accuracy, which is important when controlling the origins and quality of asphalt and improving the performance thereof.

The method provided in this paper is suitable for the oil source identification of base asphalt produced by crude oil from different regions and can also provide reference for other kinds of asphalt, such as polymer-modified asphalt. However, the accuracy and applicability of this method need to be further improved. In particular, whether the asphalt produced by crude oil mixing from different regions can be effectively identified needs further research.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was financially supported by the National Natural Science Foundation of China (no. 51608511), Key R&D Project of Shandong Province (2018GGX105013), and Project of Science and Technology Support for Youth Entrepreneurship in Colleges and Universities of Shandong Province (2019KJG004). The authors would like to acknowledge many coworkers, students, and laboratory assistants for providing technical help on instrument analysis.

References

  1. S. Zhao and J. Liu, “Using recycled asphalt pavement in construction of transportation infrastructure: Alaska experience,” Journal of Cleaner Production, vol. 177, pp. 155–168, 2018. View at: Publisher Site | Google Scholar
  2. Y. Tan and M. Guo, “Using surface free energy method to study the cohesion and adhesion of asphalt mastic,” Construction and Building Materials, vol. 47, pp. 254–260, 2013. View at: Publisher Site | Google Scholar
  3. P. Li, J. Liu, and S. Zhao, “Implementation of stress-dependent resilient modulus of asphalt-treated base for flexible pavement design,” International Journal of Pavement Engineering, vol. 19, no. 5, pp. 439–446, 2018. View at: Publisher Site | Google Scholar
  4. F. C. G. Martinho and J. P. S. Farinha, “An overview of the use of nanoclay modified bitumen in asphalt mixtures for enhanced flexible pavement performances,” Road Materials and Pavement Design, vol. 20, no. 3, pp. 671–701, 2019. View at: Publisher Site | Google Scholar
  5. F. Xiao, D. Ma, J. Wang, D. Cai, L. Lou, and J. Yuan, “Impacts of high modulus agent and anti-rutting agent on performances of airfield asphalt pavement,” Construction and Building Materials, vol. 204, pp. 1–9, 2019. View at: Publisher Site | Google Scholar
  6. Y. Sun, W. Wang, and J. Chen, “Investigating impacts of warm-mix asphalt technologies and high reclaimed asphalt pavement binder content on rutting and fatigue performance of asphalt binder through MSCR and LAS tests,” Journal of Cleaner Production, vol. 219, pp. 879–893, 2019. View at: Publisher Site | Google Scholar
  7. M. Guo, Y. Tan, D. Luo et al., “Effect of recycling agents on rheological and micromechanical properties of SBS-modified asphalt binders,” Advances in Materials Science and Engineering, vol. 2018, Article ID 5482368, 12 pages, 2018. View at: Publisher Site | Google Scholar
  8. W. Y. Fan, X. Xin, M. Liang et al., “Study on rheology and storage stability of modified asphalt,” Journal of China University of Petroleum, vol. 39, pp. 165–170, 2015. View at: Google Scholar
  9. M. Lagos-Varas, D. Movilla-Quesada, J. P. Arenas et al., “Study of the mechanical behavior of asphalt mixtures using fractional rheology to model their viscoelasticity,” Construction and Building Materials, vol. 200, pp. 124–134, 2019. View at: Publisher Site | Google Scholar
  10. P. Wang, Z. J. Dong, Y. Q. Tan et al., “Effect of multi-walled carbon nanotubes on the performance of styrene-butadiene-styrene copolymer modified asphalt,” Materials and Structures, vol. 50, no. 1, p. 17, 2017. View at: Publisher Site | Google Scholar
  11. M. Guo, Y. Tan, L. Wang, and Y. Hou, “Diffusion of asphaltene, resin, aromatic and saturate components of asphalt on mineral aggregates surface: molecular dynamics simulation,” Road Materials and Pavement Design, vol. 18, no. 3, pp. 149–158, 2017. View at: Publisher Site | Google Scholar
  12. Y. Xue, Z. Ge, F. Li, S. Su, and B. Li, “Modified asphalt properties by blending petroleum asphalt and coal tar pitch,” Fuel, vol. 207, pp. 64–70, 2017. View at: Publisher Site | Google Scholar
  13. V. Y. Pivsaev, P. E. Krasnikov, A. A. Pimenov, and D. E. Bykov, “Enhancement of adhesive properties of road asphalts, waste oil processing products,” Petroleum Chemistry, vol. 55, no. 1, pp. 80–83, 2015. View at: Publisher Site | Google Scholar
  14. M. N. Siddiqui, “NMR fingerprinting of chemical changes in asphalt fractions on oxidation,” Petroleum Science and Technology, vol. 28, no. 4, pp. 401–411, 2010. View at: Publisher Site | Google Scholar
  15. M. N. Siddiqui, “NMR finger printing of chemical changes in asphalt fractions on oxidation,” Petroleum Science and Technology, vol. 27, no. 17, pp. 2033–2045, 2009. View at: Publisher Site | Google Scholar
  16. R. Ren, K. Han, P. Zhao et al., “Identification of asphalt fingerprints based on ATR-FTIR spectroscopy and principal component-linear discriminant analysis,” Construction and Building Materials, vol. 198, pp. 662–668, 2019. View at: Publisher Site | Google Scholar
  17. C. M. Xu, Y. Liu, S. Q. Zhao et al., “Analysis of heteroatomic compounds in petroleum asphaltenes by high resolution mass spectrometry,” Journal of China University of Petroleum, vol. 37, pp. 190–195, 2013. View at: Google Scholar
  18. Q. F. Wu and Z. C. Tang, “Study on the determinants of economic compensation for ecological damage caused by oil spill at sea,” Journal of China University of Petroleum, vol. 34, pp. 1–8, 2018. View at: Google Scholar
  19. Y. Zhong, H. Wang, G. Lu, Z. Zhang, Q. Jiao, and Y. Liu, “Detecting functional connectivity in fMRI using PCA and regression analysis,” Brain Topography, vol. 22, no. 2, pp. 134–144, 2009. View at: Publisher Site | Google Scholar
  20. M. Kharbach, R. Kamal, M. A. Mansouri et al., “Selected-ion flow-tube mass-spectrometry (SIFT-MS) fingerprinting versus chemical profiling for geographic traceability of Moroccan argan oils,” Food Chemistry, vol. 263, pp. 8–17, 2018. View at: Publisher Site | Google Scholar
  21. P. Y. Zhou, C. S. Chen, J. J. Ye et al., “Combining molecular fingerprints with multidimensional scaling analyses to identify the source of spilled oil from highly similar suspected oils,” Marine Pollution Bulletin, vol. 93, no. 1-2, pp. 121–129, 2015. View at: Publisher Site | Google Scholar
  22. A. Ismail, M. E. Toriman, H. Juahir et al., “Chemometric techniques in oil classification from oil spill fingerprinting,” Marine Pollution Bulletin, vol. 111, no. 1-2, pp. 339–346, 2016. View at: Publisher Site | Google Scholar
  23. J. M. Bayona, C. Domínguez, and J. Albaigés, “Analytical developments for oil spill fingerprinting,” Trends in Environmental Analytical Chemistry, vol. 5, pp. 26–34, 2015. View at: Publisher Site | Google Scholar
  24. E. C. Y. Chan, P. K. Koh, M. Mal et al., “Metabolic profiling of human colorectal cancer using high-resolution magic angle spinning nuclear magnetic resonance (HR-MAS NMR) spectroscopy and gas chromatography mass spectrometry (GC/MS),” Journal of Proteome Research, vol. 8, no. 1, pp. 352–361, 2009. View at: Publisher Site | Google Scholar
  25. Y. Li, S. Wu, and S. Amirkhanian, “Investigation of the graphene oxide and asphalt interaction and its effect on asphalt pavement performance,” Construction and Building Materials, vol. 165, pp. 572–584, 2018. View at: Publisher Site | Google Scholar
  26. G. Xu and H. Wang, “Molecular dynamics study of oxidative aging effect on asphalt binder properties,” Fuel, vol. 188, pp. 1–10, 2017. View at: Publisher Site | Google Scholar
  27. N. Nciri and N. Cho, “A thorough study on the molecular weight distribution in natural asphalts by gel permeation chromatography (GPC): the case of Trinidad lake asphalt and asphalt ridge bitumen,” Materials Today: Proceedings, vol. 5, no. 11, pp. 23656–23663, 2018. View at: Publisher Site | Google Scholar
  28. D. O. Larsen, J. L. Alessandrini, A. Bosch, and M. S. Cortizo, “Micro-structural and rheological characteristics of SBS-asphalt blends during their manufacturing,” Construction and Building Materials, vol. 23, no. 8, pp. 2769–2774, 2009. View at: Publisher Site | Google Scholar
  29. H. Yao, Q. Dai, and Z. You, “Fourier transform infrared spectroscopy characterization of aging-related properties of original and nano-modified asphalt binders,” Construction and Building Materials, vol. 101, pp. 1078–1087, 2015. View at: Publisher Site | Google Scholar
  30. F. Zhang, J. Yu, and J. Han, “Effects of thermal oxidative ageing on dynamic viscosity, TG/DTG, DTA and FTIR of SBS-and SBS/sulfur-modified asphalts,” Construction and Building Materials, vol. 25, no. 1, pp. 129–137, 2011. View at: Publisher Site | Google Scholar
  31. A. Dazzi, C. B. Prater, Q. Hu, D. B. Chase, J. F. Rabolt, and C. Marcott, “AFM-IR: combining atomic force microscopy and infrared spectroscopy for nanoscale chemical characterization,” Applied Spectroscopy, vol. 66, no. 12, pp. 1365–1384, 2012. View at: Publisher Site | Google Scholar
  32. M. Lopes, V. Mouillet, L. Bernucci, and T. Gabet, “The potential of attenuated total reflection imaging in the mid-infrared for the study of recycled asphalt mixtures,” Construction and Building Materials, vol. 124, pp. 1120–1131, 2016. View at: Publisher Site | Google Scholar
  33. Y. Xiong, G. Chen, S. Guo, and G. Li, “Lifetime prediction of NBR composite sheet in aviation kerosene by using nonlinear curve fitting of ATR-FTIR spectra,” Journal of Industrial and Engineering Chemistry, vol. 19, no. 5, pp. 1611–1616, 2013. View at: Publisher Site | Google Scholar
  34. G. Y. Chen and S. E. Qian, “Denoising of hyperspectral imagery using principal component analysis and wavelet shrinkage,” IEEE Transactions on Geoscience and Remote Sensing, vol. 49, no. 3, pp. 973–980, 2010. View at: Publisher Site | Google Scholar
  35. S. Tonidandel and J. M. LeBreton, “Determining the relative importance of predictors in logistic regression: an extension of relative weight Analysis,” Organizational Research Methods, vol. 13, no. 4, pp. 767–781, 2010. View at: Publisher Site | Google Scholar
  36. R. Azen and N. Traxel, “Using dominance analysis to determine predictor importance in logistic regression,” Journal of Educational and Behavioral Statistics, vol. 34, no. 3, pp. 319–347, 2009. View at: Publisher Site | Google Scholar
  37. J. Adler and I. Parmryd, “Quantifying colocalization by correlation: the Pearson correlation coefficient is superior to the Mander’s overlap coefficient,” Cytometry Part A, vol. 77A, no. 8, pp. 733–742, 2010. View at: Publisher Site | Google Scholar
  38. A. Soberón and W. Stute, “Assessing skewness, kurtosis and normality in linear mixed models,” Journal of Multivariate Analysis, vol. 161, pp. 123–140, 2017. View at: Publisher Site | Google Scholar
  39. A. D. Ho and C. C. Yu, “Descriptive statistics for modern test score distributions,” Educational and Psychological Measurement, vol. 75, no. 3, pp. 365–388, 2015. View at: Publisher Site | Google Scholar

Copyright © 2020 Ruibo Ren et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder
Views275
Downloads346
Citations

Related articles

Article of the Year Award: Outstanding research contributions of 2020, as selected by our Chief Editors. Read the winning articles.