Abstract

The diagnostic performance of an artificial neural network model for chronic HBV-induced liver fibrosis reverse is not well established. Our research aims to construct an ANN model for estimating noninvasive predictors of fibrosis reverse in chronic HBV patients after regular antiviral therapy. In our study, 141 consecutive patients requiring liver biopsy at baseline and 1.5 years were enrolled. Several serum biomarkers and liver stiffness were measured during antiviral therapy in both reverse and nonreverse groups. Statistically significant variables between two groups were selected to form an input layer of the ANN model. The ROC (receiver-operating characteristic) curve and AUC (area under the curve) were calculated for comparison of effectiveness of the ANN model and logistic regression model in predicting HBV-induced liver fibrosis reverse. The prevalence of fibrosis reverse of HBV patients was about 39% (55/141) after 78-week antiviral therapy. The Ishak scoring system was used to assess fibrosis reverse. Our study manifested that AST (aspartate aminotransferase; importance coefficient = 0.296), PLT (platelet count; IC = 0.159), WBC (white blood cell; IC = 0.142), CHE (cholinesterase; IC = 0.128), LSM (liver stiffness measurement; IC = 0.125), ALT (alanine aminotransferase; IC = 0.110), and gender (IC = 0.041) were the most crucial predictors of reverse. The AUC of the ANN model and logistic model was 0.809 ± 0.062 and 0.756 ± 0.059, respectively. In our study, we concluded that the ANN model with variables consisting of AST, PLT, WBC, CHE, LSM, ALT, and gender may be useful in diagnosing liver fibrosis reverse for chronic HBV-induced liver fibrosis patients.

1. Introduction

The progress to liver cirrhosis is a vital stage of chronic hepatitis B (CHB). Recent research demonstrated about 15–40% of CHB patients would progress to cirrhosis, liver failure, or hepatocellular carcinoma [1]. Persistent antiviral treatment for chronic HBV infection would suppress progress to cirrhosis and even implement reverse of fibrosis in the early stage [2].

Liver biopsy was considered the gold standard for distinguishing different stages in the diagnosis of liver cirrhosis, but the value was unstable because of invasiveness, sampling error, lack of standards, or intra- and interobserver agreement [3]. In recent years, noninvasive diagnosis models based on clinical and serological biomarkers for assessing liver cirrhosis have been calculated for CHB patients as the alternative marker to liver biopsy [46].

The correlations between serological biomarkers and the reverse of liver biopsy score are nonlinear and complex. Several research studies have explored the artificial neural network (ANN) model to estimate the correlation between serological biomarkers and the reverse of liver cirrhosis [7, 8]. ANNs base on the machine learning mechanism to identify the complex relationship between input neural units and output neural units. A systematic review suggested that ANNs were known to handle complex relationships better than linear statistic algorithms [8].

The aim of our study was to estimate the effectiveness of the noninvasive ANN model in estimating reverse of liver cirrhosis based on clinical variables and serological biomarkers.

2. Materials and Methods

2.1. Data Sources and Patients

In this study, we used the database from National Science and Technology Major Project, which enrolled 21 hospitals and 298 patients dating from July 2013 to December 2015. The demographic data and key laboratory data of patients were collected.

The enrollment criteria for this prospective study were as follows: treatment-naive patients with chronic HBV-induced fibrosis S2/S3 (similar to F2/F3, Ishak 2/3/4), who consented to undergo liver biopsy before and after treatment; patients who are HBeAg positive, HBVDNA >2 × 104 IU/ml; or patients who are HBeAg negative, HBVDNA >2 × 103 IU/ml.

One treatment group uses entecavir alone for 2 years, and the other group uses entecavir alone for the first 0.5 years and then entecavir plus pegylated interferon (peg-IFN) for 1 year and entecavir for another additional 0.5 years. At the end of treatment, all patients would undergo the second liver biopsy for assessing the liver fibrosis reversion. According to the Ishak scoring system, the decrease ≥1 unit was considered reverse after treatment (Figure 1).

All the patients agreed to follow-ups regularly, and the Ethics Committee of Beijing Friendship Hospital, Capital Medical University, approved the study protocol (BJFH 2014033).

2.2. Serological Biomarkers for Cirrhosis

Patients were assessed at baseline and at every six months for blood count, liver function test, HBVDNA, AFP (alpha-fetoprotein), PT (prothrombin time), thyroid function, liver ultrasonography, and Fibroscan. The second liver biopsy will be performed to evaluate the regression rate of liver fibrosis 1.5 years after initial therapy.

Reverse of liver cirrhosis was defined as the decrease of at least 1 point by the Ishak scoring system after 1.5 years compared with the baseline biopsy score.

2.3. Artificial Neural Networks

An artificial neural network consists of a set of processing units which simulate neurons and are interconnected via a set of weighted connections in a way which allows signals to travel through the network in parallel as well as serially [9].

The constructed ANN in this work consists of three layers. The input layer represents the observed results of serum biochemical and Fibroscan tests. The output layer is the indicator of reverse or nonreverse. The amplitude of the signal transmitted between neurons depends on the signal intensity emerging from the sending neuron and on the weight of their connection, the latter being denoted as the connection weight [10]. The three-layer network with a hidden layer was trained as follows:where is the patient’s number in the training dataset, each node could be abstracted to the corresponding state variable , and is the weighted connections expressing the importance between node i and node j. The output node was calculated to indicate whether the weighted sum is less than or greater than a threshold θ value:in which each node included a transform function , is the expected output, and is the observed output. The output layer was expressed in formula (2) estimated by the transform function:

The hidden layer derived from input nodes was calculated in formula (3). Our network’s output was limited to the binary set {0, 1} according to liver fibrosis reverse after 78-week antiviral therapy [1113].

To provide a reasonable predictive model and comply with the standard way, our dataset was randomly divided into two subsets: 80% of the entire dataset for the training set and 20% for the testing set. The independent variables (input layer) were gender, PLT, WBC, ALT, AST, CHE, and LSM, which were statistically significant between the reverse and nonreverse groups. The dependent variable (output layer) was reverse or nonreverse after 1.5 years.

2.4. Statistical Analysis

Quantitative data were described as median, lower quartile, and upper quartile. And categorical data were described as frequency and percentage. Differences between the groups were compared using Student’s t-test (normally distributed variables) or the Mann–Whitney test (nonnormally distributed variables) for continuous variables. The chi-square test or Fisher’s exact test was used for categorical variables.

A three-layer feedback ANN model with the propagation algorithm was constructed according to the results of univariate and sensitivity analyses. Data were randomly divided into the training group (80%) and testing group (20%) in our exploratory ANN prediction model. Sigmoid transfer functions were performed in the hidden and output layers. Gradient descent was used for assessing connection weights. The overfit penalty was pointed as 0.001, and the convergence criterion was 0.00001 [14]. The outcome variable transformed to the range from 0 to 1 by the normalization algorithm. Liver fibrosis reverse was predicted if the outcome was equal to or greater than 0.5. SPSS modeler 18.0 (SPSS Inc., Chicago, IL, USA) was used for the ANN model.

A stepwise logistic regression analysis was performed to construct a logit model for comparison with our ANN model. The probability for conditional stepwise entry predictors was 0.05 and that for the removal predictors was 0.10. The ROC curve was calculated to assess the sensitivity, specificity, positive and negative likelihood ratios, accuracy, and area under the ROC curve (AUROC). These analyses were performed using SAS 9.2 (SAS Institute Inc., Cary, NC, USA). P values below 0.05 were considered significant.

3. Results

3.1. Distribution of Reverse and Nonreverse in CHB Patients after 1.5 Years of Therapy

A total of 298 patients with chronic HBV-induced fibrosis S2/S3 were enrolled in our study, and 141 patients consented to undergo liver biopsy after 1.5 years of antiviral therapy. About 39% (55/141) of patients were diagnosed reverse based on the Ishak scoring system (Figure 1).

3.2. Univariate Test Results for CHB Patients before and after Treatment

Our study found that HBVDNA (transformed to log10, t = 31.067, ), PLT (Z = 2.700, ), WBC (t = 4.651, ), ALT (Z = 15.555, ), AST (Z = 15.387, ), ALB (albumin; t = 4.711, ), CHE (Z = 4.952, ), TBIL (total bilirubin; t = 3.639, ), PT (t = 7.046, ), INR (international normalized ratio; t = 10.084, ), AFP (Z = 7.220, ), and LSM (t = 7.976, ) were statistically different between baseline and 78 weeks, which are manifested in Table 1.

3.3. Multivariate Analysis of Reverse and Nonreverse Groups after 78 Weeks

At the end of 1.5 years, 141 patients undertook the second liver biopsy for assessing the curative effect. Fifty-five patients were diagnosed reverse and 86 patients were diagnosed nonreverse according to the Ishak scoring system. Seven variables were statistically significant and considered relevant to liver fibrosis reverse by univariate analysis: age (X2 = 4.059, ), PLT (Z = 3.478, ), WBC (t = 3.744, ), ALT (t = 1.988, ), AST (t = 3.060, ), CHE (Z = 3.217, ), and LSM (t = 2.024, ) (Table 2). A logistic regression was constructed to predict liver fibrosis reverse as follows: .

3.4. ANN Analysis

As manifested in Figure 2, AST, PLT, WBC, CHE, LSM, ALT, and gender were the most important predictors of liver biopsy reversion and assigned as the input layer in the ANN model. The importance coefficient of AST was 0.296 which contributed to the vital weight in our model. The three-layer feedback propagation neural network model included one input layer (containing 7 variables as inputs), one hidden layer (containing 4 neurons), and one output layer (Figure 3). The sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, positive predictive value, and negative predictive value were 83.1%, 85.2%, 5.61, 0.19, 93.0%, and 74.5%, respectively, in the ANN model, which manifested better fitting function than logistic regression, as shown in Table 3.

4. Discussion

To our knowledge, this study is the first attempt to use the database of multicenter hospital-based HBV fibrosis patients with two liver biopsies to construct the ANN model for predicting fibrosis reverse after 1.5 years of antiviral therapy. Our results manifested AST, PLT, WBC, CHE, LSM, ALT, and gender were the noninvasive predictors of liver fibrosis reverse. According to ROC curve analysis and AUC analysis, the predictive performance of the ANN model was superior to the results of the logistic regression.

Compared to the classical statistic model based on the hypothesis that input variables and outcomes are linear regression relationships, the ANN is the machine learning algorithm based on the computational system, which could be constructed in a special structure to perform dynamic and continuous learning from knowledge as we input to the model [8]. ANNs have been used to predict the outcome events, including survival and exploring complex relationships between different surgical groups for complex medical decision-making [15]. The ANN used in our study is a standard backpropagation neural network in which the input variable received information from the data and transferred it to the output variable [16]. The stop criterion is recommended to build based on cross validation. It would monitor the error on an independent set of data and stop the training process when this error begins to increase.

In our research, noninvasive diagnosis models consisted of AST, PLT, WBC, CHE, LSM, ALT, and gender, which were statistically significant in univariate and multivariate analysis. Recently, APRI [17] and FIB-4 [18], consisting of ALT, AST, PLT, and age, were commonly used noninvasive serological predictors for fibrosis diagnosis. The WHO (World Health Organization) guidelines recommended that APRI and FIB-4 could be used for HBV-reduced fibrosis assessment in countries and regions with limited medical resources [19]. Furthermore, other noninvasive serum biomarkers were calculated for predicting cirrhosis in liver diseases. The Lok score performed well in predicting clinically significant portal hypertension using transient elastography (TE) [20]. The AST-to-ALT ratio was commonly used to elevate the alcoholic liver disease pattern in patients with hepatitis C who progressed to liver cirrhosis [21]. Yuanyuan Kong et al. found that baseline FIB-4 [18] and Ishak score [22] as well as baseline LSM, PLT, and ALB and their changes during the first 6 months could predict histological reversal in CHB patients on antiviral therapy [23].

However, Marcellin pointed out in their study that APRI and FIB-4 scores were not suitable for CHB patients for assessing hepatic fibrosis according to the Ishak stage, especially in gauging improvements in liver fibrosis following therapy [24]. Noninvasive assessment with either serum biomarkers or TE can be used to monitor improvement in liver fibrosis during antiviral therapy. The correlation of fibrosis improvement predicted by noninvasive measurement with histology has yet to be determined (evidence grading is B2 according to the GRADE system) [25]. Therefore, our study was aimed to construct the most accurate noninvasive predictors of HBV-induced fibrosis reverse.

One of the major limitations of our study was the limited hospital-based sample size, especially with consent to the second liver biopsy, which was the invasive assessment with certain risks in fibrosis patients after 1.5 years of antiviral therapy. In order to improve the predictive performance of our model, a total of 150 CHB patients with baseline biopsy could be collected as the validation dataset in ANN analysis.

In conclusion, the ANN model with serum biomarkers and Fibroscan test consisting of AST, PLT, WBC, CHE, LSM, ALT, and gender could be useful for assessing HBV-induced fibrosis reverse after the antiviral therapy.

Abbreviations

AFP:Alpha-fetoprotein
ALB:Albumin
ALT:Alanine aminotransferase
ANN:Artificial neural network
APRI:AST-platelet ratio index
AST:Aspartate aminotransferase
AUC:Area under the curve
CHB:Chronic hepatitis B
CHE:Cholinesterase
FIB-4:Fibrosis index based on four factors
HBV:Hepatitis B virus
INR:International normalized ratio
LSM:Liver stiffness measurement
PLT:Platelet count
PT:Prothrombin time
ROC:Receiver-operating characteristic
TBIL:Total bilirubin
TE:Transient elastography
WBC:White blood cell
WHO:World Health Organization.

Data Availability

We have registered our study in ClinicalTrials.gov, and all the study data should be submitted to the National Science and Technology after the last patient’s follow-up in 2021. Then, we may share the data through ClinicalTrials.gov.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was supported by grants from the National Science and Technology Major Special Project for Infectious Diseases (No. 2018ZX10302204), Digestive Medical Coordinated Development Center of Beijing Municipal Administration of Hospitals (No. XXX0104), Beijing Municipal Administration of Hospitals Incubating Program (No. PX2018071), and Research Foundation of Beijing Friendship Hospital, Capital Medical University (No. yyqdkt2015-15).