Abstract

Proper anesthesia is very important for patients to get through surgery without pain and then avoid some other problems. By monitoring the depth of sedation for a patient, it could help a clinician to provide a suitable amount of anesthetic and other clinical treatment. In hospital, a patient is usually monitored by different types of biological systems. To predict the depth of sedation from biological signals is able to ease patient monitoring services. In this study, continuous restricted Boltzmann machines based neural network is proposed to predict the depth of sedation. The biological signals including heart rate, blood pressure, peripheral capillary oxygen saturation, and body weight are selected as analytic features. To improve the accuracy, the signals related to the state of anesthesia including fractional anesthetic concentration, end-tidal carbon dioxide, fraction inspiration carbon dioxide, and minimum alveolar concentration are also adopted in this study. Using minimizing contrastive divergence, a continuous restricted Boltzmann machine is trained and then used to predict the depth of sedation. The experimental results showed that the proposed approach outperforms feed-forward neural network and modular neural network. Besides, it would be able to ease patient monitoring services by using biological systems and promote healthcare quality.

1. Introduction

It is very important to help patients get through surgery without pain and avoid some other problems; thus clinicians need to provide proper anesthesia in surgery [14]. Under general anesthesia, memory and awareness are critical components of the depth of sedation (DOS). However, it needs additional monitoring equipment and sensors to predict the DOS. But, using this equipment and these sensors is uncomfortable for a patient. In clinical practice, a patient is usually monitored by different types of biological systems. Hence, it is important to predict the DOS from biological signals. It then can ease patient monitoring services and promote healthcare quality.

The assessment and interpretation of anesthesia are always based on patients’ subjective reports, physiological monitors, and evaluation indices. Generally, electromyography, electrocardiography, blood pressure (BP), electroencephalogram, and peripheral capillary oxygen saturation (SpO2) are monitored by physiological monitors. Bispectral index (BIS), entropy, auditory evoked potential, and surgical stress index are treated as evaluation indices. According to these data, an anesthetist is able to give an objective report of general anesthesia. But these approaches have their limits [5]. BIS is not real-time approach and it has a latency of 30 seconds or more [6]. The monitoring environment [7] for auditory evoked potential has a strict requirement. Entropy is sensitive to artifacts and noises caused by motions of body, such as eye movement and cough [8]. Besides, the equipment used in these approaches is very expensive and needs electrodes, which would be uncomfortable for a patient.

Recently, the researchers showed that the parasympathetic activity is significant relative to DOS [9, 10]. Generally, the autonomic nervous system plays an important role in maintaining physiological activity. Therefore, the parasympathetic activity is usually monitored in clinical practice and can be represented as BP, heart rate (HR), sweat, and tear formation. Therefore, these physiological signals are used as a standard measure of autonomic reactions in clinical practice [11]. HR variability is closely related to autonomous nervous system, so the HR variability would vary remarkably between different narcotic states [12]. Because the HR is significant to monitor the depth of anesthesia, Li et al. applied HR variability to measure the degree of anesthesia [13]. In clinical practice, the strict safety limits are defined regarding respiratory rate, HR, mean arterial blood pressure, SpO2, transcutaneous partial pressures of oxygen, and carbon dioxide [14]. Consequently, using these features would be feasible to improve the accuracy of estimation of DOS.

In the past decades, the restricted Boltzmann machine (RBM) had been widely used in many applications including dimensionality reduction, classification, collaborative filtering, feature learning, and topic modeling [1521]. A RBM is a bipartite Markov random field wherein the input layer is associated with observed responses, and the output layer consists of hidden binary factors of variation [19]. The input layer is associated with observed responses and the output layer typically consists of hidden binary factors of variation. The standard type of RBM has binary-valued input and output layers. To extend the topics of applications, a continuous RBM (CRBM) is proposed to improve the modeling ability with both artificial and real continuous data by minimizing contrastive divergence (MCD) [2224]. Therefore, the CRBM can be applied to different types of features and it could provide unsupervised feature learning and topic modeling. According to the advantages of CRBM, using CRBM as features learning and pattern classification would be able to preciously predict the DOS of a patient from biological signals.

In this study, CRBM based approach is proposed to predict the DOS from biological signals. To accurately predict the DOS, four features including HR, BP, and SpO2 are extracted to represent the biological signals. Because the status of anesthesia plays an important role in clinical anesthesia, integrating the features, which are relative to status of anesthesia, is able to accurately predict the DOS. Thus, four features including fractional anesthetic concentration (AA–FI), end-tidal carbon dioxide (EtCO2), fraction inspiration carbon dioxide (FiCO2), and minimum alveolar concentration (MAC) are adopted. Because these types of features are great relative to body weight (BW), the BW should be also included. Finally, the CRBM whose parameters are trained by MCD is applied to predict DOS.

The rest of this paper is organized as follows. Section 2 describes the CRBM based approach for predicting DOS, including the feature extraction and CRBM. Section 3 presents the results of a series of experiments examining the performance of the proposed approach. In Section 4, the conclusion and recommendations for future research are presented.

2. Method

In this section, a CRBM based approach to predict DOS from biological signals is presented to improve performance beyond that of previously proposed methods. First, two types of features are extracted to represent the biological signals and the status of anesthesia. Second, the parameters of CRBM are estimated by using MCD training algorithm. Finally, the CRBM is used to predict the DOS. The procedure of proposed approach is detailed in the following subsections. Feature extraction is presented in Section 2.1. Then, the CRBM including continuous stochastic unit, diffusion network, and MCD training algorithm is detailed in Section 2.2.

2.1. Feature Extraction

Two types of features are selected in this study. The first type is related to patient’s autonomic nervous system and the features are extracted from biological signals. These features are HR, BP, and SpO2, expressed as follows.(i)Heart rate: a human’s resting HR is determined by numerous and complex factors and then the anesthesia would usually cause a decrease in HR. HR is the speed of the heartbeat measured by the number of heartbeats per unit of time, typically beats per minute.(ii)Blood pressure: BP is the pressure exerted by circulating blood upon the walls of blood vessels. Generally, it is measured in millimeters of mercury. During each heartbeat, blood pressure varies between maximum and minimum pressure.(iii)Peripheral capillary oxygen saturation: SpO2 is an estimation of the oxygen saturation level.

The second type is related to the state of anesthesia and produced by an anesthesia monitor. These features are AA–FI, EtCO2, and FiCO2, expressed as follows.(i)Fractional anesthetic concentration: AA–FI is a chemical compound possessing general anesthetic properties that can be delivered via inhalation.(ii)End-tidal carbon dioxide: EtCO2 is the level of carbon dioxide released at the end of expiration. A high EtCO2 reading in a patient with altered mental status or severe difficulty breathing may indicate hypoventilation and a possible need for the patient to be intubated. A low EtCO2 reading on patients may indicate hyperventilation, so it can quickly reveal a worsening trend in a patient’s condition.(iii)Fraction inspiration carbon dioxide: FiCO2 is the fraction or percentage of carbon dioxide in the space being measured.(iv)Minimum alveolar concentration: MAC is the concentration of the vapor in the lungs that is needed to prevent movement in 50% of subjects in response to surgical stimulus.

Since BW greatly affects the effect of anesthesia, BW is always considered in clinical practice. Thus, the BW is selected as a feature in this study. Since the proposed model is based on neural network, the features of samples are normalized between 0 and 1.

2.2. Continuous Restricted Boltzmann Machine

A CRBM, which is a stochastic neural network, consists of one visible layer and one hidden layer of stochastic neurons with interlayer connections [23]. An example of CRBM is shown in Figure 1. The probability generated from a visible neuron is proportional to the product of the probabilities that are generated by each of the hidden units acting along. The following subsections describe CRBM in detail.

2.2.1. Continuous Stochastic Unit

Let and be the th visible neuron and the th hidden neuron, respectively. Given the states of hidden neurons, the output of can be expressed as where is the bidirectional weights connecting neuron and neuron . is a zero-mean Gaussian noise with variance and it constitutes a noise input component according to the following probability distribution: is the sigmoid function with asymptotes and . Let the activation of be Then, can be rewritten as where is the noise-control parameter and is used to control the slope of the sigmoid function. Therefore, it forms a continuous stochastic unit.

2.2.2. Diffusion Network

The network used in CRBM is a diffusion network, which consists of fully connected neurons with activation . Using a stochastic differential diffusion equation proposed by Movellan et al. [25], the diffusion network for the time evolution of the activation can be derived as where controls the changing rate of and is a state-decay term. provides an additional random element in the network’s behavior. Then, the discrete-time diffusion process for a finite time increment can be written as where is a Gaussian random variable with zero mean and unit variance.

Let and ; then the term can be canceled. Equation (6) can be rewritten as

If is symmetric and is a constant, (7) is equivalent to the total input of a CRBM as given by (1). When the CRBM is a symmetric restricted diffusion network, the weight update algorithm of the diffusion network can be used for training a CRBM.

2.2.3. MCD Training Algorithm for CRBM

A CRBM is trained by using an iterative training process that minimizes contrastive divergence [26]. In each training cycle, the training samples are sequentially used to modify the interlayer connections weights. Thus, the training rule for any parameter in a diffusion network can be derived as where and are the exception values over the training data with visible states clamped. is the system covariance which is the negative derivation energy function in diffusion network with respect to parameter . By minimizing contrastive divergence and avoiding the time-consuming process of full Gibbs’s sampling to reach equilibrium, the can be treated as the exception value over one-step sample and then denoted by . Thus, (8) can be rewritten as

Since the energy function of CRBM is analogous to the continuous Hopfield model [27], the energy function of CRBM can be derived as where is with . According to (9) and (10), the MCD training rules for weights and noise-control parameters can be derived as where and are the learning rate. is the one-step sample state of neuron and is the mean over the training data. To simplify (12), the integral term is approximate as Therefore, (12) can be rewritten as Finally, the training rules for CRBM in (11) and (14) can be simplified to use addition and multiplication operations.

3. Experimental Results and Discussion

In this study, the inhalational anesthetic was adopted and administered by anesthetists. The sevoflurane was used as the anesthetic gases and the biological signals were monitored by using Datex AS/5 anesthesia monitor. Therefore, a database including 6000 epochs was collected from 27 subjects (13 males and 14 females). 5000 and 1000 epochs were randomly selected as training and testing sets, respectively. Leave-one-out cross-validation was used to evaluate the proposed approach. The mean square error (MSE) was adopted to evaluate the performance of the proposed approach. Because the performance of neural network is greatly dependent on the initial weights connecting neurons, each approach was examined 20 times. The MSE were then used in the following experiments.

3.1. Experimental Results of CRBM in Different Formulas

In this subsection, the types of features and the number of neurons in hidden layer were examined. The maximum dimension of features is eight in this study and it is a low dimension for a neural network. Thus, the number of neurons in hidden layer for CRBM was tested from 1 to 100.

Besides, three types of features were considered according to the characteristics of features. The features for first type (denoted by BioFea) were related to the biological signals and they are HR, BP, SpO2, and BW. The features for second type (denoted by AneFea) were related to the state of anesthesia and they are AA–FI, EtCO2, FiCO2, MAC, and BW. Because, these features were greatly affected by BW, the BW was considered in these two types. Finally, the features for third type were all the eight features and were denoted by ComFea.

The experimental results for three types of features were shown in Figure 2. The best performances for biological signals and the state of anesthesia were 0.0524 (with 4 neurons of hidden layer) and 0.0599 (with 69 neurons of hidden layer), respectively. It is clear that the performance of biological signals is better than that of the state of anesthesia. Besides, the number of neurons in hidden layer of CRBM is also quite small and it would greatly improve the computational efficiency. Thus, the biological signals are useful to predict the DOS.

Using ComFea, the MSE was 0.0504 when the number of neurons of hidden layer for CRBM was 6. It is clear that the proposed approach using ComFea achieved the best performance. Therefore, the CRBM could effectively transform the features into a reasonable domain. It is also able to obtain a reasonable result in low dimension. Thus, the CRBM is very suitable in this topic. To avoid the effect of initial weights of a neural network, the proposed approaches were examined 20 times and then the experimental results of three types of features, which were expressed in maximum MSE, average MSE, minimum MSE, and number of neurons in hidden layer, were shown in Table 1. We found that the minimum MSE of AneFea is larger than the maximum MSE of BioFea and ComFea. Therefore, the predictive ability of AneFea is lower than that of other types of features. Since the performance of BioFea is similar to that of ComFea, the proposed approach can be integrated with other biological systems. Then, it would effectively reduce the computational complex and increase the value of biological systems.

An example of predicted results was shown in Figure 3. The black line was the DOS for BIS, which was predicted from the signals of EEG. We could find that the variation of DOS estimated from BIS was larger than that of proposed approach (shown in red line). Therefore, the energy of EEG is noise-sensitive and it would cause the result of DOS to be unstable. In clinical anesthesia, these noise-sensitive components can be ignored; thus the proposed approach provides reasonable results.

3.2. Comparison with Other Approaches

To compare the proposed approach, the feed-forward neural network (denoted by ANN) and modular neural network (MNN) were implemented as baseline system. Because the features of MNN should be classified into two groups, three grouping types were examined in this study. For the first grouping type of MNN, the features including HR, BP, SpO2, and EtCO2 were selected as one group and the others were another group (denoted by MN–1) [28]. For the second grouping type of MNN, the features, which were relative to the status of anesthesia (AA–FI, AA–ET, AA–MAC, and BW), were classified as one group and the others were another group (denoted by MN–2). Since the EtCO2 could be used to reflect the status of anesthesia, it was treated as a feature of the status of anesthesia. Therefore, the features including AA–FI, AA–ET, AA–MAC, EtCO2, and BW are selected as one group and the others were another group (denoted by MN–3).

The experimental results expressed as average MSE, maximum MSE, and minimum MSE were shown in Figure 4. The average MSE for ANN, MN–1, MN–2, MN–3, and CRBM were 0.0665, 0.0590, 0.0551, 0.0515, and 0.0504, respectively. It is clear that the proposed approach outperforms other approaches. Besides, the performance of modular neural network is better than that of ANN. The MNN can accurately integrate the individual results, which are predicted from biological signals and state of anesthesia. In Figure 4, we also found that the performance variation for ANN is greater than that of other approaches. Therefore, the performance of ANN is closely dependent on the initial weights of a neural network. Using the CRBM, the effect of initial weights of neural network can be greatly reduced.

Comparing the experimental results of MN–1, MN–2, and MN–3, the performances were quite different for different grouping types. Therefore, the grouping type of features plays an important role in MNN. To achieve an acceptable result, domain knowledge for features could help users to design the structure of MMN, but it is difficult. Therefore, CRBM could provide an appropriate approach to automatically map the features into a reasonable space. The experimental results showed that the performance of CRBM is slightly better than that of modular neural network. Thus, it is an effective way to deal with this problem. Moreover, the effect of initial weights can be reduced.

4. Conclusion

In this study, a novel approach based on continuous restricted Boltzmann machines is proposed to predict the depth of sedation. The features, which were relative to biological signals and states of anesthesia, successfully represented the characteristics of DOS. Using the CRBM, these features could be effectively mapped to an appropriate domain and then obtained a best performance. Besides, the biological signals, which were used to represent the characteristic of autonomic nervous system, can be individually used to predict the DOS and the results were suitable for many biological systems in clinical practice. The experimental results demonstrated that the proposed approach outperforms feed-forward neural network and modular neural network. Therefore, the proposed approach is able to ease patient monitoring services by using biological systems and promote healthcare quality.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgment

The authors would like to thank the Ministry of Science and Technology of Taiwan for financially supporting this research under Contract MOST 103-2221-E-218-009.