Abstract

Cough is a respiratory protective behavior for clearing the secretion. The cough process can be characterized by three features which are cough peak flow rate, peak velocity time, and cough expired volume. The cough expired volume (CEV) and the cough peak flow rate (CPFR) are important for medical diagnosis and cough effectiveness assessment. In this study, the CEV and CPFR values of 700 healthy participants were measured and collected by using a portable pulmonary function device. The gender, age, height, weight, and smoking status information of the 700 participants were also collected. Meanwhile, the integration of backpropagation neural network and genetic algorithm (GA-BP) method was developed to estimate CEV and CPFR values. The results showed that the estimation accuracy of GA-BP method exceeds 90%, which indicates that the GA-BP method could be effectively used for CEV and CPFR value estimation. Furthermore, the method proposed in this paper could be useful for medical diagnosis and medical device development.

1. Introduction

Cough is a kind of respiratory reflex behavior. When the respiratory tract is stimulated by inflammation, dust, or some particulate matters, cough behavior is performed to clear respiratory secretions to keep the respiratory tract clean and unobstructed [15]. Because of the high pressure difference between the inside and outside of the thoracic cavity, a high airflow rate is generated to impose a great shear force on the surface of secretions and propel them to the mouth [6].

According to previous studies, the cough process continued about 0.4∼0.6 s and can be characterized by three parameters which are cough peak flow rate (CPFR), peak velocity time (PVT), and cough expired volume (CEV) [710]. The CEV and CPFR are the total exhausted air volume and the maximum airflow rate measured in atmosphere normal reference during the whole cough process, respectively.

The cough expired volume (CEV) and the cough peak flow rate (CPFR) are important for medical diagnosis, cough effectiveness assessment, and extubation decision [1113]. However, as for the patients with mechanical ventilation, neuromuscular disease, or other diseases which impairs the cough ability, the CEV and CPFR values cannot be obtained and used for medical diagnosis. Therefore, establishing a relationship between the CEV, CPFR values and human physical information could be used for medical diagnosis for these patients. If the physical information is obtained, the CEV and CPFR values will be estimated.

Previous studies have developed the relations between CEV and CPFR values and human genders, heights, and weights. Leiner et al. found that the CPFR is related to the height and age of human [14]. Mahajan et al. and Singh et al. involved the gender as an influence factor and developed relations between CPFR, CEV, and PVT [8, 9]. Their research involved 100 healthy and nonsmoking volunteers (50 females and 50 males) and showed a direct relationship between CPFR and CEV values [8]. The results showed that the maximum value of CEV reached 5 L with an average of 3 L. Zhu et al. measured the CEV values of three healthy subjects and found that the variation range from 0.8 to 2.2 L with an average of 1.4 L [15]. Gupta et al. investigated 25 healthy subjects (12 females and 13 males) and developed a first-order relation between CEV, CPFR values and genders, heights, and weights through the linear regression analysis method [7]. The results are performed as follows:

Brandimore et al. established a linear relation among CEV values, airflow rates, and number of coughs through analyzing the measured data of 25 participants (14 females and 11 males with an average of 23 years old). The results demonstrated significant linear relationships between expired volume, the total number of coughs, and cough airflow rates [6]. Particle velocimetry (PIV) method has been also used to estimate ranges of cough velocities. Chao et al. collected the average velocity of 50 coughs from eleven healthy volunteers (3 men and 8 women). The estimated maximum cough velocities of male and female were 13.2 m/s and 10.2 m/s, respectively [16]. VanSciver et al. performed over twenty-nine nonsmoking healthy volunteers (ten male and nineteen female volunteers) to obtain and analyze the cough velocity. The results showed that there is no correlation between cough velocity and sex and weight [10].

In this study, 700 healthy participants were involved. The CEV values, CPFR values, genders, heights, weights, ages, and smoking status were measured and recorded. Meanwhile, the integration of backpropagation neural network and genetic algorithm, which is called the GA-BP method was developed to estimate CEV and CPFR values.

2. Materials and Methods

2.1. Experiment Setup

The CEV and CPFR values were measured by using a portable pulmonary function device (Contec Ltd.), which are presented in Figure 1. The CEV and CPFR values acquisition range from 0 to 10 L and 0∼16 L/s, respectively. The acquisition errors are within ±0.05 L and 0.2 L/s, respectively. A disposable connector is installed in the front of the device during the collection. The human subjects hold the connector with their mouths and cough forcefully. After a single cough, the CEV and CPFR values will be displayed on the device screen and stored in the device.

All the participants were receiving the training of how to use this device before measurement. Sitting posture was adopted during measurement.

2.2. Ethical Statement

The CEV, CPFR values, and physical information of 700 human subjects were obtained by the doctor at the Chao Yang Hospital. All the human subjects were agreed to conduct these measurements and signed the informed content. The Chao Yang Hospital ethics committee and human subjects have approved these data collection (Approval number: 20175241).

2.3. Backpropagation Neural Network

BP neural network is a multilayer feedforward neural network with the forward signal transmission and reverse error transmission and could be used to estimate any nonlinear relations through training [1720]. Typically, the BP neural network consists of the input layer, hidden layer, and output layer. The input signals are processed layer by layer from the input layer through the hidden layer to the output layer. If the expected output is not achieved, the signals transfer to backpropagation. According to the prediction error, the neural network weights and threshold values adjust to obtain the expected output values [2124].

In this study, five inputs which represent gender, height, weight, age, and smoking status, and two outputs which represent CEV and CPFR value are set up in the BP neural network. The structure of the BP neural network is presented in Figure 2. The neuron number of hidden layers is set to 11 based on the Hecht–Nelson method.

2.4. BP Neural Network Improved by Genetic Algorithm

Although BP neural network can obtain a good estimated result through the training process, the training time is too long and the results may converge to local optimal values. Genetic Algorithm (GA) is a parallel random search optimization method, which imitates the natural genetic mechanism and Darwin’s principle of biological evolution [2527]. Therefore, the BP neural network is improved by genetic algorithm (GA) to solve this problem, which is called GA-BP [2830]. The GA-BP algorithm is used to search for the most preferable weights and thresholds of neural network. The whole operation process includes initialization, calculating fitness values, selection, crossover, and mutation. This process repeats until the end condition is satisfied.

The data set which consists of the connection weights and thresholds of neural network is regarded as an individual. As shown in Figure 3, the real number coding was used to create the initial values for each individual which consists of ωij, a, ωjk, and b. The ωij and ωjk represented the weights between the input layer and hidden layer and the weights between the hidden layer and output layer, respectively. The a and b represented the thresholds of the hidden layer and output layer, respectively. The number of individuals was set ten in this study.

The absolute error between predicted and expected output of BP neural network was used as the individual fitness value in this study. The fitness function is as below:where E is the individual fitness value, m is the number of neurons in output layer, and Yi and Oi are the predicted output and expected output of the network, respectively.

The goal of selection is that individuals with lower fitness have a greater chance to inherit to the next generation. The roulette method was used for selection operation in this study, as shown in the following formula:where p is the chosen probability of each individual, k is the coefficient, and N is the number of individuals.

The crossover probability was set 0.5. The crossover operation for individual ak and al at the j position was presented in equation (4). The diagram is shown in Figure 4.where α is a random number between [0∼1].

The mutation probability was set 0.4 and the diagram is shown in Figure 5. The new chromosome is calculated through equation (5).where amax and amin are the upper bound and lower bound of gene aij, respectively. The r2 is a random number, is the current iteration number, Gmax is the maximum evolution number, and r is a random number between [0∼1].

The iteration was set 40. The operation process of GA-BP is presented in Figure 6.

3. Results

The statistical results of collected physical information (gender, age, height, weight, and smoking status), CEV, and CPFR values of 700 human subjects are presented in Table 1.

In order to testify the GA-BP method, the ten times tenfold cross-validations method was adopted. 700 groups of data were randomly divided into tenfolds, each fold contains 70 groups of data. Ninefold data were used to train the GA-BP network. The remaining onefold data is used for validation. Through ten different random groupings and exchanging the training and test data, we conducted ten times tenfold cross-validations. The statistical results of data for ten times tenfold cross-validations from test 1 to test 10 are presented in Tables 2 and 3.

The relative errors between estimated and test values for CEV and CPFR in ten times tenfold cross-validations are calculated and presented in Figures 7 and 8, respectively.

4. Discussion

There were 700 participants (430 males and 270 females) involved in this study. There were 106 smokers, accounting for about 15% of the total. The age, height, weight, CEV, and CPFR values were collected and analyzed. The average of CEV and CPFR were 1.54 L and 6.25 L/s, respectively. It is adapt to the statistical results of physical information of participants which presents a young state.

The ten times tenfold cross-validations method was adopted to verify the GA-BP method. From Tables 2 and 3, we found that the average and standard deviation values of age, height, weight, CEV, and CPFR values of training groups were approximate with that of validation groups in all ten times validations. It indicated that all the training and validations groups were selected randomly and the estimated results were reliable.

It is considered that the estimated value is acceptable and relatively accurate when the absolute error is within 10% in this study. From the ten times validation results in Figures 7 and 8, we find that the relative errors of a large number of test samples are within 10% for both CEV and CPFR values. The accuracy of ten times validations and the average values for CEV and CPFR estimations are calculated and presented in Table 4. From Table 4, we found that the accuracy of CEV and CPFR value estimations exceeded the 90% in all ten times validations. The averages of CEV and CPFR estimation accuracy were 95% and 94.57%, respectively. The results indicated that the GA-BP method has a high accuracy and could be effectively used for CEV values and CPFR values estimation.

Considering the low sampling frequency of the portable pulmonary function device, the PVT value estimation is not completed in this study. Even the estimation accuracy is high under current condition and more data especially in a large span of age, height, weight, and other physical information should be involved and used for improving the generalization ability.

5. Conclusions

The heights, weights, ages, smoking status, CEV, and CPFR values of 700 participants were collected and analyzed in this paper. The GA-BP method which integrates backpropagation neural network and genetic algorithm was developed to estimate the CEV and CPFR values. Additionally, the ten times tenfold cross-validation method was adopted to testify the GA-BP method. The results show that the estimation accuracy of GA-BP method used for CEV and CPFR values both exceeds 90%. The averages of CEV and CPFR estimation accuracy reached 95% and 94.57%, respectively. The estimation results verified the accuracy of the GA-BP method.

In the future study, the PVT value estimations will be completed. More information such as compliance and resistance will be measured and used in the GA-BP method to improve the generalization ability.

Data Availability

The datasets generated and analyzed during the current study are available in the Baidu cloud disk repository (https://pan.baidu.com/s/1NLTUx8lPO8l7LocGWpzJ1A&shfl=sharepset; code: 0qv1).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research was funded by the National Natural Science Foundation of China (Grant 51575020) and China Postdoctoral Science Foundation (Grant 2019M660391). It was also funded by Open Foundation of the State Key Laboratory of Fluid Power and Mechatronic Systems (Grant nos. GZKF-201920 and GZKF-201804).