Assessment of Electrocardiogram Rhythms by GoogLeNet Deep Neural Network Architecture

Kim, Jeong-Hwan; Seo, Seung-Yeon; Song, Chul-Gyu; Kim, Kyeong-Seop

doi:https://doi.org/10.1155/2019/2826901

Journal of Healthcare Engineering

On this page

Abstract Introduction Results Conclusions Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2019 | Article ID 2826901 | https://doi.org/10.1155/2019/2826901

Assessment of Electrocardiogram Rhythms by GoogLeNet Deep Neural Network Architecture

Jeong-Hwan Kim,¹Seung-Yeon Seo,¹Chul-Gyu Song,²and Kyeong-Seop Kim¹

Academic Editor: Chandan Karmakar

Received30 Aug 2018

Revised15 Feb 2019

Accepted21 Mar 2019

Published28 Apr 2019

Abstract

The aim of this study is to design GoogLeNet deep neural network architecture by expanding the kernel size of the inception layer and combining the convolution layers to classify the electrocardiogram (ECG) beats into a normal sinus rhythm, premature ventricular contraction, atrial premature contraction, and right/left bundle branch block arrhythmia. Based on testing MIT-BIH arrhythmia benchmark databases, the scope of training/test ECG data was configured by covering at least three and seven R-peak features, and the proposed extended-GoogLeNet architecture can classify five distinct heartbeats; normal sinus rhythm (NSR), premature ventricular contraction (PVC), atrial premature contraction (APC), right bundle branch block (RBBB), and left bundle brunch block(LBBB), with an accuracy of 95.94%, an error rate of 4.06%, a maximum sensitivity of 96.9%, and a maximum positive predictive value of 95.7% for judging a normal or an abnormal beat with considering three ECG segments; an accuracy of 98.31%, a sensitivity of 88.75%, a specificity of 99.4%, and a positive predictive value of 94.4% for classifying APC from NSR, PVC, APC beats, whereas the error rate for misclassifying APC beat was relative low at 6.32%, compared with previous research efforts.

1. Introduction

Heart disease progresses because of insufficient blood supply to the heart when coronary artery disease develops or arrhythmias are severe and long-lasting [1]. According to statistics from the Centers for Disease Control and Prevention (CDC), heart disease is the world’s leading cause of death. Consequently, early diagnosis of cardiovascular disease is important for reducing the devastating impact and increasing quality of life. The primary screening tool for heart disease, the electrocardiogram (ECG), provides diagnostic features that determine the existence of irregular heartbeats by measuring and recording the electrical activity of the heart [2].

Common heart-monitoring devices for cardiac arrhythmias include Holter equipment, which continuously collects 24 hours ECG data [3, 4]. Various ECG arrhythmia classification algorithms have been developed [5–8]. Artificial neural network models with backpropagation algorithms have also been proposed to classify ECG data into normal and abnormal patterns [9–11] by training and testing the PhysioNet MIT-BIH Arrhythmia benchmark database [12].

Recently, deep learning models trained on image data have been applied to interpret ECG data for automatic classification of arrhythmias [13–15]. Deep learning is a subfield of machine learning, and it aims to learn features from three or more hierarchical layers to solve the complex tasks that were difficult for shallow neural network models [16, 17].

Concerning patient-specific ECG heartbeat classification via a deep learning approach, Kiranyaz et al. [18] implemented a 1-dimensional convolutional neural network (CNN) classifier with training and testing MIT-BIH arrhythmia records acquired from 44 patients. This proposed CNN architecture achieved a classification performance detecting ventricular ectopic beats (VEBs) and supraventricular ectopic beats (SVEBs) with an accuracy of 99% and 97.6%, respectively. Zhang et al. [19] claimed a higher accuracy of detecting VEB and SVEB beats (99.7% and 99.3%, respectively) by proposing a patient-specific ECG classifier based on recurrent neural networks and a clustering technique. However, deep learning models for classifying ECG beats might encounter difficulties in overtraining caused by quasi-periodic behaviour of ECG data. Therefore, it is necessary to exploit the number of samples contained in an ECG segment for representing input variables to avoid overtraining. Thus, we first propose a way of determining the optimal number of ECG segments used to encode the input variables. Then, we build a modified deep neural network based on GoogLeNet deep learning architecture [20, 21] with modifying inception modules to classify ECG beats into premature ventricular contraction (PVC), atrial premature contraction (APC), right bundle branch block (RBBB), and left bundle brunch block (LBBB). For the experimental tests for ECG classification, the annotated information and raw data of the MIT-BIH database from 44 patients were evaluated by our proposed GoogLeNet deep neural architecture for expanding the kernel size of the inception structure.

2. Representations of ECG Segments

2.1. Determination of ECG Intervals for Encoding Input Data

To determine the number of ECG samples for supplying input data, we define R-peak features of ECG data as follows: R_n: reference beat in which the R-peak occurs in the training and test dataset R_n−k: time position in which the ()^thR-peak occurs with respect to the reference beat R_n+k: time position in which the ()^thR-peak occurs with respect to the reference beat R_n−k·R_n+k: time interval between the time locations of R_n−k-peak and R_n+k-peak.

In our study, the range of training/test data, [, ], ECG interval was configured to include R_n-reference peaks by covering at least three and seven R-peaks prior and posterior to R_n-peak by forming

Because the number of samples contained in the ECG interval was different depending on a patient’s type of arrhythmia, a normalization process was necessary to unify the number of samples for training and testing input data. Kiranyaz et al. [18] and Zhang et al. [19] defined an ECG segment for the input data by including three R–R intervals. In our study, the test/training ECG data were acquired by performing the normalization process with multiplying the number of (R–R intervals-1) by 100 samples to avoid an aliasing problem. Figure 1 shows an example of defining the data interval with four ECG segments and the number of normalized samples.

2.2. Training and Testing Set

To classify the ECG rhythm using our proposed deep learning architecture, the rhythm of arrhythmia was classified into normal sinus rhythm (NSR), APC, PVC, RBBB, and LBBB as input data for training and testing [22–24]. In the considered segment of the MIT-BIH arrhythmia database, various arrhythmias are present, and the classification is determined with a reference beat.

All 48 patients from the MIT-BIH arrhythmia data were randomly selected, excluding four with pacemakers. Table 1 shows the total number of rhythms from the MIT-BIH arrhythmia data and the number of data used for training and testing the GoogLeNet deep learning model.

3. GoogLeNet Deep Learning Structure

The existing depth learning model can achieve high accuracy by deepening the layers to increase the performance of the neural network. A major drawback of this model is that the computational complexity increases exponentially as the layer becomes deeper. Google introduced the inception structure at 2014 ImageNet Large-Scale Visual Recognition Challenge (ILSVRC14), being the best-performing model and is called GoogLeNet [20]. At the core of this structure, the inner layer of the neural network was extended to output various correlation distributions based on the idea that the neural network output of each layer has optimal efficiency if various probability distributions with high correlations with the input data are obtained. In the basic inception v1 module, where input data are fed into four independent layers (1 × 1, 3 × 3, 5 × 5 convolution layers and 3 × 3 max-pooling layer), the outputs are combined into a single data set. Inside, the convolutional layers derive various spatial information of the input data, and the maximum pooling layer plays the role of extracting distinct features by reducing the channel and size of the input data. Therefore, the inception module is a method of extracting more information into a smaller layer by widening the layer of the neuron network, which is only composed of the existing depth.

3.1. Designing GoogLeNet Deep Learning Model for Arrhythmia Detection

Currently, the inception structure has been updated to v4. The shape of v1 is slightly extended. In this research, we use the v1 model as a basis to construct three CNN layers, an activation layer, and a maximum pooling layer. Figure 2(a) shows a Design I model using a single inception module, consisting of a complete connection layer and an output layer, and Figure 2(b) represents a Design II model using two inception modules. In Figure 2(c), Szegedy et al. [20] claimed that the use of the incoherence structure was efficient after using the convolution layer. Table 2 summarizes the composite specifications of the constructed incessant model and the detailed parameters for the pooling layer.

(a)

(b)

(c)

3.2. Optimization Parameters of the GoogLeNet Deep Learning Model

The proposed deep learning model was implemented by MATLAB codes using a desktop PC that comprised AMD FX-8350 CPU and 12 GB memory for matrix computational loads under the Windows 10 operating system. As the number of convolution filters inside the inception increases, the number of filters on the second floor increases by fixing the number of filters on the first floor of the reception to 15, when only the inception layer is used to confirm the change in the arrhythmia classification accuracy. We also evaluated the inception model using the convolution layer together. Even if the number of filters in an inception layer increases, as shown in Figure 3, the accuracy is not significantly enhanced. In fact, the accuracy of the model combined with the convolution layer, and inception layer is reduced by the difference in precision because of the ECG interval representation.

(a)

(b)

(c)

To ascertain the influence of the ECG segment on classification accuracy of the arrhythmia in our constructed inception model, the classification accuracy with the highest number of filters on each floor is summarized in Table 3. The combination model of the inception layer with the convolution layer reveals no clear difference from the model of the first floor of the reception, and the difference in accuracy of the ECG segment section decreased from 2.2% to 0.8%. The input data achieved by the inception layer was higher by about 1% in 2-layer inception models. The highest accuracy was achieved with three ECG segments.

3.3. Expansion of Kernel Size in the GoogLeNet Deep Neural Network Model

The conception filter of the basic inception module seems to be suitable for extracting the feature information of the normalized ECG with a kernel size of 1, 3, and 5, but may not be suitable for deriving information between R–R intervals. Therefore, by increasing the kernel size to 10, 50, and 100, time information included in the ECG such as the R–R interval of the ECG signal could be obtained. To apply this inception model, it can only be used in the first layer directly computed with ECG input data. The arrhythmia classification accuracy is summarized in Figure 4 and Table 4, showing the increasing number of internal convolution filters by 2 steps in the range from 1 to 19 of the inception layer. Thus, the number of filters increases, and the accuracy rises little by little, but the accuracy gets converged to a constant value from nine layers, and the maximum accuracy is achieved when it is comprised of three segments of ECG signals.

4. Results

4.1. Evaluation of Arrhythmia of Whole ECG Data

4.1.1. Performance of Basic Inception Module

The highest classification performance was achieved by considering three ECG segments. Figure 5 shows the changes in accuracy and errors, and the arrhythmia classification index of the ECG signal is listed in Table 5. APC has the highest error rate of 13.99% of the percentage, and it costs a half portion of the total error at the rate judged to be an error of each arrhythmia rhythm. Therefore, we need to focus on reducing errors in APC.

(a)

(b)

4.1.2. Expanded Inception Module

The highest accuracy parameter is obtained in the model in which the kernel size was expanded to the nine inception modules, and the number of input data and the number of filters in the ECG three segments are nine. When evaluating the ECG arrhythmia as shown in Figure 6, a change in accuracy and error appears as listed in Table 6, which shows the arrhythmia classification index. From the results of this model, the overall arrhythmia error increased; however, the error rate of APC decreased to 6.78%. Therefore, when using only the first filter on the first floor of inception and nine filters, we can be sure that it effectively responds to APC detection.

(a)

(b)

4.2. Evaluation of Patient-Specific Arrhythmia

The ECG signal can vary depending on the patient. The waveform of the cardiovascular symptoms such as the rhythm changes in shape depending on individual differences and the applied measurement device or method. In addition, when analyzing the rhythm of the MIT-BIH arrhythmia data, although two specialists aided in the evaluation, there were cases where the opinions were classified into rhythms that differ from each other. This is the reason why the type of arrhythmia differed for a given pattern. As a result of evaluating the MIT-BIH arrhythmia data with a deep learning model with five arrhythmia rhythms, the arrhythmia classification accuracy did not exceed 97%.

Therefore, we applied the deep learning model, which the evaluation rather presented the important rhythm of the individual custom heart, and classified the input data into the normal and the abnormal rhythm. Given that MIT-BIH 109, 111, 118, 124, 207, 214, and 232 NSR rhythm does not exist and only LBBB and RBBB rhythm exists, LBBB and RBBB can be regarded as normal rhythms. Furthermore, MIT-BIH arrhythmia data are about 2000 pieces for each number. The normal rhythms occupy a considerable part, the basic rhythms are about 20% of the verification data, and the abnormal rhythm is the number in total. Approximately 100 rhythms were sorted out, and the results were derived.

In our experimental simulations, the performance of accuracy, sensitivity, specificity, and positive predictive value (PPV) was evaluated, while varying the size of input data in our deep learning model during the training and test stage true positives (TPs) and true negatives (TNs) were used as metrics to detect abnormal heartbeats. TP refers to the judgements of arrhythmia rhythm, and TN defines the case of detecting NSR beats. Additionally, false positive (FP) refers to the decision of abnormal heart beat on NSR, and false negative (FN) defines the case of classifying NSR beats on the irregular heartbeats.

Tables 7 and 8 list the accuracy, sensitivity, specificity, and PPV of all the considered data. Figure 7 shows the comparison of results.

(a)

(b)

(c)

(d)

With regard to the accuracy and specificity with the four indicators, given that the NSR rhythms have more beats than the abnormal waveform in MIT-BIH database, a higher accuracy for the classification of NSR beats was obtained for all cases. Therefore, sensitivity and positive predictive value in the NSR classification must be judged more thoroughly. The sensitivity obtained by using an inception model with convolution filter sizes of [1, 3, 5] was between 96.2 and 96.9% accuracy, which is slightly higher than the expanded inception model. In contrast, in terms of positive prediction, the inception model with convolutional kernel size of [10, 50, 100] was somewhat higher with an accuracy ranging between 91.4 and 95.7%.

5. Conclusions

In this study, we explored the influence of detailed parameters by presenting a model suitable for the evaluation of ECG rhythm via various deep learning models. To accomplish this objective, the MIT-BIH ECG arrhythmia database was used to evaluate arrhythmia classification with varying of the inception structure to classify LBBB, RBBB, PVC, and APC rhythm. Based on Figure 4 and Table 4, the number of filters in the inception module should be at least 5 to detect arrhythmia beat. For the comparison with the previous state-of-the-art concerning the classification of heartbeats, we illustrated Table 9 to show the misclassification error rate of classifying NSR, APC, and PVC beats with the additional specifying accuracy, sensitivity, specificity, and positive predictive value in Table 10.

For the case of extended inception deep learning model used in our research, the misclassification error rate of APC was 7.2% for classifying NSR, LBBB, RBBB, APC, and PVC beats, whereas the error rate was 6.32% for classifying only NSR, APC, and PVC beats, which is relatively low compared with the previous research studies. Thus, we can conclude that the extension of the inception deep learning model can detect five distinct ECG rhythms with the highest accuracy of classification for the detection of APC beats.

Data Availability

We tested PhysioNet MIT-BIH arrhythmia benchmark databases which are open public data available at http://www.physionet.org.

Conflicts of Interest

The authors declare that there are no conflicts of interests regarding the publication of this article.

Acknowledgments

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (No. 2018R1A6A3A01011941).

References

Cardiac Arrest Infographic: An Important Public Health Issue, https://www.cdc.gov/dhdsp/docs/cardiac-arrest-infographic.pdf.
Z. Abedin, ECG Interpretation: The Self-Assessment Approach, Blackwell Future, Hoboken, NJ, USA, 2008.
J. Adamec and R. Adamec, ECG Holter: Guide to Electrocardiographic Interpretation, Springer, Berlin, Germany, 2018.
R. Xiao, Y. Xu, M. M. Pelter, D. W. Mortara, and X. Hu, “A deep learning approach to examine ischemic ST changes in ambulatory ECG recordings,” AMIA Summits on Translational Science Proceedings, vol. 2017, pp. 256–262, 2018.
View at: Google Scholar
G. K. Malik, Y. Kumar, and M. Panda, “Cardiac arrhythmia detection in ECG signals by feature extraction and support vector machine,” in Proceedings of the Second International Conference on Research in Intelligent and Computing in Engineering, ACSIS, vol. 10, pp. 241–244, Gospeshwar, Uttrakhand, India, June 2017.
View at: Google Scholar
E. J. d. S. Luz, W. R. Schwartz, G. Cámara-Chávez, and D. Menotti, “ECG-based heartbeat classification for arrhythmia detection: a survey,” Computer Methods and Programs in Biomedicine, vol. 127, pp. 144–164, 2016.
View at: Publisher Site | Google Scholar
D. Ge, N. Srinivasan, and S. M. Krishnan, “Cardiac arrhythmia classification using autoregressive modeling,” Biomedical Engineering Online, vol. 1, no. 5, pp. 1–12, 2002.
View at: Publisher Site | Google Scholar
J. A. Gutiérrez-Gnecchi, R. Morfin-Magaña, D. Lorias-Espinoza et al., “DSP-based arrhythmia classification using wavelet transform and probabilistic neural network,” Biomedical Signal Processing and Control, vol. 32, pp. 44–56, 2017.
View at: Publisher Site | Google Scholar
I. Saini and B. S. Saini, “Cardiac arrhythmia classification using error back propagation method,” International Journal of Computer Theory and Engineering, vol. 14, no. 3, pp. 462–464, 2012.
View at: Publisher Site | Google Scholar
L. V. R. Kumari, Y. Padma Sai, N. Balaji, and R. Gowrisree, “Comparison of artificial neural networks for cardiac arrhythmia classification,” International Journal of Advance Engineering and Research Development, vol. 4, no. 10, pp. 800–805, 2017.
View at: Publisher Site | Google Scholar
M. Mitra and R. K. Samanta, “Cardiac arrhythmia classification using neural networks with selected features,” Procedia Technology, vol. 10, pp. 76–84, 2013.
View at: Publisher Site | Google Scholar
G. B. Moody and R. G. Mark, “The impact of the MIT-BIH arrhythmia database,” IEEE Engineering in Medicine and Biology, vol. 20, no. 3, pp. 45–50, 2001.
View at: Publisher Site | Google Scholar
B. Pyakillya, N. Kazachenko, and N. Mikhailovsky, “Deep learning for ECG classification,” Journal of Physics: Conference Series, vol. 913, pp. 1–5, 2017.
View at: Publisher Site | Google Scholar
A. Isin and S. Ozdalili, “Cardiac arrhythmia detection using deep learning,” Procedia Computer Science, vol. 120, pp. 268–275, 2017.
View at: Publisher Site | Google Scholar
M. M. A. Rahhal, Y. Bazi, H. AlHichri, N. Alajlan, F. Melgani, and R. R. Yager, “Deep learning approach for active classification of electrocardiogram signals,” Information Sciences, vol. 345, pp. 340–354, 2016.
View at: Publisher Site | Google Scholar
H. Mhaskar, Q. Liao, and T. Poggio, “When and why are deep networks better than shallow ones?” in Proceedings of the Thirty-first AAAI Conference on Artificial Intelligence (AAAI-17), pp. 2343–2349, San Francisco, CA, USA, February 2017.
View at: Google Scholar
T. Reasat and C. Shahnaz, “Detection of inferior myocardial infarction using shallow convolutional neural networks,” in Proceedings of the IEEE Region 10 Humanitarian Technology Conference (R10-HTC), pp. 718–721, Dhaka, India, December 2017.
View at: Google Scholar
S. Kiranyaz, T. Ince, and M. Gabbouj, “Real-time patient-specific ECG classification by 1-D convolutional neural networks,” IEEE Transactions on Biomedical Engineering, vol. 63, no. 3, pp. 664–675, 2016.
View at: Publisher Site | Google Scholar
C. Zhang, G. Wang, J. Zhao, P. Gao, J. Lin, and H. Yang, “Patient-specific ECG classification based on recurrent neural networks and clustering technique,” in Proceedings of the LASTED International Conference, pp. 20-21, Innsbruck, Austria, February 2017.
View at: Google Scholar
C. Szegedy, W. Liu, Y. Jia et al., “Going deeper with convolutions,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9, Las Vegas, NV, USA, June 2015.
View at: Google Scholar
C. Szegedy, S. Ioffe, V. T. Vanhoucke, and A. A. Alem, “Inception-v4, inception-resnet and the impact of residual connections on learning,” in Proceedings of the Thirty-First AAI Conference on Artificial Intelligence (AAAI-17), pp. 4278–4284, San Francisco, CA, USA, February 2017.
View at: Google Scholar
L. He, W. Hou, X. Zhen, and C. Peng, “Recognition of ECG patterns using artificial neural network,” in Proceedings of the International Conference on Intelligent Systems Design and Applications, Jinan, China, October 2006.
View at: Google Scholar
J. A. Gutiérrez-Gnecchi, R. M. Magana, D. L. Espinoza et al., “DSP-based arrhythmia classification using wavelet transform and probabilistic neural network,” Biomedical Signal Processing and Control, vol. 32, pp. 44–56, 2017.
View at: Publisher Site | Google Scholar
M. D. Ingole, S. V. Alaspure, and D. T. Ingole, “Electrocardiogram (ECG) signals feature extraction and classification using various signal analysis techniques,” International Journal of Engineering Sciences & Research Technology, vol. 3, no. 1, pp. 39–44, 2014.
View at: Google Scholar
K. Luo, J. Li, Z. Wang, and A. Cuschieri, “Patient-specific deep architectural model for ECG classification,” Journal of Healthcare Engineering, vol. 2017, Article ID 4108720, 13 pages, 2017.
View at: Publisher Site | Google Scholar
T. Ince, S. Kiranyaz, and M. Gabbouj, “A generic and robust system for automated patient-specific classification of ECG signals,” IEEE Transactions on Biomedical Engineering, vol. 56, no. 5, pp. 1415–1426, 2009.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2019 Jeong-Hwan Kim et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

3462

Downloads

1633

Citations