Abstract

Electrocardiography (ECG) is a technique for observing and recording the electrical activity of the human heart. Clinicians routinely use the ECG signal as a time series for examining rhythm disorders in a subject. This investigation set out to automate that task by framing the problem with encoder-decoder techniques, modelling what is typical of the data, and using the loss distribution to decide whether a recording is normal or anomalous. Across a broad range of applications such as speech recognition and prediction, the combination of long short-term memory (LSTM) layers, a fully connected layer (FCL), and convolutional neural networks (CNNs) has outperformed plain deep neural networks (DNNs). DNNs are well suited to mapping features into a more separable space, CNNs to reducing frequency variation, and LSTMs to temporal modelling. This research investigated the complementarity of DNNs, CNNs, and LSTMs by combining them within a single architecture. The ECG data for the investigation were obtained from the MIT-BIH arrhythmia database. Our results demonstrate that the proposed approach can expressively describe ECG series and identify abnormalities with scores that outperform existing supervised and unsupervised methods over both short and long horizons. The LSTM network with FCL also showed that the imbalanced datasets typical of the ECG beat detection problem can be handled consistently and that the method is not overly sensitive to the quality of the ECG signals. The proposed technique is recommended as an aid for cardiologists performing reliable and impartial interpretation of ECG data in telemedicine settings.

1. Introduction

Electrocardiography (ECG) provides a significant amount of information about cardiovascular health and structure, and it is the principal tool for diagnosing cardiac illness [1]. Arrhythmia is a very frequent cardiac ailment that is well researched and understood by specialists in the field. In clinical practice, however, diagnostic mistakes and inaccurate outcomes may occur because of the gap in expertise between experts and the absence of a smooth flow of information [2]. Automated detection of arrhythmias and traceable confirmation of events are therefore critical, so that specialists can be assisted in recognising arrhythmic events before they are missed.

For the most part, the diagnosis of arrhythmia has relied on screened ECG signals, manual feature extraction [3, 4], and heartbeat segmentation [5, 6]. Given the ambiguous complexity of real clinical datasets, careful handling is required to mitigate the possible consequences of a diagnostic prediction error. A complete ECG beat, as illustrated in Figure 1, contains the QRS complex and the P wave and usually a T wave. Figure 1 shows a normal ECG, together with the typical ranges of the wave characteristics. Consequently, medical information technology has been widely employed to evaluate EHR data and accurately characterise a condition by combining artificial intelligence assessments with practical methods. Previous work has extended ensemble estimates to algorithmic characterisations of blood pressure using K-nearest neighbour (KNN), Naive Bayes, and decision trees (DTs) [7]. In addition, three types of SVM classifiers were developed with the goal of predicting cardiac pathology [8], and an automated classification system based on an SVM arrangement of heart sounds has been recommended for identifying cardiovascular conditions [9]. More recently, neural models have shown exceptional efficiency in predicting outcomes and resolving a variety of structural challenges. Health care increasingly relies on deep learning approaches to uncover new knowledge and manage disease from biomedical data, particularly in diabetes, coronary artery disease, and brain disorders [10]. Several therapeutic applications of deep learning are reviewed in more detail in [11]. A number of useful neural network models focus on correctly classifying cardiac diseases [12]. Researchers are currently experimenting with convolutional neural networks (CNNs) to distinguish between various ECG signal classes and to separate ECG information into normal versus pathological structures [13]. Recurrent neural networks (RNNs) may also be used to detect probable conditions by employing unambiguous EHR patient representations and modelling the transitional links among EHR events. Recently, researchers have used gated recurrent units (GRUs) and long short-term memory (LSTM) modules to predict cardiovascular disease risk and transient vascular disease [14]. As deep learning advances, convolutional computations are being employed for a range of feature extraction tasks; compared with morphology-based methods, the technique is less difficult and the signal-quality requirements are less strict [15]. Li et al. [16] were able to recognise and characterise premature ventricular contractions and ventricular ectopic beats using a one-dimensional convolutional neural network (1D CNN). It was claimed in [17] that a comparable 1D CNN grouping may considerably increase system efficacy by providing finer divisions of coronary artery problems than had previously been offered. Some concerns in the literature on ECG arrhythmias remain unresolved, including the loss of ECG signal details during feature mining or noise cleaning, as well as a poor description of the internal fusion technique.

This work presents a model in which 2D grayscale images are fed into an LSTM together with deep 2D CNNs, motivated by the issues described above. Converting the 1D ECG signal into a 2D image makes it possible to avoid the loss of many ECG data points, although the conversion itself is more complicated. Because of the sensitive nature of the material, most current evaluations rely on restricted information. Preprocessing one-dimensional ECG data can have a significant impact on the final accuracy achieved on one-dimensional ECG signals, so most investigations proceed cautiously. More information and finer details can be obtained by converting 1D ECG data into 2D ECG images [18] (a minimal conversion sketch is given below). When converting the data, one question is whether each beat should be separated into its own distinct entity; if all of the signals are separated, noise data may be overlooked by the convolution layers of the model and lead to false positives. Automatic processes such as filtering and feature extraction are not required for 2D ECG images. Since noise data are almost certainly ignored by the pooling and convolution layers in these configurations, the model avoids the question of how noise and precision interact when producing a feature map. Images rather than 1D signals are also used as input in several related disease studies [19, 20] to better capture the ailment. For the detection and classification of rhythm disturbances, 2D ECG images are closer to the path a cardiologist follows when identifying symptoms through visual inspection. In equipment such as ECG monitors, difficulties such as sluggish sampling rates and vibration arise when only a one-dimensional ECG signal is used. ECG monitoring devices will be able to use two-dimensional ECG images more regularly, which will help cardiologists diagnose arrhythmic illnesses. The data augmentation methods used in previous studies are increasingly difficult to apply because of the properties of 1D ECG data; augmenting the ECG signal increases the training data available, which helps improve classification accuracy. To support the 2D CNN methodology, which trains on a single ECG image from several angles, we used a variety of cropping approaches to enlarge the 2D ECG image set. A 2D CNN can automatically extract features from the ECG, replacing hand-designed waveform features that are not robust enough to distinguish tolerable variances in heartbeats. The recurrent neural network (RNN) offers an alternative deep learning mechanism, and the LSTM complements the 2D CNN design by learning from previous observations. All cells in the LSTM's input state are conditioned on the data's state and time components, which guards against the problem of long-term dependence. Even when data have been explicitly removed from the system, LSTM cells can retain and manage useful information [21]. Classification accuracy is greatly improved by combining the LSTM and the 2D CNN.
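The following is a minimal sketch of converting a 1D ECG beat into a 2D grayscale image, as discussed above. The image resolution (128 x 128) and the rendering pipeline are assumptions made for illustration; the paper does not fix these details.

```python
# Illustrative sketch: converting a 1D ECG beat into a 2D grayscale image.
# The image size (128 x 128) and plotting style are assumptions.
import numpy as np
import matplotlib
matplotlib.use("Agg")          # render off-screen
import matplotlib.pyplot as plt

def beat_to_grayscale_image(beat: np.ndarray, size: int = 128) -> np.ndarray:
    """Render a 1D beat (array of samples) as a size x size grayscale image."""
    fig = plt.figure(figsize=(1, 1), dpi=size)
    ax = fig.add_axes([0, 0, 1, 1])
    ax.plot(beat, color="black", linewidth=0.6)
    ax.axis("off")
    fig.canvas.draw()
    # Convert the rendered canvas to an RGB array, then to grayscale in [0, 1].
    rgb = np.asarray(fig.canvas.buffer_rgba())[..., :3]
    plt.close(fig)
    return rgb.mean(axis=2) / 255.0

# Example: a synthetic beat of 188 samples becomes a 128 x 128 image.
synthetic_beat = np.sin(np.linspace(0, 2 * np.pi, 188))
image = beat_to_grayscale_image(synthetic_beat)
print(image.shape)  # (128, 128)
```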

Chest discomfort, pain, and exhaustion, as well as an irregular heart rate and a slew of other symptoms, can all be traced back to cardiovascular disease. Heart disease can be diagnosed using a variety of factors, with age, gender, and other risk variables taken into account. These risk factors, including alcohol intake, smoking, obesity, and a wide range of conditions such as asthma, are linked to each other in some way. The large number of interacting factors makes it challenging for doctors to accurately diagnose and assess heart illness. Traditional classification techniques such as support vector machines (SVMs), a priori algorithms, decision trees, and the hybrid random forest model [22, 23] have already been developed for classifying and analysing EHR data related to coronary disease outcomes. Cardiovascular failure prediction using logistic regression with a Bayesian data definition has been demonstrated, achieving an AUC score of 77% [24, 25].

Convolutional neural networks (CNNs) and multilayer perceptrons (MLPs) have been employed to review foetal pulse recordings with 85% accuracy, and a recurrent neural network (RNN) was proposed that detected abnormal heartbeat rhythms in records with 83% precision. Atrial fibrillation was classified using a long short-term memory network [26, 27], which achieved a 78% accuracy rate and a 79% F1 value on this task [28]. Computerised PCG signal analysis has also been employed to identify the risk of programmed auxiliary cardiac abnormalities (PACAs) in juvenile coronary heart disease diagnostics. To improve the accuracy of coronary disease applications, the BiLSTM estimate was taken into account when designing a bidirectional neural network architecture, which raised accuracy to 99.49% [29]. Biomedical researchers pursue a wide range of objectives with neural networks, and the best results have been realised in clinical imaging using deep learning [30]. To improve automated clinical findings and provide a robust morphological approach to real ECG records, a generative adversarial network (GAN) was created; related work has used an LSTM model to absorb large volumes of electronic clinical data supported by cardiovascular disease specialists and a BiLSTM to generate ambulatory courses as synthetic content [31, 32].

As new ideas like ensemble learning emerged to improve the application of such structures, established knowledge mining techniques saw their reach expand. To analyse and classify cardiac diseases by their presence or absence, the aim is to build a well-known ensemble learning model [33]. One can expect the model's accuracy to be competitive with top-tier findings. Instead of creating a single classifier, the power of ensemble learning was exploited by combining predictions from a number of different classifiers, and AdaBoost computations and the bagging tree were used to lower the risk of heart disease in a case study [34]. A neural network-based ensemble strategy was proposed to produce a highly powerful classification approach and to offer a promising accuracy structure [35]. LSTM-CNN-based identification of cardiovascular failure is an example of such an ensemble learning model.

Training set mismatches can affect how well existing classification models perform when they are applied to real data. Biased classifiers remain focused on a single class and do not generalise the information gathered during training. Resampling approaches such as edited nearest neighbours (ENN), SMOTE, and Tomek links [36] should therefore be utilised during model construction to rebalance the data for greater relevance [37]; a minimal resampling sketch is given below. Using an ECG-based heartbeat classification ensemble learning setup, a well-characterised solution of a stable multiclass grouping problem was established [38].
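The sketch below illustrates the resampling approaches named above (SMOTE, Tomek links, edited nearest neighbours) using the imbalanced-learn library. The library choice, the toy data, and the parameters are assumptions for illustration, not the implementation used in the cited work.

```python
# Illustrative sketch of SMOTE oversampling followed by Tomek-link and ENN cleaning.
import numpy as np
from imblearn.over_sampling import SMOTE
from imblearn.under_sampling import EditedNearestNeighbours, TomekLinks

rng = np.random.default_rng(0)
# Toy imbalanced beat dataset: 1,000 "normal" and 50 "abnormal" beats of 188 samples.
X = np.vstack([rng.normal(size=(1000, 188)), rng.normal(loc=1.0, size=(50, 188))])
y = np.array([0] * 1000 + [1] * 50)

X_res, y_res = SMOTE(random_state=0).fit_resample(X, y)          # oversample minority class
X_res, y_res = TomekLinks().fit_resample(X_res, y_res)           # remove Tomek-link pairs
X_res, y_res = EditedNearestNeighbours().fit_resample(X_res, y_res)  # clean boundary noise
print(np.bincount(y_res))
```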

3. Methodology

The MIT-BIH arrhythmia database, which is accessible online, was used to collect the study's data and observations. A total of approximately 24 hours of data were drawn from 48 half-hour, two-lead ECG recordings from 47 different subjects [7]. Every signal record was sampled at 360 Hz, and the beats were annotated by cardiologists. For data processing, the electrocardiograms (ECGs) were transformed into ECG images. The tests described in this research used the lead II annotations as a guide: "V" for premature ventricular contraction (PVC), "L" for left bundle branch block (LBBB), "N" for the standard signal rhythm (SSR), "A" for atrial premature beat (APB), "R" for right bundle branch block (RBBB), "/" for paced beat (PAB), "E" for ventricular escape beat, and "!" for ventricular flutter wave (VFW). Nodal escape beats and the few beats not recognised as one of these rhythms were excluded. Most ECG arrhythmia investigations have disregarded these beat types because of their low frequency. Figure 2 depicts the overall approach.
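As a minimal sketch, the snippet below reads one MIT-BIH record and its beat annotations with the wfdb Python package. The package, the record number (100), and the streaming access are assumptions for illustration; the paper does not specify its data-loading tooling.

```python
# Illustrative sketch of reading an MIT-BIH record and its annotations with wfdb.
import wfdb

# Record 100 streamed from PhysioNet's 'mitdb' directory (360 Hz, two leads).
record = wfdb.rdrecord("100", pn_dir="mitdb")
annotation = wfdb.rdann("100", "atr", pn_dir="mitdb")

signal = record.p_signal[:, 0]          # lead MLII samples
beat_locations = annotation.sample      # R-peak sample indices
beat_symbols = annotation.symbol        # 'N', 'V', 'L', 'R', 'A', '/', 'E', '!', ...
print(record.fs, len(beat_locations))   # 360 and the number of annotated beats
```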

3.1. Preprocessing

For each individual, the analysed ECG segment lasts around 2-3 minutes on average. We separated the signal into tens of windows based on its length. The morphology and amplitude range of the signal are not altered, so we do not clean or transform the signal at this stage. Only the R-R interval [39] is isolated during the preprocessing stage, as sketched below. The approach adopted is simple and effective even though no assumption is made about the signal. As shown in Figure 3, both signal types were fixed to a length of 188 samples, with labels 1 and 0 for abnormal and normal beats, respectively. A typical signal has a straightforward interpretation, whereas an abnormal signal may not have an explanation for such an unusual event [40]. This makes the application of ECG signal anomaly detection particularly interesting.
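The following is a minimal sketch of R-R-based beat segmentation: each beat is cut between consecutive R peaks and padded or truncated to 188 samples. The exact window boundaries and the zero-padding scheme are assumptions, not the paper's implementation.

```python
# Illustrative sketch of segmenting beats from R-peak positions and fixing their length.
import numpy as np

def segment_beats(signal: np.ndarray, r_peaks: np.ndarray, length: int = 188) -> np.ndarray:
    """Return an array of fixed-length beats, one per R-R interval."""
    beats = []
    for start, end in zip(r_peaks[:-1], r_peaks[1:]):
        beat = signal[start:end]
        if len(beat) >= length:
            beat = beat[:length]                          # truncate long beats
        else:
            beat = np.pad(beat, (0, length - len(beat)))  # zero-pad short beats
        beats.append(beat)
    return np.asarray(beats)

# Example with a synthetic signal and R peaks roughly 300 samples apart.
sig = np.random.default_rng(1).normal(size=3600)
r_peaks = np.arange(50, 3600, 300)
beats = segment_beats(sig, r_peaks)
print(beats.shape)  # (11, 188)
```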

3.2. Enhanced Data

For each ailment type, incomplete data are available because of an imbalance: the database mostly contains the most common rhythm types. Data augmentation can be used to generate additional data for the minority classes and reduce overfitting to an acceptable level despite the uneven quantity of data in each class [41]. The better the images, the more efficient the subsequent computation. Because of the loss incurred when training on raw ECG signals, the vast majority of earlier electrocardiogram rhythm studies were unable to add information through augmentation. In feed-forward neural networks (FFNNs) and support vector machines (SVMs), classifiers treat every ECG sample as having the same categorisation meaning. Many studies employ an ECG signal separation approach to split 1D ECG data into segments, increasing the number of samples when dealing with large volumes of information [42]. Figure 4 shows the various types of ECG signals that were obtained through the use of enhanced data (see text for explanation). Although the ECG data produced by the model in this work require an image enhancement strategy, it is the data computation that should be adapted rather than the enhancement technique itself. A 2D ECG image that has already been modified is combined with image processing to bring out the finer features. The augmentation is designed to vary the ECG image while leaving the corresponding label unchanged, which strengthens the original approach by compensating for the class imbalance; a minimal cropping sketch is given below.
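The sketch below illustrates cropping-based augmentation for 2D ECG images, in the spirit of the "trimming approaches" mentioned in Section 1. The number of crops, the crop size, and the corner/centre policy are assumptions for illustration.

```python
# Illustrative sketch of corner and centre crops of a square grayscale ECG image.
import numpy as np

def crop_augment(image: np.ndarray, crop: int = 96) -> list:
    """Return corner and centre crops; each crop keeps the original beat label."""
    h, w = image.shape
    c = (h - crop) // 2
    offsets = [(0, 0), (0, w - crop), (h - crop, 0), (h - crop, w - crop), (c, c)]
    return [image[r:r + crop, col:col + crop] for r, col in offsets]

# Example: a 128 x 128 image yields five 96 x 96 training crops with the same label.
img = np.random.default_rng(2).random((128, 128))
crops = crop_augment(img)
print(len(crops), crops[0].shape)  # 5 (96, 96)
```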

3.3. Fully Connected Layer [CNN-LSTM]

Deep learning is essential for both machine learning and pattern identification, and it is a data-driven subset of machine learning. During this research, eight distinct ECG signal patterns were identified and classified. A hybrid learning approach is used so that the model acquires deeper representations: the model combines CNN and LSTM components. CNNs are better suited for spatial or local patterns, while LSTMs are better suited for time-series data. Layers 1 through 9 are convolutional, and layer 10 is the LSTM layer; a fully connected layer at the end of the process improves performance. Once the spatial feature reference has been generated by the convolutional layers, the resulting patterns can be detected with the help of the subsequent LSTM layers [43]. Before the model's pooling stage, the combined CNN output has shape (None, 16, 16, 256). The input size of the LSTM layer changes when we reshape these feature maps to (256, 256), as sketched below. After extracting the LSTM's temporal properties, the model distinguishes ECG signals through the fully connected layer. Setting an optimiser and a learning rate makes it easier to tune the early stages of training; a learning rate of 0.001 was used with the chosen optimiser. Figure 5 shows the proposed network model.
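The following is a minimal Keras sketch of the reshape step described above: a (16, 16, 256) feature map is flattened into 256 time steps of 256 features and fed to an LSTM, then to the fully connected softmax. The LSTM width, the optimiser choice (Adam), and the eight-class head are assumptions or taken from this paragraph; only the reshape target and the 0.001 learning rate come directly from the text.

```python
# Minimal sketch: CNN feature maps -> reshape to (256, 256) -> LSTM -> softmax.
import tensorflow as tf
from tensorflow.keras import layers, models

inputs = layers.Input(shape=(16, 16, 256))            # CNN output before the LSTM
x = layers.Reshape((256, 256))(inputs)                 # 16*16 = 256 time steps
x = layers.LSTM(128)(x)                                # temporal features (width assumed)
outputs = layers.Dense(8, activation="softmax")(x)     # eight ECG beat patterns

model = models.Model(inputs, outputs)
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),  # optimiser assumed
              loss="categorical_crossentropy", metrics=["accuracy"])
model.summary()
```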

3.4. Architecture and Details

The core of the proposed system, built around the 2D CNN, is composed of three convolution blocks with a stride of one; this is also the most demanding component of the design. Each convolution is followed by an exponential linear unit (ELU), and a batch normalisation layer is incorporated to keep activation values consistent across batches. Each convolution block is made up of two 2D CNN layers and one batch normalisation layer. The convolution operation is obtained by multiplying the kernel with the overlapping region of the input, whatever kind of convolution is used. Max-pooling with a step size of two is applied after the 2D convolution to extract salient features from the feature map; composing the feature map removes a large portion of the finer detail, which is captured in subsequent feature maps. As the model structure is optimised, the feature map size is gradually decreased in order to get the most out of the structure in terms of learning. Finally, each feature map is passed to the LSTM layer to capture any temporal information that is present; after convolution and pooling, the features are split into numerous time steps. The LSTM's recurrent chain structure is used to model the time series. An LSTM is not exactly the same as a standard RNN: it maintains a cell state and controls it through gated units. Data passed through the network must be handled consistently and efficiently, which is why the LSTM consolidates these gates; the vanishing gradient is mitigated so that long-term dependencies can be modelled. Following the LSTM layer, a fully connected softmax layer with five output neurons is used, driven by a feature vector that describes the image through time-dependent features. The outputs of these five classes are used to predict an arrhythmia, and all of these layers are fully connected. A minimal sketch of this stack is given below.
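The sketch below follows the block layout described in this section (three blocks of two Conv2D layers with ELU activations, batch normalisation, and 2x2 max pooling, then an LSTM and a five-way softmax). Filter counts, kernel sizes, the input resolution, and the LSTM width are assumptions; only the block structure, ELU, batch normalisation, and the five output neurons come from the text.

```python
# Minimal sketch of the convolutional stack feeding the LSTM and softmax head.
from tensorflow.keras import layers, models

def conv_block(x, filters):
    # Two stride-1 2D convolutions with ELU, then batch normalisation and 2x2 max pooling.
    x = layers.Conv2D(filters, 3, strides=1, padding="same", activation="elu")(x)
    x = layers.Conv2D(filters, 3, strides=1, padding="same", activation="elu")(x)
    x = layers.BatchNormalization()(x)
    return layers.MaxPooling2D(pool_size=2)(x)

inputs = layers.Input(shape=(128, 128, 1))            # 2D grayscale ECG image (size assumed)
x = conv_block(inputs, 64)
x = conv_block(x, 128)
x = conv_block(x, 256)                                 # -> (16, 16, 256) feature maps
x = layers.Reshape((16 * 16, 256))(x)                  # treat spatial positions as time steps
x = layers.LSTM(128)(x)                                # temporal summary (width assumed)
outputs = layers.Dense(5, activation="softmax")(x)     # five output neurons, per this section

model = models.Model(inputs, outputs)
model.summary()
```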

As input to the network, feature vectors are presented frame by frame; in our work each frame uses a log-mel-style feature with 40 dimensions. Passing the signal first through several convolutional layers reduces the frequency variation of the input. The architecture of the CNN stage is as follows: we employ two convolutional layers, each with 256 feature maps. A 9 × 9 frequency-time filter is used for the first convolutional layer, followed by a 4 × 3 filter for the second convolutional layer, and these filters span the whole frequency range of the signal. Non-overlapping max pooling is used, and pooling is applied only along frequency [44]. A pooling size of three was used for the first layer, and no pooling was performed in the second layer.

Because the number of feature maps multiplied by frequency multiplied by time is large, the output of the final CNN layer is massive. As shown in Figure 6, its feature dimension is reduced by a linear layer before it reaches the LSTM layer; adding such linear layers after the CNN stage has been found helpful, as seen in [45]. During our experiments, we observed that reducing the linear layer output to 256 dimensions worked well. To model the signal in time, we then pass the CNN output through LSTM layers. Following the proposed approach, we employ two LSTM layers, each with 832 cells and a 512-unit projection layer for dimensionality reduction. For training with backpropagation through time (BPTT), the LSTM is unrolled for twenty time steps. Information from future frames helps make a more accurate prediction of the current frame, so the output target is delayed by five frames; to avoid duplicating the temporal context already captured by the CNN and to keep the decoding latency of the CLDNN low, the LSTM's own input context is set to r = 0. Finally, we pass the LSTM output to two fully connected DNN layers after the frequency and temporal modelling stages. According to [46], the higher layers provide a representation that can be separated much more efficiently at each level, as demonstrated in [47]. Each fully connected layer has 1,024 hidden units. A minimal sketch of this CNN-linear-LSTM-DNN ordering is given below.
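The following is a minimal sketch of the CNN, linear dimensionality reduction, LSTM, and DNN ordering described above. Keras's LSTM has no built-in projection layer, so the 512-unit projection is approximated here with a time-distributed dense layer; the input shape, activations, and output size are assumptions for illustration.

```python
# Minimal sketch of a CLDNN-style ordering: CNN -> linear reduction -> LSTM -> DNN.
from tensorflow.keras import layers, models

inputs = layers.Input(shape=(20, 40, 1))                       # 20 frames x 40 features (assumed)
x = layers.Conv2D(256, (9, 9), padding="same", activation="relu")(inputs)
x = layers.MaxPooling2D(pool_size=(1, 3))(x)                   # pool along frequency only
x = layers.Conv2D(256, (4, 3), padding="same", activation="relu")(x)
x = layers.Reshape((20, -1))(x)                                # back to a frame sequence
x = layers.TimeDistributed(layers.Dense(256))(x)               # linear reduction to 256 dims
x = layers.LSTM(832, return_sequences=True)(x)
x = layers.TimeDistributed(layers.Dense(512))(x)               # stand-in for the 512-unit projection
x = layers.LSTM(832)(x)
x = layers.Dense(1024, activation="relu")(x)                   # two fully connected DNN layers
x = layers.Dense(1024, activation="relu")(x)
outputs = layers.Dense(5, activation="softmax")(x)

model = models.Model(inputs, outputs)
model.summary()
```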

The ELU activation was employed in this analysis because it gave the best grouping of the ECG arrhythmia conditions studied. ELU is defined in the following equation:

$$\mathrm{ELU}(x)=\begin{cases}x, & x>0\\ \alpha\left(e^{x}-1\right), & x\le 0\end{cases}\qquad(1)$$

The mean squared error (MSE) defined in (2) is used to measure the reconstruction between the input signal $x$ and the output signal $\hat{x}$:

$$\mathrm{MSE}=\frac{1}{n}\sum_{i=1}^{n}\left(x_{i}-\hat{x}_{i}\right)^{2}\qquad(2)$$

For batch normalisation, equation (3) is computed over each mini-batch $\mathcal{B}=\{x_{1},\dots,x_{m}\}$:

$$\mu_{\mathcal{B}}=\frac{1}{m}\sum_{i=1}^{m}x_{i},\quad \sigma_{\mathcal{B}}^{2}=\frac{1}{m}\sum_{i=1}^{m}\left(x_{i}-\mu_{\mathcal{B}}\right)^{2},\quad \hat{x}_{i}=\frac{x_{i}-\mu_{\mathcal{B}}}{\sqrt{\sigma_{\mathcal{B}}^{2}+\epsilon}},\quad y_{i}=\gamma\hat{x}_{i}+\beta\qquad(3)$$
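As a small reference sketch, the snippet below computes equations (1)-(3) with NumPy. The constants (alpha, gamma, beta, epsilon) are illustrative defaults, not values taken from the paper.

```python
# Small NumPy sketch of the ELU activation, the MSE loss, and batch normalisation.
import numpy as np

def elu(x, alpha=1.0):                               # equation (1)
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))

def mse(x, x_hat):                                   # equation (2)
    return np.mean((x - x_hat) ** 2)

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):    # equation (3), per feature over the batch
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta

x = np.random.default_rng(3).normal(size=(32, 188))  # a mini-batch of 188-sample beats
print(elu(np.array([-1.0, 0.5])), mse(x, x * 0.9), batch_norm(x).std())
```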

Overfitting is a serious problem when developing a model for training purposes. To keep the training model from overfitting, dropout regularisation is employed, and we also provide comparisons with models in which dropout regularisation was not applied. During dropout regularisation, a fraction of the nodes in a given layer is randomly dropped in order to lessen the co-dependence between layers. When a neuron is dropped, its corresponding weights are temporarily excluded, which significantly improves the generalisation capacity of the model. A model without regularisation incorporates all weights into the learning process during training, which strengthens the coupling between layers and makes it prone to overfitting. Dropout regularisation was tested on the model with a single fully connected layer and was applied at the final fully connected layer, as in the sketch below. The dropout rate was 0.5.
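The following minimal sketch shows dropout with a rate of 0.5 placed at the final fully connected stage, as described above. The preceding layer sizes are placeholders, not values from the paper.

```python
# Minimal sketch: dropout (rate 0.5) before the final classification layer.
from tensorflow.keras import layers, models

head = models.Sequential([
    layers.Input(shape=(256,)),            # features from the LSTM stage (size assumed)
    layers.Dense(128, activation="elu"),   # fully connected layer (size assumed)
    layers.Dropout(0.5),                   # dropout at the final fully connected layer
    layers.Dense(5, activation="softmax"),
])
head.summary()
```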

4. Results and Discussion

In this study, the imbalanced classification of ECG signals was addressed with the CNN-LSTM arrangement in order to obtain the desired results. The ECG beat data were first classified with the LSTM model, and we then used the FCL to construct a CNN-LSTM configuration for classification. The practicality of a fully connected layer for this kind of imbalanced ECG rhythm data was immediately apparent. We established the validity of the LSTM network topology through comparison with state-of-the-art methodologies. The proposed model was based primarily on the MIT-BIH arrhythmia database. Following the AAMI standard, all MIT-BIH beats are grouped into five primary categories. This grouping is, however, not always a desirable outcome: the kind of arrhythmia is determined by the ECG beat and by how precisely the beat shapes are formed.

The PTB diagnostic ECG database was used to evaluate the outcomes obtained with the MIT-BIH arrhythmia approach. A PTB reference set was created by taking 185 subjects from the pathological records plus 25 subjects from 12 separate recording sessions and merging them together (2 from PTB and 10 from MIT-BIH). Consequently, we examine 185 examples of normal and pathological records. F1 scores are used to report the results of our research, since we take class imbalance into account when evaluating validity. Table 1 shows that the metrics for the normal class, including the F1 score, precision, and recall, improve in every situation, and the number of ECG signals labelled as normal but actually problematic changed from 16,550 to 16,575. It is clear from Table 1 that the F1 score for the minority classes has risen significantly: class S reaches an accuracy of 97.23%, while class F reaches 96.42%. Changes relative to other significant techniques used in building high-quality networks are shown in Table 2. With the dropout regulariser, we expected the latent vector to emerge more prominently while remaining as flat and stable as possible, though reconstructing the input signal in more detail comes at a cost. The data we have gathered show that regularisation frequently enhances the model and generally improves its accuracy. Table 3 shows the experimental results of the model on the MIT-BIH dataset.

Figure 6 depicts the accuracy and loss as the training and test conditions change. To ensure a steady state throughout training, the model was validated after about 35 epochs of training in this mode of operation. The following performance metrics were used to estimate the model's performance: accuracy, R-squared, root mean square error (RMSE), and computation time. After the experiments, the CNN-LSTM combined with the FCL hybrid model achieved 99.43% accuracy, an R-squared of 0.884, an RMSE of 0.18, and a 20% reduction in computation time, among other results. Figure 7 depicts the forms of heart illness that have been categorised by the suggested design. The five forms of illness can be read from this chart, together with the length of the R-R interval between peak and valley.

Table 2 shows that our suggested CNN-LSTM with FCL performs exceptionally well. As in earlier investigations, we used deep learning to establish how imbalanced ECG beat information should be categorised, and we used the LSTM with FCL to classify the imbalanced ECG arrhythmias. The categorisation of imbalanced ECG data is frequently studied [21, 43]. The most important difference is that we employ the FCL to adjust the loss function so that it focuses on ECG beats that tend to be misclassified, hence increasing the accuracy of arrhythmia classification. In terms of recall, our CNN-LSTM with FCL achieves the highest results on the dataset, which means that fewer abnormal ECG beats are incorrectly ascribed to the regular ECG beat class. A sketch of how these metrics are computed is given below.
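The snippet below is an illustrative sketch of computing the reported evaluation metrics (accuracy, precision, recall, F1 score) with scikit-learn. The label arrays are toy placeholders, not the paper's predictions.

```python
# Illustrative sketch: accuracy, macro-averaged precision, recall, and F1.
import numpy as np
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

y_true = np.array([0, 0, 1, 2, 3, 4, 1, 0, 2, 2])   # true beat classes (five classes)
y_pred = np.array([0, 0, 1, 2, 3, 4, 0, 0, 2, 1])   # model predictions

acc = accuracy_score(y_true, y_pred)
prec, rec, f1, _ = precision_recall_fscore_support(y_true, y_pred, average="macro",
                                                   zero_division=0)
print(f"accuracy={acc:.4f} precision={prec:.4f} recall={rec:.4f} f1={f1:.4f}")
```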

5. Conclusions

The initial analysis of cardiovascular disease is based on the study and differentiation of arrhythmic indications and symptoms. The interplay of CNN-LSTM and FCL was recommended in this study to enhance readiness while limiting the impact of the immense amount of class-specific ECG beat information on model training. The proposed architecture uses LSTM layers to pass the remaining variation to the DNN layers, which have a more effective feature representation than the CNN layers that reduce the spectral fluctuation in the input features. The CNN-LSTM with FCL achieved 99.43% accuracy, 96.27% precision, 94.85% recall, and a 92.85% F1 score. According to the results on the MIT-BIH arrhythmia data, the proposed design was appropriate and performed at a high level. Cardiologists could use the method outlined here to help them make more accurate and unbiased diagnoses of ECGs in telemedicine situations. Finally, future evaluations will include additional beat types with various degrees of difficulty. In addition, we propose to add controlled rates of noise to the ECG data in order to investigate the behaviour of the CNN-LSTM with FCL pattern under these conditions.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.