Applied Mathematics in Biomedical Sciences and EngineeringView this Special Issue
Research Article | Open Access
Jongshill Lee, Youngjoon Chee, Inyoung Kim, "Personal Identification Based on Vectorcardiogram Derived from Limb Leads Electrocardiogram", Journal of Applied Mathematics, vol. 2012, Article ID 904905, 12 pages, 2012. https://doi.org/10.1155/2012/904905
Personal Identification Based on Vectorcardiogram Derived from Limb Leads Electrocardiogram
We propose a new method for personal identification using the derived vectorcardiogram (dVCG), which is derived from the limb leads electrocardiogram (ECG). The dVCG was calculated from the standard limb leads ECG using the precalculated inverse transform matrix. Twenty-one features were extracted from the dVCG, and some or all of these 21 features were used in support vector machine (SVM) learning and in tests. The classification accuracy was 99.53%, which is similar to the previous dVCG analysis using the standard 12-lead ECG. Our experimental results show that it is possible to identify a person by features extracted from a dVCG derived from limb leads only. Hence, only three electrodes have to be attached to the person to be identified, which can reduce the effort required to connect electrodes and calculate the dVCG.
Human identification has potential applications in many different areas where the identity of a person needs to be determined, and to obtain even higher security levels, more complex system are required. Specific features of human beings need to be selected to recognize a person. Much work has been carried out on human face identification [1, 2]. These methods require a high-resolution computer vision system to collect facial features, which are generally anthropometric face structures. Other methods used in this area include: voice recognition  and palm recognition , with the most common being finger print identification. The human eye also contains specific features in both the retina and the iris that may be used for recognition .
Although most of these identification methods have gained wide acceptance, one of the problems with them is the fact that a specific biometric belonging to a certain person can still be used, even if the owner of the biometric is not present or has died. Therefore, many biometric hardware systems include a liveness testing measure. This can be obtained by measuring the body temperature, moisture, oxygen level, reflection or absorbance of light or other radiation, or the presence of a natural spontaneous signal such as a pulse, the contraction of a pupil in response to light, and muscular contraction in response to an electrical stimulus. In most cases, such liveness testing is difficult to measure , and still it is needed to develop the reliable and efficient method to test the “liveness” of an applicant’s biometric.
The electrocardiogram (ECG) signal is an alternative inherent liveness biometric because of the significant fact that an ECG signal does not exist if the subject is not alive. Recently, the feasibility of using ECG as a new biometric measure for personal identification has been explored. Biel et al.  explained that automatic human identification can be achieved by analyzing the 30 features extracted from a standard 12-lead ECG. Shen et al.  investigated the feasibility of ECG as a biometric by applying template matching and a decision-based neural network to the seven features extracted from a single-lead ECG. Kyoso and Uchiyama  developed a human identification engine based on the four feature parameters of a sampled ECG data sequence on a beat-to-beat basis. Israel et al.  proposed a set of ECG descriptors that characterize the trace of a heartbeat to identify a person. 15 features have been selected from each heartbeat.
All of these researchers used time intervals (e.g., P wave duration, PQ interval, QRS interval, QT interval, and so on) or amplitude as important features in their studies. These features from the time domain have some limitations as the temporal features of interval and amplitude can vary depending on variables such as the time of day of the measurement or the physical condition of the subject. Noise and positioning of the electrode can also decrease the accuracy. In contrast, the spatial features of the cardiac electric vector, represented by the vectorcardiogram (VCG), are not affected by the variables mentioned above. It is also expected that the vectorcardiographic loops will differ in shape and orientation from person to person. It is possible to identify a person by features extracted from a VCG. We have investigated the feasibility of the VCG, which is derived from a standard 12-lead ECG, as a new biometric for personal identification in our previous study , and the experimental results have shown that it is applicable to identify a person. The drawback of this approach is the considerable effort required to connect many electrodes to the person, including six leads to the chest, which is inconvenient in a real environment.
In this work, we investigated a novel approach for identifying a person using the dVCG that was derived from limb leads only. For limb-lead recording, only three electrodes are attached to the wrist and ankle, which is much easier than 12-lead recording. By comparing the performances from limb-lead recordings and from 12-lead recordings, we analyzed the feasibility of using VCG from limb-lead recording for personal identification. First, we derived a VCG from a limb-lead ECG and extracted features from the derived VCG. To remove some redundant features and to analyze the effect of each feature, we performed feature selection using the Relief-F algorithm. Finally, we performed personal identification using a support vector machine (SVM).
2. Materials and Methods
VCG is a graphic representation of the magnitude and orientation of the heart’s electrical activity during a cardiac cycle in the form of a vector loop. In contrast to ECG, which represents the electrical potential in any one single axis, VCG displays the same heart’s electrical events along two or three perpendicular axes. VCG provides a vectorial representation of the distribution of electrical potentials generated by the heart and produces loop-type patterns (Figure 1). The magnitude and orientation of the P, QRS, and vector loops are determined according to an individual heart’s characteristics. Because of the high amplitude associated with QRS, loops from the QRS complex predominate.
The electrode positions of leads for the traditional VCG are different from those of a 12-lead ECG and must first be deduced by the recording technicians. Therefore, the method for calculating VCG from a conventional 12-lead ECG is more appealing [12, 15].
2.2. Derived VCG
From a standard 12-lead ECG, the derived VCG can be easily calculated using a method based on inverse Dower matrix  as shown in (2.1). Each of the orthogonal leads, , , and used to plot the VCG are linear combinations of the eight independent leads (I, II, and –) of a standard 12-lead ECG
To derive the limb ECG from vectorcardiographic leads, Dower et al. described a method using a transform matrix where each lead (I, II, and III) in the ECG was expressed as a linear function of the leads , , and [17, 18]. The transformation matrix for the limb leads (I, II and III) is shown in (2.2).
The transformation between the vectorcardiographic and limb-lead systems is a simple matrix operation: where is the ECG signal, is the VCG signal, and is the transformation matrix.
To calculate a VCG signal from a limb-lead system, we need the inverse of , but there is no inverse matrix because is singular (). Therefore, we use the pseudo inverse (or Moore-Penrose inverse) . The pseudoinverse of can be determined by the singular value decomposition (). Because matrix has rank 2, has two positive singular values (, ) along the main diagonal extending from the upper left-hand corner, and the remaining components of are zero. Then, (the pseudo inverse matrix of )=== since and because of their orthogonality. The matrix takes the following form:
Therefore, the pseudo inverse matrix of is shown as follows: Finally, we calculated the dVCG from the limb-lead ECG using
The pseudoinverse of () is an approximation matrix because has deficient rank. Therefore, the dVCG derived from the limb leads has different patterns than the dVCG from the standard 12-lead ECG.
Because the three-dimensional space (3D) and the frontal (XY) plane of the dVCG provide useful information, such as shape and direction , as shown in Figure 2, we used the dVCG in 3D space and the frontal plane. In the frontal plane, the large vector loop (QRS vector loop) represents the QRS complex and the small vector loop ( vector loop) represents the wave of the ECG. The vector loop has such a small shape that we did not consider it.
2.3. Feature Extraction
Since the dVCG data taken from all of the recorded heartbeats produced similar patterns for each subject, the average values were taken from each beat’s dVCG trace. Twenty-one features were extracted from the dVCG data. Three features arose from the 3D space (depicted in Figure 2(a)), seven came from each QRS vector loop (depicted in Figure 2(b)) and vector loop, and the others were the differential or proportional values obtained from the QRS and vector loops.
To separate the QRS and vector loops, we needed to detect the QRS complex and wave. To detect the QRS complex, we used the QRS detection algorithm developed by Hamilton and Tompkins . To detect the wave, we used the QRS complex and the magnitude of the dVCG. As shown in Figure 3(a), the shape of the magnitude of the dVCG was segmented into the QRS complex and wave regions. Therefore, we could easily separate the wave interval by excluding the QRS region in the magnitude of the dVCG. The data shown in Figure 3(b) were achieved by calculating the region over a specific threshold after detecting the QRS complex region. Figure 3(c) shows the QRS complex and the wave region of the signal from lead II.
2.3.1. Feature Extraction from the dVCG in 3D Space
Equation (2.6) shows the dVCG represented as vector where are unit vectors with directions along the , , and axes, respectively. The magnitude of is . If this value becomes the largest value when , then the maximum value (VCGpeak), its azimuth (VCGazimuth), and its elevation (VCGelevation) angle are as shown in the following equation and Figure 2(a):
2.3.2. Feature Extraction from the QRS Vector Loop
When points on the QRS vector loop are represented as vectors on the plane, the relationship is as shown in the following equation: The magnitude of is . If this value becomes the largest when i = p, then the maximum () and the azimuth () angle are as follows: The area of a polygon whose vertices, , have the coordinates for can be calculated using (2.11)  In the summation, we assume that and . The term represents the maximum distance between each pair of points on the QRS vector loop. If two points on the QRS vector loop are and , then the distance between them is given in If this distance is at its maximum when , , then the maximum distance () and its angle () are shown as follows: Additionally, is the length of the minor axis in the QRS vector loop. Namely, is the maximum distance between the two points, where the line perpendicular to the line connecting the two points and from the previous equation meets the QRS vector loop. The six features mentioned above are depicted in Figure 2(b).
The term is the ratio of the major and minor axis on the QRS vector loop. This is represented by
2.3.3. Feature Extraction from the T Vector Loop
Similar to the cases of the QRS vector loop, the features related to the vector loop are , , , , , , and .
From these two sets of features, four additional features are acquired using the following equations:
2.4. Personal Identification Using SVM and Relief-F
Support vector machines are learning machines based on recent advances in statistical learning theory [21, 22]. Geometrically speaking, SVMs try to maximize the margin, which is the distance between the separating hyperplane and the closest data samples (the support vectors) belonging to the different classes. For multiple class problems, pairwise classification is commonly employed, which builds binary classifiers (one versus one) and takes a majority-voted class as a winner, where is the number of target classes .
To overcome the “curse of dimensionality” or to analyze the effect of each feature on classification, various feature selection methods have been introduced in the machine-learning field. Among these, the Relief-F algorithm has been successfully used in many feature selection tasks . A key idea in Relief-F is estimating the power of each feature by increasing the interclass difference and the intraclass similarity. The algorithm initially looks for the nearest hits (samples with the same class label) and misses (samples with a different class label) for a randomly selected sample. Then, it updates the following weight for each feature, , with respect to the difference between the feature values of the selected data and nearest ones
In this study, the feature selection method by the Relief-F algorithm was adopted to improve the computational efficacy and remove possible redundant features that do not contribute to the classification performance. In addition, we used a linear SVM with a pairwise coupling method as a classifier in our experiments and compared the 10-fold cross validation accuracy by eliminating the lowest-ranked features one-by-one based on the Relief-F algorithm. We took advantage of the work of Witten and Frank  and Chang and Lin  for the Relief-F method and SVM learning.
3. Experimental Results
We used a dataset of Lee et al.  to evaluate our method and compared our proposed method with that of Lee et al. These standard 12-lead ECG data were acquired for ten healthy volunteers using a CardioTouch (Bionet Co., Korea) with a sampling speed of 500 samples per second. Each recording was 10 s long and was performed when the subject was at rest. The data collection was done for three months, and almost one hundred of recordings were done per subject.
To compare our proposed method and the previous dVCG method, we extracted 21 features from dVCG12-lead (dVCG derived from a standard 12-lead system) and dVCGlimb-lead (dVCG derived from limb-lead system), respectively. These two sets of 21 features extracted from dVCG12-lead and dVCGlimb-lead were ranked using the Relief-F algorithm, and the results are shown in Table 1. Note that the notation is the output from the Relief-F algorithm, which means the relative importance of the features in terms of the ability for increasing the interclass difference and the intraclass similarity.
For the 12-lead system, the foremost values were the angle of the maximum peak value in the vector loop and the angle of the major axis in the vector loop. Next were the values of the length and the angle of the major axis in the QRS vector loop, followed by the length of the minor axis in the QRS vector loop and the size of the QRS vector loop. The difference between the size of the QRS and vector loops came next.
In the case of the limb-lead system, the highest values were the maximum peak value in 3D space of the dVCG and the area of the QRS vector loop, along with the difference between the area of the QRS and vector loops. The length of the minor axis in the QRS vector loop and the maximum peak value in the vector loop came next.
For two sets of 21 features, we performed a classification using a linear SVM with pairwise coupling method and compared the 10-fold cross validation accuracy by eliminating the lowest ranked features one-by-one. The results of the classification performance using the extracted features from a standard 12-lead and limb-lead system are denoted by the dashed and solid lines, respectively, in Figure 4.
The recognition rate using 21 features extracted from the standard 12-lead system was 99.52%, and the rate decreased as the number of features decreased. When we used only eight features, the recognition rate was 99.19%. In the case of features extracted from limb-lead system, the recognition rate of 99.53% was achieved using all 21 features and a recognition rate of 99.37% was achieved using only the top eight ranked features. These results show that when using the dVCG derived from limb leads only, we can produce an acceptable recognition rate.
4. Discussion and Conclusions
The recording of the standard 12-lead ECG to identify a person is not readily applicable in a real environment due to the inconvenience of connecting many electrodes. To solve this problem, we have studied the feasibility of personal identification based on the dVCG derived from limb leads only.
We extracted 21 features from dVCG and performed feature selection using the Relief-F algorithm to analyze the effect of each feature. Although there were differences in rank order, seven out of the eight top-ranked features in a standard 12-lead system were also top-ranked in the limb-lead system with the exception being the angle of the major axis in the QRS vector loop. The results also show that the Relief-F algorithm is a suitable algorithm for sorting the ranks among the features since the recognition rates do not fluctuate and gradually decrease as the number of features decreases.
To identify a person, we used a linear SVM as a classifier and calculated the 10-fold cross validation accuracy. The results of the comparison between the dVCG from the limb-lead ECG and 12-lead ECG indicate that it is possible to identify a person using only a limb-lead system with three electrodes instead of the standard 12 leads.
Further studies should investigate the stability of the dVCG with a subject’s various physical condition changes such as exercising, drinking, and smoking. Additionally, a large dataset including these various conditions should be used for its validation.
This paper was supported by the research fund of Hanyang University (HY-2007-I).
- W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld, “Face recognition: a literature survey,” ACM Computing Surveys, vol. 35, no. 4, pp. 399–458, 2003.
- L. F. Chen, H. Y. M. Liao, J. C. Lin, and C. C. Han, “Why recognition in a statistics-based face recognition system should be based on the pure face portion: a probabilistic decision-based proof,” Pattern Recognition, vol. 34, no. 7, pp. 1393–1403, 2001.
- A. Park and T. J. Hazen, “ASR dependent techniques for speaker identification,” in Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP '02), pp. 1337–1340, 2002.
- Y. Qiao, Z. Li, Q. Wang, Y. Zeng, and K. Liang, “Identification of palm print using dermatoglyphics analysis and detection system,” Medical Engineering and Physics, vol. 27, no. 3, pp. 229–235, 2005.
- A. Muroň, P. Koiš, and J. Pospíšil, “Identification of persons by means of the Fourier spectra of the optical transmission binary models of the human irises,” Optics Communications, vol. 192, no. 3–6, pp. 161–167, 2001.
- D. W. John, N. M. Orlans, and P. T. Higgins, Biometrics—Identify Assurance in the Information Age, McGraw-Hill, New York, NY, USA, 2003.
- L. Biel, O. Pettersson, L. Philipson, and P. Wide, “ECG analysis: a new approach in human identification,” IEEE Transactions on Instrumentation and Measurement, vol. 50, no. 3, pp. 808–812, 2001.
- T. W. Shen, W. J. Tompkins, and Y. H. Hu, “One-lead ECG for identity verification,” in Proceedings of the 2nd Joint Conference of the IEEE Engineering in Medicine and Biology Society and the Biomedical Engineering Society, pp. 62–63, October 2002.
- M. Kyoso and A. Uchiyama, “Development of an ECG identification system,” in Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 3721–3723, October 2001.
- S. A. Israel, J. M. Irvine, A. Cheng, M. D. Wiederhold, and B. K. Wiederhold, “ECG to identify individuals,” Pattern Recognition, vol. 38, no. 1, pp. 133–142, 2005.
- J. S. Lee, B. H. Cho, Y. J. Chee, I. Y. Kim, and S. I. Kim, “A new approach for personal identification based on dVCG,” IEICE Transactions on Information and Systems, vol. E91-D, no. 4, pp. 1201–1205, 2008.
- J. Carlson, R. Havmöller, A. Herreros, P. Platonov, R. Johansson, and B. Olsson, “Can orthogonal lead indicators of propensity to atrial fibrillation be accurately assessed from the 12-lead ECG?” Europace, vol. 7, no. 2, pp. S39–S48, 2005.
- I. Christov, I. Jekova, and G. Bortolan, “Premature ventricular contraction classification by the Kth nearest-neighbours rule,” Physiological Measurement, vol. 26, no. 1, pp. 123–130, 2005.
- P. Dilaveris, A. Pantazis, E. Gialafos, J. Gialafos, P. Toutouzas, and C. Stefanadis, “Assessment of ventricular repolarization alterations in subjects with early repolarization,” International Journal of Cardiology, vol. 96, no. 2, pp. 273–279, 2004.
- L. Edenbrandt, A. Jakobsson, E. Lindvall, P. O. Bitzén, and O. Pahlm, “Increased prevalence of large bites in 12-lead vectorcardiograms of diabetic patients,” Journal of Electrocardiology, vol. 30, no. 2, pp. 91–95, 1997.
- L. Edenbrandt and O. Pahlm, “Vectorcardiogram synthesized from a 12-lead ECG: superiority of the inverse Dower matrix,” Journal of Electrocardiology, vol. 21, no. 4, pp. 361–367, 1988.
- G. E. Dower, H. B. Machado, and J. A. Osborne, “On deriving the electrocardiogram from vectorcardiographic leads,” Clinical Cardiology, vol. 3, no. 2, pp. 87–95, 1980.
- S. E. Fischer, S. A. Wickline, and C. H. Lorenz, “Novel real-time R-wave detection algorithm based on the vectorcardiogram for accurate gated magnetic resonance acquisitions,” Magnetic Resonance in Medicine, vol. 42, no. 2, pp. 361–370, 1999.
- D. Zwillinger, Standard Mathematical Tables and Formulae, Chapman & Hall/CRC, Boca Raton, Fla, USA, 31st edition, 2003.
- P. S. Hamilton and W. J. Tompkins, “Quantitative investigation of QRS detection rules using the MIT/BIH arrhythmia database,” IEEE Transactions on Biomedical Engineering, vol. 33, no. 12, pp. 1157–1165, 1986.
- V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, NY, USA, 1995.
- C. J. C. Burges, “A tutorial on support vector machines for pattern recognition,” Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121–167, 1998.
- T. Hastie and R. Tibshirani, “Classification by pairwise coupling,” The Annals of Statistics, vol. 26, no. 2, pp. 451–471, 1998.
- I. Kononenko, “Estimating attributes: analysis and extensions of relief,” in Proceedings of the European Conference on Machine Learning (ECML '94), pp. 171–182, Catania, Italy, April 1994.
- I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufman, San Francisco, Calif, USA, 2nd edition, 2005.
- C. C. Chang and C. J. Lin, “LIBSVM: a library for support vector machines,” 2001, http://www.csie.ntu.edu.tw/~cjlin/libsvm.
Copyright © 2012 Jongshill Lee et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.