Research Article  Open Access
Jongshill Lee, Youngjoon Chee, Inyoung Kim, "Personal Identification Based on Vectorcardiogram Derived from Limb Leads Electrocardiogram", Journal of Applied Mathematics, vol. 2012, Article ID 904905, 12 pages, 2012. https://doi.org/10.1155/2012/904905
Personal Identification Based on Vectorcardiogram Derived from Limb Leads Electrocardiogram
Abstract
We propose a new method for personal identification using the derived vectorcardiogram (dVCG), which is derived from the limb leads electrocardiogram (ECG). The dVCG was calculated from the standard limb leads ECG using the precalculated inverse transform matrix. Twentyone features were extracted from the dVCG, and some or all of these 21 features were used in support vector machine (SVM) learning and in tests. The classification accuracy was 99.53%, which is similar to the previous dVCG analysis using the standard 12lead ECG. Our experimental results show that it is possible to identify a person by features extracted from a dVCG derived from limb leads only. Hence, only three electrodes have to be attached to the person to be identified, which can reduce the effort required to connect electrodes and calculate the dVCG.
1. Introduction
Human identification has potential applications in many different areas where the identity of a person needs to be determined, and to obtain even higher security levels, more complex system are required. Specific features of human beings need to be selected to recognize a person. Much work has been carried out on human face identification [1, 2]. These methods require a highresolution computer vision system to collect facial features, which are generally anthropometric face structures. Other methods used in this area include: voice recognition [3] and palm recognition [4], with the most common being finger print identification. The human eye also contains specific features in both the retina and the iris that may be used for recognition [5].
Although most of these identification methods have gained wide acceptance, one of the problems with them is the fact that a specific biometric belonging to a certain person can still be used, even if the owner of the biometric is not present or has died. Therefore, many biometric hardware systems include a liveness testing measure. This can be obtained by measuring the body temperature, moisture, oxygen level, reflection or absorbance of light or other radiation, or the presence of a natural spontaneous signal such as a pulse, the contraction of a pupil in response to light, and muscular contraction in response to an electrical stimulus. In most cases, such liveness testing is difficult to measure [6], and still it is needed to develop the reliable and efficient method to test the “liveness” of an applicant’s biometric.
The electrocardiogram (ECG) signal is an alternative inherent liveness biometric because of the significant fact that an ECG signal does not exist if the subject is not alive. Recently, the feasibility of using ECG as a new biometric measure for personal identification has been explored. Biel et al. [7] explained that automatic human identification can be achieved by analyzing the 30 features extracted from a standard 12lead ECG. Shen et al. [8] investigated the feasibility of ECG as a biometric by applying template matching and a decisionbased neural network to the seven features extracted from a singlelead ECG. Kyoso and Uchiyama [9] developed a human identification engine based on the four feature parameters of a sampled ECG data sequence on a beattobeat basis. Israel et al. [10] proposed a set of ECG descriptors that characterize the trace of a heartbeat to identify a person. 15 features have been selected from each heartbeat.
All of these researchers used time intervals (e.g., P wave duration, PQ interval, QRS interval, QT interval, and so on) or amplitude as important features in their studies. These features from the time domain have some limitations as the temporal features of interval and amplitude can vary depending on variables such as the time of day of the measurement or the physical condition of the subject. Noise and positioning of the electrode can also decrease the accuracy. In contrast, the spatial features of the cardiac electric vector, represented by the vectorcardiogram (VCG), are not affected by the variables mentioned above. It is also expected that the vectorcardiographic loops will differ in shape and orientation from person to person. It is possible to identify a person by features extracted from a VCG. We have investigated the feasibility of the VCG, which is derived from a standard 12lead ECG, as a new biometric for personal identification in our previous study [11], and the experimental results have shown that it is applicable to identify a person. The drawback of this approach is the considerable effort required to connect many electrodes to the person, including six leads to the chest, which is inconvenient in a real environment.
In this work, we investigated a novel approach for identifying a person using the dVCG that was derived from limb leads only. For limblead recording, only three electrodes are attached to the wrist and ankle, which is much easier than 12lead recording. By comparing the performances from limblead recordings and from 12lead recordings, we analyzed the feasibility of using VCG from limblead recording for personal identification. First, we derived a VCG from a limblead ECG and extracted features from the derived VCG. To remove some redundant features and to analyze the effect of each feature, we performed feature selection using the ReliefF algorithm. Finally, we performed personal identification using a support vector machine (SVM).
2. Materials and Methods
2.1. Vectorcardiogram
VCG have been widely investigated in the diagnosis of heart diseases, such as atrial fibrillation [12], premature ventricular contraction [13], and early ventricular repolarization [14].
VCG is a graphic representation of the magnitude and orientation of the heart’s electrical activity during a cardiac cycle in the form of a vector loop. In contrast to ECG, which represents the electrical potential in any one single axis, VCG displays the same heart’s electrical events along two or three perpendicular axes. VCG provides a vectorial representation of the distribution of electrical potentials generated by the heart and produces looptype patterns (Figure 1). The magnitude and orientation of the P, QRS, and vector loops are determined according to an individual heart’s characteristics. Because of the high amplitude associated with QRS, loops from the QRS complex predominate.
The electrode positions of leads for the traditional VCG are different from those of a 12lead ECG and must first be deduced by the recording technicians. Therefore, the method for calculating VCG from a conventional 12lead ECG is more appealing [12, 15].
2.2. Derived VCG
From a standard 12lead ECG, the derived VCG can be easily calculated using a method based on inverse Dower matrix [16] as shown in (2.1). Each of the orthogonal leads, , , and used to plot the VCG are linear combinations of the eight independent leads (I, II, and –) of a standard 12lead ECG
To derive the limb ECG from vectorcardiographic leads, Dower et al. described a method using a transform matrix where each lead (I, II, and III) in the ECG was expressed as a linear function of the leads , , and [17, 18]. The transformation matrix for the limb leads (I, II and III) is shown in (2.2).
The transformation between the vectorcardiographic and limblead systems is a simple matrix operation: where is the ECG signal, is the VCG signal, and is the transformation matrix.
To calculate a VCG signal from a limblead system, we need the inverse of , but there is no inverse matrix because is singular (). Therefore, we use the pseudo inverse (or MoorePenrose inverse) [19]. The pseudoinverse of can be determined by the singular value decomposition (). Because matrix has rank 2, has two positive singular values (, ) along the main diagonal extending from the upper lefthand corner, and the remaining components of are zero. Then, (the pseudo inverse matrix of )=== since and because of their orthogonality. The matrix takes the following form:
Therefore, the pseudo inverse matrix of is shown as follows: Finally, we calculated the dVCG from the limblead ECG using
The pseudoinverse of () is an approximation matrix because has deficient rank. Therefore, the dVCG derived from the limb leads has different patterns than the dVCG from the standard 12lead ECG.
Because the threedimensional space (3D) and the frontal (XY) plane of the dVCG provide useful information, such as shape and direction [11], as shown in Figure 2, we used the dVCG in 3D space and the frontal plane. In the frontal plane, the large vector loop (QRS vector loop) represents the QRS complex and the small vector loop ( vector loop) represents the wave of the ECG. The vector loop has such a small shape that we did not consider it.
(a)
(b)
2.3. Feature Extraction
Since the dVCG data taken from all of the recorded heartbeats produced similar patterns for each subject, the average values were taken from each beat’s dVCG trace. Twentyone features were extracted from the dVCG data. Three features arose from the 3D space (depicted in Figure 2(a)), seven came from each QRS vector loop (depicted in Figure 2(b)) and vector loop, and the others were the differential or proportional values obtained from the QRS and vector loops.
To separate the QRS and vector loops, we needed to detect the QRS complex and wave. To detect the QRS complex, we used the QRS detection algorithm developed by Hamilton and Tompkins [20]. To detect the wave, we used the QRS complex and the magnitude of the dVCG. As shown in Figure 3(a), the shape of the magnitude of the dVCG was segmented into the QRS complex and wave regions. Therefore, we could easily separate the wave interval by excluding the QRS region in the magnitude of the dVCG. The data shown in Figure 3(b) were achieved by calculating the region over a specific threshold after detecting the QRS complex region. Figure 3(c) shows the QRS complex and the wave region of the signal from lead II.
(a)
(b)
(c)
2.3.1. Feature Extraction from the dVCG in 3D Space
Equation (2.6) shows the dVCG represented as vector where are unit vectors with directions along the , , and axes, respectively. The magnitude of is . If this value becomes the largest value when , then the maximum value (VCG_{peak}), its azimuth (VCG_{azimuth}), and its elevation (VCG_{elevation}) angle are as shown in the following equation and Figure 2(a):
2.3.2. Feature Extraction from the QRS Vector Loop
When points on the QRS vector loop are represented as vectors on the plane, the relationship is as shown in the following equation: The magnitude of is . If this value becomes the largest when i = p, then the maximum () and the azimuth () angle are as follows: The area of a polygon whose vertices, , have the coordinates for can be calculated using (2.11) [19] In the summation, we assume that and . The term represents the maximum distance between each pair of points on the QRS vector loop. If two points on the QRS vector loop are and , then the distance between them is given in If this distance is at its maximum when , , then the maximum distance () and its angle () are shown as follows: Additionally, is the length of the minor axis in the QRS vector loop. Namely, is the maximum distance between the two points, where the line perpendicular to the line connecting the two points and from the previous equation meets the QRS vector loop. The six features mentioned above are depicted in Figure 2(b).
The term is the ratio of the major and minor axis on the QRS vector loop. This is represented by
2.3.3. Feature Extraction from the T Vector Loop
Similar to the cases of the QRS vector loop, the features related to the vector loop are , , , , , , and .
From these two sets of features, four additional features are acquired using the following equations:
2.4. Personal Identification Using SVM and ReliefF
Support vector machines are learning machines based on recent advances in statistical learning theory [21, 22]. Geometrically speaking, SVMs try to maximize the margin, which is the distance between the separating hyperplane and the closest data samples (the support vectors) belonging to the different classes. For multiple class problems, pairwise classification is commonly employed, which builds binary classifiers (one versus one) and takes a majorityvoted class as a winner, where is the number of target classes [23].
To overcome the “curse of dimensionality” or to analyze the effect of each feature on classification, various feature selection methods have been introduced in the machinelearning field. Among these, the ReliefF algorithm has been successfully used in many feature selection tasks [24]. A key idea in ReliefF is estimating the power of each feature by increasing the interclass difference and the intraclass similarity. The algorithm initially looks for the nearest hits (samples with the same class label) and misses (samples with a different class label) for a randomly selected sample. Then, it updates the following weight for each feature, , with respect to the difference between the feature values of the selected data and nearest ones
In this study, the feature selection method by the ReliefF algorithm was adopted to improve the computational efficacy and remove possible redundant features that do not contribute to the classification performance. In addition, we used a linear SVM with a pairwise coupling method as a classifier in our experiments and compared the 10fold cross validation accuracy by eliminating the lowestranked features onebyone based on the ReliefF algorithm. We took advantage of the work of Witten and Frank [25] and Chang and Lin [26] for the ReliefF method and SVM learning.
3. Experimental Results
We used a dataset of Lee et al. [11] to evaluate our method and compared our proposed method with that of Lee et al. These standard 12lead ECG data were acquired for ten healthy volunteers using a CardioTouch (Bionet Co., Korea) with a sampling speed of 500 samples per second. Each recording was 10 s long and was performed when the subject was at rest. The data collection was done for three months, and almost one hundred of recordings were done per subject.
To compare our proposed method and the previous dVCG method, we extracted 21 features from dVCG_{12lead} (dVCG derived from a standard 12lead system) and dVCG_{limblead} (dVCG derived from limblead system), respectively. These two sets of 21 features extracted from dVCG_{12lead} and dVCG_{limblead} were ranked using the ReliefF algorithm, and the results are shown in Table 1. Note that the notation is the output from the ReliefF algorithm, which means the relative importance of the features in terms of the ability for increasing the interclass difference and the intraclass similarity.

For the 12lead system, the foremost values were the angle of the maximum peak value in the vector loop and the angle of the major axis in the vector loop. Next were the values of the length and the angle of the major axis in the QRS vector loop, followed by the length of the minor axis in the QRS vector loop and the size of the QRS vector loop. The difference between the size of the QRS and vector loops came next.
In the case of the limblead system, the highest values were the maximum peak value in 3D space of the dVCG and the area of the QRS vector loop, along with the difference between the area of the QRS and vector loops. The length of the minor axis in the QRS vector loop and the maximum peak value in the vector loop came next.
For two sets of 21 features, we performed a classification using a linear SVM with pairwise coupling method and compared the 10fold cross validation accuracy by eliminating the lowest ranked features onebyone. The results of the classification performance using the extracted features from a standard 12lead and limblead system are denoted by the dashed and solid lines, respectively, in Figure 4.
The recognition rate using 21 features extracted from the standard 12lead system was 99.52%, and the rate decreased as the number of features decreased. When we used only eight features, the recognition rate was 99.19%. In the case of features extracted from limblead system, the recognition rate of 99.53% was achieved using all 21 features and a recognition rate of 99.37% was achieved using only the top eight ranked features. These results show that when using the dVCG derived from limb leads only, we can produce an acceptable recognition rate.
4. Discussion and Conclusions
The recording of the standard 12lead ECG to identify a person is not readily applicable in a real environment due to the inconvenience of connecting many electrodes. To solve this problem, we have studied the feasibility of personal identification based on the dVCG derived from limb leads only.
We extracted 21 features from dVCG and performed feature selection using the ReliefF algorithm to analyze the effect of each feature. Although there were differences in rank order, seven out of the eight topranked features in a standard 12lead system were also topranked in the limblead system with the exception being the angle of the major axis in the QRS vector loop. The results also show that the ReliefF algorithm is a suitable algorithm for sorting the ranks among the features since the recognition rates do not fluctuate and gradually decrease as the number of features decreases.
To identify a person, we used a linear SVM as a classifier and calculated the 10fold cross validation accuracy. The results of the comparison between the dVCG from the limblead ECG and 12lead ECG indicate that it is possible to identify a person using only a limblead system with three electrodes instead of the standard 12 leads.
Further studies should investigate the stability of the dVCG with a subject’s various physical condition changes such as exercising, drinking, and smoking. Additionally, a large dataset including these various conditions should be used for its validation.
Acknowledgment
This paper was supported by the research fund of Hanyang University (HY2007I).
References
 W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld, “Face recognition: a literature survey,” ACM Computing Surveys, vol. 35, no. 4, pp. 399–458, 2003. View at: Publisher Site  Google Scholar
 L. F. Chen, H. Y. M. Liao, J. C. Lin, and C. C. Han, “Why recognition in a statisticsbased face recognition system should be based on the pure face portion: a probabilistic decisionbased proof,” Pattern Recognition, vol. 34, no. 7, pp. 1393–1403, 2001. View at: Publisher Site  Google Scholar
 A. Park and T. J. Hazen, “ASR dependent techniques for speaker identification,” in Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP '02), pp. 1337–1340, 2002. View at: Google Scholar
 Y. Qiao, Z. Li, Q. Wang, Y. Zeng, and K. Liang, “Identification of palm print using dermatoglyphics analysis and detection system,” Medical Engineering and Physics, vol. 27, no. 3, pp. 229–235, 2005. View at: Publisher Site  Google Scholar
 A. Muroň, P. Koiš, and J. Pospíšil, “Identification of persons by means of the Fourier spectra of the optical transmission binary models of the human irises,” Optics Communications, vol. 192, no. 3–6, pp. 161–167, 2001. View at: Publisher Site  Google Scholar
 D. W. John, N. M. Orlans, and P. T. Higgins, Biometrics—Identify Assurance in the Information Age, McGrawHill, New York, NY, USA, 2003.
 L. Biel, O. Pettersson, L. Philipson, and P. Wide, “ECG analysis: a new approach in human identification,” IEEE Transactions on Instrumentation and Measurement, vol. 50, no. 3, pp. 808–812, 2001. View at: Publisher Site  Google Scholar
 T. W. Shen, W. J. Tompkins, and Y. H. Hu, “Onelead ECG for identity verification,” in Proceedings of the 2nd Joint Conference of the IEEE Engineering in Medicine and Biology Society and the Biomedical Engineering Society, pp. 62–63, October 2002. View at: Google Scholar
 M. Kyoso and A. Uchiyama, “Development of an ECG identification system,” in Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 3721–3723, October 2001. View at: Google Scholar
 S. A. Israel, J. M. Irvine, A. Cheng, M. D. Wiederhold, and B. K. Wiederhold, “ECG to identify individuals,” Pattern Recognition, vol. 38, no. 1, pp. 133–142, 2005. View at: Publisher Site  Google Scholar
 J. S. Lee, B. H. Cho, Y. J. Chee, I. Y. Kim, and S. I. Kim, “A new approach for personal identification based on dVCG,” IEICE Transactions on Information and Systems, vol. E91D, no. 4, pp. 1201–1205, 2008. View at: Publisher Site  Google Scholar
 J. Carlson, R. Havmöller, A. Herreros, P. Platonov, R. Johansson, and B. Olsson, “Can orthogonal lead indicators of propensity to atrial fibrillation be accurately assessed from the 12lead ECG?” Europace, vol. 7, no. 2, pp. S39–S48, 2005. View at: Publisher Site  Google Scholar
 I. Christov, I. Jekova, and G. Bortolan, “Premature ventricular contraction classification by the Kth nearestneighbours rule,” Physiological Measurement, vol. 26, no. 1, pp. 123–130, 2005. View at: Publisher Site  Google Scholar
 P. Dilaveris, A. Pantazis, E. Gialafos, J. Gialafos, P. Toutouzas, and C. Stefanadis, “Assessment of ventricular repolarization alterations in subjects with early repolarization,” International Journal of Cardiology, vol. 96, no. 2, pp. 273–279, 2004. View at: Publisher Site  Google Scholar
 L. Edenbrandt, A. Jakobsson, E. Lindvall, P. O. Bitzén, and O. Pahlm, “Increased prevalence of large bites in 12lead vectorcardiograms of diabetic patients,” Journal of Electrocardiology, vol. 30, no. 2, pp. 91–95, 1997. View at: Google Scholar
 L. Edenbrandt and O. Pahlm, “Vectorcardiogram synthesized from a 12lead ECG: superiority of the inverse Dower matrix,” Journal of Electrocardiology, vol. 21, no. 4, pp. 361–367, 1988. View at: Google Scholar
 G. E. Dower, H. B. Machado, and J. A. Osborne, “On deriving the electrocardiogram from vectorcardiographic leads,” Clinical Cardiology, vol. 3, no. 2, pp. 87–95, 1980. View at: Google Scholar
 S. E. Fischer, S. A. Wickline, and C. H. Lorenz, “Novel realtime Rwave detection algorithm based on the vectorcardiogram for accurate gated magnetic resonance acquisitions,” Magnetic Resonance in Medicine, vol. 42, no. 2, pp. 361–370, 1999. View at: Publisher Site  Google Scholar
 D. Zwillinger, Standard Mathematical Tables and Formulae, Chapman & Hall/CRC, Boca Raton, Fla, USA, 31st edition, 2003.
 P. S. Hamilton and W. J. Tompkins, “Quantitative investigation of QRS detection rules using the MIT/BIH arrhythmia database,” IEEE Transactions on Biomedical Engineering, vol. 33, no. 12, pp. 1157–1165, 1986. View at: Google Scholar
 V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, NY, USA, 1995.
 C. J. C. Burges, “A tutorial on support vector machines for pattern recognition,” Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121–167, 1998. View at: Google Scholar
 T. Hastie and R. Tibshirani, “Classification by pairwise coupling,” The Annals of Statistics, vol. 26, no. 2, pp. 451–471, 1998. View at: Publisher Site  Google Scholar  Zentralblatt MATH
 I. Kononenko, “Estimating attributes: analysis and extensions of relief,” in Proceedings of the European Conference on Machine Learning (ECML '94), pp. 171–182, Catania, Italy, April 1994. View at: Google Scholar
 I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, Morgan Kaufman, San Francisco, Calif, USA, 2nd edition, 2005.
 C. C. Chang and C. J. Lin, “LIBSVM: a library for support vector machines,” 2001, http://www.csie.ntu.edu.tw/~cjlin/libsvm. View at: Google Scholar
Copyright
Copyright © 2012 Jongshill Lee et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.