Abstract

Heart rate variability (HRV) is a useful clinical tool for autonomic function assessment and cardiovascular diseases diagnosis. It is traditionally calculated from a dedicated medical electrocardiograph (ECG). In this paper, we demonstrate that HRV can also be extracted from photoplethysmograms (PPG) obtained by the camera of a smartphone. Sixteen HRV parameters, including time-domain, frequency-domain, and nonlinear parameters, were calculated from PPG captured by a smartphone for 30 healthy subjects and were compared with those derived from ECG. The statistical results showed that 14 parameters (AVNN, SDNN, CV, RMSSD, SDSD, TP, VLF, LF, HF, LF/HF, nLF, nHF, SD1, and SD2) from PPG were highly correlated (, ) with those from ECG, and 7 parameters (AVNN, TP, VLF, LF, HF, nLF, and nHF) from PPG were in good agreement with those from ECG within the acceptable limits. In addition, five different algorithms to detect the characteristic points of PPG wave were also investigated: peak point (PP), valley point (VP), maximum first derivative (M1D), maximum second derivative (M2D), and tangent intersection (TI). The results showed that M2D and TI algorithms had the best performance. These results suggest that the smartphone might be used for HRV measurement.

1. Introduction

The heart rate (HR) of human is not constant but varies from one heartbeat to the next. Heart rate variability (HRV) is the physiological phenomenon of tiny fluctuations in the time intervals between heartbeats. It reflects the tenseness and the balance of the sympathetic and the vagus nerve activities and their effects on cardiovascular motion [1]. It is a noninvasive method for assessing the autonomic functions [1]. Numerous publications have validated that abnormal changes of HRV are related to several cardiological and noncardiological diseases like myocardial infarction, diabetic neuropathy, myocardial dysfunction, and tetraplegia [2]. Therefore, HRV is of significant importance and is widely used in clinical application.

HRV is traditionally determined by digital processing of electrocardiograms (ECG). The R-wave peaks of QRS complexes in ECGs are detected by computer algorithms and R-to-R intervals (RRI) are calculated. Then, HRV parameters are computed using time-domain, frequency-domain, and nonlinear methods [2]. However, the traditional measurement of ECG has several limitations: (1) ECG instruments generally require three electrodes attached to specific anatomical positions, which limit the subjects’ activities and make them uncomfortable; (2) the electrodes may cause skin irritations for some special subjects with allergies; (3) the ECG instruments are usually operated by specially trained nurses in the hospital and are not suitable for daily use at home. Therefore, new technologies have been developed to measure HRV without ECG, such as photoplethysmography (PPG) [35], Finapres (continuous blood pressure monitoring) [6, 7], impedance plethysmography [8, 9], ballistocardiography [10], optical vibrocardiography [11], a microwave sensor [12], or a webcam [13, 14]. In this paper, we demonstrate that HRV can also be extracted from a smartphone.

Mobile phones have already shown promising applications in healthcare service [15]. As the new generation smartphones are becoming more powerful and more popular and with more built-in sensors, the smartphone-based healthcare applications are being rapidly developed [16]. In recent years, a new method was proposed to acquire PPG signals from the built-in camera of a smartphone, in which one only needs to press a finger on the camera lens and capture a video record with the built-in LED flash on [17]. This method is based on the principle that the intensity changes of video frames are associated with the variations of light absorption of blood. When the heart systoles and the capillaries in the fingertip are full of blood, more light is absorbed and the frame becomes darker; likewise, when the heart diastoles and the capillaries in the fingertip are full of less blood, less light is absorbed and the frame becomes brighter. PPG signals can thus be obtained by calculating the intensity changes of the frames, and then several physiological parameters such as HR [1721], respiratory rate [19], pulse volume [21], and oxygen saturation [19] can be extracted from the PPG signals. With the addition of a microphone to detect the phonocardiogram signal, blood pressure can also be estimated [22]. This method is simple, low-cost, and easy-to-use, with great potential to be used in the healthcare service in the future.

However, to our best knowledge, the extraction of HRV from the smartphone PPG signals has not been well investigated, especially compared with ECG—the golden standard. Therefore, we comprehensively studied the extraction of HRV from smartphone PPG signals and compared the results with an ECG in order to assess the accuracy. Specifically, we used five algorithms to detect the characteristic points of the smartphone PPG signals: peak point (PP), valley point (VP), maximum first derivative (M1D), maximum second derivative (M2D), and tangent intersection (TI). The performances of these algorithms were also compared.

2. Methods

2.1. Data Acquisition

The experiment was approved by the Institutional Review Board of Shenzhen Institutes of Advanced Technology (registration number: SIAT-IRB-140215-H0040). Thirty subjects participated in the experiment (20 males and 10 females, age: 20–32 years, height: 150–183 cm, and weight: 40–90 kg). All the subjects were healthy and provided their informed consent. They were asked to refrain from caffeine, alcohol, cigarette, or strenuous exercise for 2 hours prior to the study.

In the experiment, all the subjects were instructed to lie in the supine position on a mattress and place their right index finger on the camera lens of an HTC S510e smart phone with the built-in LED flash turned on. A camera application (APP) in the smart phone was used to record the video of the fingertip with a resolution of 320 × 240 pixels at an unfixed sampling rate of 20–30 frames per second (fps). The sampling rate is unfixed due to the CPU processing load. ECG electrodes in the standard configuration were attached to the subjects to measure the ECG signals with a Finometer MIDI (Model II, Finapres Medical Systems B.V., The Netherlands). The ECG signals were digitalized at 200 Hz and automatically stored in the computer by BeatScope Easy software (Finapres Medical Systems B.V., The Netherlands). The experiment lasted at least 5 minutes for each subject and the subject was asked to keep still during this period.

2.2. Smartphone PPG Processing

All the data were processed offline. The 3GP format videos recorded by the HTC S510e smart phone were converted into AVI format using Pazera Free 3GP to AVI Converter 1.3 (). All further analysis was performed on the AVI videos in MatLab 7.0 (The Mathworks Inc., USA).

First, an 80 × 80 pixel region in the center of the video image was selected as the region of interesting (ROI). Then, the average intensity of the red channel in the ROI for each individual frame was calculated to generate a time-series waveform (the raw PPG signal). The red channel was chosen because the intensity values of the green and blue channels were often tending to zero and contained no valuable information in most situations. As the smartphone PPG worked in the reflection mode, the generated waveform should be inverted to “normal” mode for further processing [23].

The raw PPG signals were often corrupted by random noise, baseline drifting, and baseline abrupt changes (increase/decrease). Baseline abrupt changes were possibly caused by sudden moves of the finger or sudden changes of the light illumination, or by other unknown reasons. They could not be completely removed by general digital filters as they contained wide-band frequency components. We used a statistics method to solve this problem. First, we calculated the difference of the raw signal and then removed the outliers out of the range mean ± 5 × standard deviation (SD) and interpolated new values using cubic spline interpolation. At last, we reconstructed the new PPG signal by summation, the inverse of the difference. The range mentioned above was determined empirically, which meant that the probability of the outliers was 5.7330 × 10−7 if the difference of the PPG was normally distributed. It was the best range according to our data and could be adjusted if required.

The random noise and baseline drifting were reduced by a zero-phase Butterworth low-pass filter with cutoff frequency of 10 Hz and a zero-phase Butterworth high-pass filter with cutoff frequency of 0.5 Hz, respectively. Zero phase filters were implemented by filtering the signal both forward and backward to eliminate phase distortion.

The PPG signals were then resampled to 800 Hz with cubic spline interpolation to increase the temporal resolution. For each cardiac circle, five algorithms were used to obtain the pulse-to-pulse interval (PPI) by detection of five different characteristic points, as illustrated in Figure 2.

(i) Maximum First Derivative (M1D). It is the location of the maximum value of the first derivative of the PPG signal [24]. The derivative is calculated by a five-point central difference equation (1).

(ii) Peak Point (PP). It is the location of the maximum amplitude in the PPG signal following the M1D point [24].

(iii) Valley Point (VP). It is the location of the minimum amplitude in the PPG signal preceding the M1D point [24].

(iv) Maximum Second Derivative (M2D). It is the location of the maximum value of the second derivative of the PPG signal [25]. The second derivative is calculated using a five-point central difference equation (1) and a subsequent seven-point central difference equation (2). Considerwhere is the input signal and is the output signal.

(v) Tangent Intersection (TI). It is the location of the intersection of the tangent line at the M1D point and the horizontal line passing the valley point [25]. The tangent line is the fitted line of five points centered at the M1D point.

2.3. ECG Processing

The ECG signals were first passed through a finite impulse response (FIR) low-pass filter with cutoff frequency of 11 Hz and then a FIR high-pass filter with cutoff frequency of 5 Hz to reduce most of the noise and interference [26]. Thereafter, they were resampled to 800 Hz with cubic spline interpolation to increase the temporal resolution. R-wave peak detection was performed using Pan and Tompkins’ algorithm [26] and RRIs were obtained as the difference of successive R-wave peak locations. For both PPG and ECG signals, missed beats and false beats were manually identified and adjusted. An example of the obtained RRI and PPIs is shown in Figure 3.

2.4. HRV Parameters Calculation

Table 1 lists some commonly used HRV parameters. These parameters often refer to a professional term, NN intervals (“normal-to-normal intervals”), which means that only regular heartbeats should be considered [2]. Therefore, ectopic beats of the RRI/PPI series were removed and replaced by cubic spline interpolation before HRV parameters calculation.

(i) Time-Domain Parameters. Seven parameters were calculated from RRI and PPI series in time domain: AVNN, CV, SDNN, RMSSD, SDSD, NN50, and pNN50, as described in Table 1.

(ii) Frequency-Domain Parameters. The RRI and PPI series were evenly resampled at 4 Hz using cubic spline interpolation, and the DC component was removed by subtracting the mean of the series. Then, a 16th order autoregressive (AR) model was employed to estimate the power spectral density. Seven parameters were calculated: TP, VLF, LF, HF, LF/HF, nLF, and nHF, as described in Table 1. The ranges of different frequency bands were in accordance with the standard definition [2].

(iii) Poincaré Parameters. The Poincaré plot is one of the most widely used methods for nonlinear HRV analysis. It is a plot of each RRI/PPI against its previous one. Two parameters were calculated from the Poincaré plot, SD1 and SD2, as described in Table 1.

2.5. Statistics Analysis

HRV parameters derived from smartphone PPG were compared with the corresponding parameters derived from ECG. The Pearson correlation coefficients were calculated and the linear regression equations were obtained. A value < 0.05 was considered statistically significant.

The agreement between the two devices (smartphone and ECG) was assessed using Bland-Altman method [27]. The limit of agreement (LOA) was defined as bias ± 1.96 × SD (3)–(5) [27] and a Bland-Altman ratio (BAR) was defined as the ratio of half the range of limits of agreement to the mean of the pairwise measurement means (6). Agreements were ranked as good (BAR < 10%), moderate (10% BAR < 20%), or insufficient (BAR 20%) [28]. Acceptable limit (AL) of agreement was defined as 20% of the mean of the pairwise measurement means (7), as there are wide interindividual variations for HRV measurement [2] and a limit greater than 20% is generally considered unacceptable. Consider where and are the HRV parameters derived from the ECG and the smartphone, respectively, and is the number of subjects.

3. Results

3.1. Agreement Analysis

Table 2 shows the Pearson correlation coefficients and linear regression equations between HRV parameters derived from the smartphone and the ECG. It was found that the correlation coefficients were >0.6 for all parameters except NN50 and pNN50. For time-domain parameters, the correlation coefficients of AVNN, CV, and SDNN were higher than those of RMSSD and SDSD. For frequency-domain parameters, all parameters showed strong correlations (, ). For nonlinear parameters, SD2 exhibited higher correlation (, ) than SD1 (, ). Nevertheless, a good correlation does not mean a good agreement as either adding a constant or multiplying a factor will still yield a good correlation. A good method for assessing agreement is Bland-Altman analysis.

Table 3 shows the Bland-Altman analysis of HRV parameters derived from the smartphone and the ECG. For the sake of simplicity, we speak of good/moderate agreement if three of the five algorithms were in good/moderate agreement. It was found that all the time-domain parameters showed insufficient agreements (BAR 20%), but the AVNN showed excellent agreement (BAR < 1%), indicating that smartphone-derived HR can be a surrogate of ECG-derived HR. This result was in line with Gregoski et al.’s [20] and Matsumura and Yamakoshi’s [21]. It was also found that all the frequency-domain parameters were in moderate agreement (BAR < 20%) except for TP and nLF which were in good agreement (BAR < 10%). TP, VLF, HF, and nHF were overestimated (bias > 0), while LF, LF/HF, and nLF were underestimated (bias < 0), implying that the smartphone-derived HRV contains more noise, which can be observed in detail in Figure 1. For nonlinear parameters, SD2 showed good agreement (BAR < 10%) and SD1 showed insufficient agreement (BAR 20%).

As shown in Table 3, a total number of 7 parameters (AVNN, TP, VLF, LF, HF, nLF, and nHF) were within the acceptable limits. Figure 4 shows the Bland-Altman plots of different frequency components of HRV which are commonly used for assessing the autonomic functions. It was found that the limits of agreement for LF, HF, nLF, and nHF were all within their corresponding acceptable limits, meaning that the discrepancies between the smartphone PPG and the ECG for LF, HF, nLF, and nHF were not considerable. It was also found that the lower limit of agreement for LF/HF is out of the range of acceptable limits.

3.2. Algorithms Comparison

In terms of both bias and SD, we analyzed these data satisfying the condition BAR < 20% in Table 3 to evaluate the performance of the five algorithms mentioned above. For each HRV parameter, the best two algorithms with the least bias or SD would gain a star “*” each. The overall performance was graded according to the number of total stars. As shown in Table 4, the M2D and the TI algorithms were better than the others, the PP and the VP were the worst, and the M1D was in the middle.

4. Discussion

As previously mentioned, no researches have been reported to measure HRV with smartphone PPG. Nevertheless, many researches have been reported to measure HRV with traditional PPG (tPPG, i.e., a pulse oximeter), which can provide valuable information for our work. A good review of tPPG-derived HRV can be found in [28], where the authors commented that it was controversial whether the tPPG-derived HRV was a surrogate of the ECG-derived HRV. Although a number of publications reported universally good agreement between tPPG-derived HRV and ECG-derived HRV for all parameters, many other studies found that short-term parameters were more susceptible to disagreement between tPPG and ECG than long-term parameters. Our results are in accordance with the latter. As shown in Table 2, the coefficients of long-term parameters (AVNN, SDNN, and CV) are higher than the short-term parameters (RMSSD, SDSD, NN50, and pNN50). As shown in Table 3, the low-frequency parameters are in higher agreement than the high-frequency parameters, as the bias of LF is less than that of the HF and their BARs have no significant difference. The nonlinear parameter SD2 presenting the level of long-term HRV is in good agreement between ECG and smartphone PPG while SD1 presenting the level of short-term HRV is in insufficient agreement.

The detection of the characteristic points is also an impact factor for the accuracy of smartphone-derived HRV measurement [4]. As the morphology of pulse wave changes over time, an algorithm less sensitive to morphology variation will produce better accuracy. We graded these five algorithms: PP, VP, M1D, M2D, and TI. Our results indicate that M2D and TI algorithms are better than the others, which are in accordance with Chiu et al.’s [25], but they are slightly different from Posada-Quintero et al.’s [29]. Posada-Quintero et al. stated that the TI algorithm was better than the VP and M2D algorithms. This slight difference is possibly due to different algorithm details and different evaluation criteria. Another interesting research found that PP was more sensitive to waveform distortion than VP and M1D when the peripheral pulse was affected by cold temperature [24]. Overall, M2D and TI algorithms are better than PP, VP, and M1D. The fact that many researchers used the PP algorithm is worthy of attention [3, 4, 30, 31]. In addition, none of the algorithms is perfect or error-free. A manual correction is usually required by visual inspection on the computer screen, which is time-consuming. More efficiency algorithms are needed.

A possible consideration of the smartphone-based HRV analysis is the sampling rate. Our HTC smartphone has a sampling rate of approximately 20–30 Hz that may be considered not suitable for HRV analysis. In fact, the spectrum of the pulse signals has the vast majority power in the range of 0~10 Hz [32]. A sampling rate of 20 Hz is not less than the Nyquist rate and the temporal resolution can be improved by interpolation, which was confirmed by Sun et al.’s experiments in [14] where they compared different sampling rates of 200, 100, 50, and 20 fps and their results revealed no significant differences of these sampling rates. Moreover, as the new generation smartphones have more powerful CPUs and higher speed cameras, the sampling rate will be improved for better performance.

The color channel should be also considered. In the processing of the recorded video, most previous studies calculated the intensity of the green channel in RGB color model [1719], because (oxy)haemoglobin absorbs more green light than red light and penetrates sufficiently deep into the skin as compared to blue light [33]. We chose the red channel over the green or blue channel, because we observed that the pixel values in green and blue channels were tending to zero and the changes of the red channel are more pronounced than the green and blue channels in most situations. Chandrasekaran et al. also observed that the prominent color was red and they used the red channel in their research [22]. Grimaldi et al. demonstrated that the distribution of the pixels in the green channel was not uniform for different models of the smartphones and the only channel that had similar characteristics was the red one [34]. This suggests that heterogeneous characteristics of different cameras in different smartphones should be taken into account and more robust algorithms are required. Some potential alternatives are principal component analysis (PCA), independent component analysis (ICA), or using other color models (e.g., CMY color model, HSI color model, and YUV color model).

Motion artifacts are another complicated problem and are tough to deal with. To our best knowledge, none of the reported studies have solved this problem very well. In our experiments, the subjects were instructed to lie on a mattress and keep their fingers as still as possible to minimize the motion artifacts. This is not practical in daily life as short-time HRV testing usually takes 5 minutes that seems so long time for keeping still. Therefore, efficient motion-resistant algorithms are required. Several motion artifacts detection algorithms in pulse oximeters could be applied in smartphone-based HRV analysis [35, 36].

5. Conclusion

Traditional ECG recordings require electrodes attached to body surface and are operated by specially trained nurses in the hospital. The new smartphone-based technology requires no more than placing a finger on the camera lens of a smartphone. It is low-cost and easy-to-use and can be used in daily life out of hospital. In the present study, we quantitatively investigated the measurement of HRV based on smartphone technology and compared the results with those derived from a standard ECG to assess the accuracy. The results suggest that the smartphone can be of potential use for HRV measurement at resting and would be applied in low-cost healthcare applications.

Conflict of Interests

The authors declare that they have no conflict of interests.

Acknowledgments

This work was supported in part by the National Basic Research Program 973 (no. 2010CB732606), the National Natural Science Foundation of China (no. 61401453), the Guangdong Innovation Research Team Fund for Low-cost Healthcare Technologies in China, the External Cooperation Program of the Chinese Academy of Sciences (no. GJHZ1212), the Key Lab for Health Informatics of Chinese Academy of Sciences, the Peacock Program to Attract Overseas High-Caliber Talents to Shenzhen, and the Shenzhen City Government (nos. KFJ-EW-STS-097 and KFJ-EW-STS-095).