A novel pupil tracking method is proposed by combining particle filtering and Kalman filtering for the fast and accurate detection of pupil target in an active infrared source gaze tracking system. Firstly, we utilize particle filtering to track pupil in synthesis triple-channel color map (STCCM) for the fast detection and develop a comprehensive pupil motion model to conduct and analyze the randomness of pupil motion. Moreover, we built a pupil observational model based on the similarity measurement with generated histogram to improve the credibility of particle weights. Particle filtering can detect pupil region in adjacent frames rapidly. Secondly, we adopted Kalman filtering to estimate the pupil parameters more precisely. The state transitional equation of the Kalman filtering is determined by the particle filtering estimation, and the observation of the Kalman filtering is dependent on the detected pupil parameters in the corresponding region of difference images estimated by particle filtering. Tracking results of Kalman filtering are the final pupil target parameters. Experimental results demonstrated the effectiveness and feasibility of this method.

1. Introduction

Gaze tracking is the technology to get gaze direction or gaze point on the computer screen through mechanical, electronic, optical, and other methods, which can be classified into two different types called the intrusive and the nonintrusive. Gaze tracking is widely used in various applications [1, 2] such as “human computer interaction for disabled people,” “virtual reality,” “vehicle driver assistance,” “human behavior study.” Recently, gaze tracking based on the analysis of digital video (video oculography (VOG)) is becoming a popular research topic.

In VOG system, computer vision is used to capture human face images and detect eye features. From these features, the gaze parameters can be extracted to acquire the gaze direction or gaze points. When human eyeball rotates, namely, gaze direction changes, some eye features such as the corner of the eye remain unchanged. Some other features such as pupil center, however, will change correspondingly. In this case, gaze parameters are produced between the changed features and unchanged features, which are used for describing gaze change. The VOG gaze tracking technology has aroused increasing interests of professionals in research and development due to its weak interference to people, simple operation, and high accuracy.

VOG gaze tracking commonly uses pupil center cornea reflection (PCCR) technique based on the active IR illumination. The PCCR method applies an infrared light source to produce cornea reflection (Purkinje spot) and calculates the vector from cornea reflection to the pupil center in captured images. Therefore, the PCCR method can estimate gaze direction through eye structure model or mapping model [36]. Thus, VOG gaze tracking is composed of two components: gaze feature parameter extraction and gaze direction estimation.

Eye feature detection in PCCR within VOG system consists of two processes: pupil segmentation and Purkinje location in the neighborhood of pupil. In the existing references, the method of differentiating of bright pupil and dark pupil images is utilized widely in the active illuminator (light source) system which is made up of two concentric rings of IR LEDS whose center coincides with the camera optical axis [35]. The inner ring LED light source is close to the camera optical axis for producing bright pupil image, which is just like “red eye” in some photographs. The LED light source of outer ring is used to produce Purkinje spot on the surface of the user’s eyeball. The inner ring and outer ring LED light alternately to produce bright pupil image and dark pupil image in adjacent frames, which lead to the background elimination via differentiating bright pupil and dark pupil images of adjacent even and odd frames. As a result, pupil is more apparent and easy to be found in the whole face image [68]. In differentiating images of bright pupil and dark pupil, threshold method is usually used to segment pupil in local eye region [911]. Images of dark pupil, bright pupil, and difference pupil are shown in Figures 1(a), 1(b), and 1(c), respectively.

The extraction of gaze parameters within VOG system mostly relies on pupil location, which is accomplished in two adjacent frames of bright and dark pupils. Operational steps for locating the pupil must be repeated in every two bright pupil and dark pupil frames of video sequence to ensure the accurate gaze parameters. But there will be some drawbacks, shown below, if we choose to detect pupil by scanning each whole image.(1)A large amount of required calculation to locate pupil results in the worse real time performance of gaze direction estimation.(2)Since the historical information of pupil motion cannot be used to eliminate the influences of some factors such as eye blinking and external light, it is significantly difficult to extract gaze parameters robustly and get gaze direction estimation accurately.

Under some special conditions, such as in the case of eye blinking or eye slightly closed, it is hard to obtain precise pupil parameters by pupil segmentation directly without using the historical information of pupil motion. In order to solve the above two issues, after locating pupil in the initial frame, pupil tracking should be carried out in the following video sequence. Based on pupil tracking results, we can locate and accurately segment pupil to extract pupil parameters. In video sequences, target tracking can be categorized into two different types.(1)Tracking before detection [1214]; that is, determine the location of target through tracking algorithm and then finely segment and detect target in the tracking region.(2)Tracking after detection [15]; that is, estimate the detected target parameters through tracking algorithm using the historical information of target, which can get more precisely stable target parameters.

Currently, most pupil target tracking algorithms in the gaze tracking system implemented in two adjacent frames focus on the pupil fast detection to determine the location of pupil. This way, which can be called tracking before detection, would result in avoiding detecting pupil in the whole image and improving real-time pupil detection. Pupil tracking can be completed and achieved by various proposed techniques such as Kalman filtering, mean shift, and combination of Kalman filtering and “mean shift” [11, 16, 17]. In reality, the fact is that pupil occurs and moves randomly and often disappears especially in some special conditions like eye blinking, so it is complicated and difficult to build an accurate pupil motion model. Both Kalman filtering and “mean shift” have their own defects to deal with pupil tracking especially under some special condition that we discussed above.

Particle filtering is ideal to predict the optimal estimation of a nonlinear and non-Gaussian dynamical system. It can be used to solve the problems of the pupil occlusion and disappearance which are caused by randomness of the eye movement, blinking, and eye closure. For example, Hansen and Pece [18, 19] used particle filtering for iris and pupil tracking. References [2022] used particle filtering for pupil tracking as well. However, the particle filtering methods above all aim at solving the pupil fast detection, which do not take the pupil shape model and the characteristic of the movement into account. For example, since the shape of pupil is ellipse, the foreground and background cannot be distinguished with a rectangular model; they never consider the geometric similarity of the pupil target ellipse as important clue in the research.

Real-time pupil tracking is the common problem in the VOG system. The considerations for effective tracking are listed as follows.(1)Since the size of pupil is very small and often influenced by eyelashes and eyelids, one of the most important functions of pupil target model in tracking process is to minimize background interference and distinguish target and background.(2)Because of the random pupil motion which is caused by head movement and pupil rotation, transition equation must always attempt to achieve the detailed characteristics of pupil motion as far as possible.

In this paper, a costless and widely used gaze tracking system using PCCR technique is developed for the research. To achieve fast and precise pupil detection, a pupil tracking method based on hybrid of particle filtering and Kalman filtering (HPFKF) in the VOG system is proposed and qualified to meet the requirements as mentioned. The main work in this paper includes the following.(1)Particle filtering is used to track pupil for the fast determination of the accurate pupil location.(2)Kalman filtering is utilized to estimate pupil parameters for pursuing the precision in parameter detection integrated with particle filtering location, which could improve the accuracy and reliability of pupil parameters.

2. Gaze Tracking System and the Initial Pupil Detection Method

In this paper, the gaze tracking system is composed of an infrared light source with two concentric rings, optical glass filters, CCD camera, and image grabbing cards, GPIO (general purpose input/output) cards, single-chip computer, computer, and display screen. There is an inner ring within the infrared light source that produces a bright pupil image when the inner ring is turned on. Alternately, when the outer ring is on, it produces a dark pupil and Purkinje image. When the user watches the screen, the CCD camera obtains face images and sends them to the PC, in which eye features are extracted and eye focus is gotten by gaze mapping function while gaze points are displayed on the screen. To capture the face images that enable this process, the GPIO card is used to obtain the frame alignment signal of the CCD video sequence and employs a single chip microcomputer to control the switch of the light source, activating the inner and outer LED rings alternatively. Gaze tracking system is shown in Figure 2.

To estimate the gaze points on the screen, the first step is to detect gaze parameters in image, in which pupil detection plays an important role. In the initial two frames which CCD camera grabbed, the pupil is located firstly and its characteristic parameters are detected. In the following video image sequences pupil tracking is carried out. The scheme of initial pupil detection and characteristic parameters extraction are described as in Figure 3.

3. The Pupil Tracking Based on Combination of Particle Filtering and Kalman Filtering

3.1. Framework of HPFKF

The procedure of parameters extraction proposed in Section 2 is completed in the first two adjacent frames. We capture the pupil in the two initial frames firstly and then perform pupil tracking in the following video sequences.

In this paper it is shown that pupil target estimation is accomplished by the combination of particle filtering [19] and Kalman filtering by using of bright pupil image, dark pupil image and difference image in adjacent frames. The latter process tracks for accurate estimation of pupil parameters, whereas particle filtering tracks for fast location of pupil. The objective of this proposed approach is to improve real-time performance of pupil detection by particle filtering and improve accuracy of pupil parameters detection by Kalman filtering, which is carried out in every two adjacent frames.

The flow chart of pupil target tracking method based on the combination of particle filtering and Kalman filtering is shown in Figure 4.

During this tracking process, four adjacent frames images (bright pupil, dark pupil, bright pupil, and dark pupil, resp.) are generated by system hardware and defined as follows: the first two frames (bright pupil and dark pupil) are called previous image frame, and the following two frames are called next image frame. Difference image of previous image frame is the subtraction of the bright pupil image and the dark pupil image of the previous image. Difference image of next image frame is the subtraction of the bright pupil image and the dark pupil image of the next image frame. In the initial two frames of bright pupil and dark pupil image, the following steps are implemented: firstly, find the difference of the bright and dark pupil images; secondly, segment pupil in difference image by use of the pupil detection method introduced in Section 2; thirdly, detect pupil target parameters; and finally, accomplish the tracking initialization. Hence, according to the definition of previous image frame and next image frame, the pupil tracking process is implemented in these resulting images.

In previous and next image frames, triple-channel color image is synthesized by bright pupil, dark pupil, and the corresponding difference images. The pupil tracking based on particle filtering is achieved in synthesized color image made from previous image frame and next image frame. According to the detected pupil parameters in previous image frame, we can predict particle state in triple-channel color image synthesized by next image frame. The process of particle filtering pupil tracking can determine the location of pupil. The tracking result based on particle filtering plays a guiding role for the Kalman filtering pupil parameters estimation which is accomplished in the difference image made from previous image frame and next image frames. The Kalman state transition equation of pupil motion is determined by particle filtering tracking result, which can predict pupil target parameters. According to particle filtering tracking results, pupil can be segmented and pupil parameters can be extracted to serve as observation for the correction of the Kalman prediction above. Finally, the Kalman filtering process can estimate the pupil parameters of next difference image. Tracking result of Kalman filtering is not only the results of final pupil detection, but also the target parameters in the next previous image frame.

3.2. Review of Kalman Filtering and Particle Filtering

(1) Particle Filtering. Particle filtering [23, 24] is an approximation algorithm based on Bayesian estimation of sampling theory, which combines sequential Monte Carlo (SMC) method together with the Bayesian theory.

Let the state parameter vector of a target at time be denoted by and its observation by . The history of observation from time 1 to is denoted by . The Bayesian formulation of particle filtering is expressed as

The basic idea of particle filter is to find a set of random samples in the state space of the posterior probability density approximation and to replace with the sample mean to obtain a state of minimum variance estimate.

The key point of implementation of particle filter is to find random samples of obeying the distribution of , which are called particles. sampling points are extracted from the posterior probability density independently, which represented the posterior probability density by the weighted sum.

Bayesian importance sampling (BIS) is to use an easy sample of known distribution to replace posterior probability density for sampling by weighing the sampling particles of the importance function to approximate . From the Bayes theory we obtain After sampling from the importance function, mathematical expectation of the target state vector is approximated as where is normalized weights and is a sampling particle from . To overcome the deficiencies of importance sampling, resampling technique is often used for sampling. The basic idea of resampling is to suppress or eliminate small weight particles. Big weight particles are replicated according to their weights. With the dynamic/temporal propagation, particle filtering has been widely used for tracking applications.

(2) Kalman Filtering. Kalman filtering [25, 26] is based on state space model and state space equations of linear dynamical systems, providing a recursive solution of linear optimization filtering. Using the estimated value of the previous time step and the observed value of the current time step to update the target state estimation, we can get the estimated target state value of the current time.

Kalman filter is an optimal linear recursive filter according to minimum mean square error rule. Its dynamic system is described by a state equation and observation equation. The state equation is The observation equation is where is the state vector in time , is the observation vector in time , is the state transition matrix from time to time , is the measurement matrix in time , is state noise vector, and is observation noise vector. Assume that the process noise covariance is , the observation noise covariance is . Combining the predicted value with measured value of the state, the estimated state value of the current time can be obtained: where is Kalman gain and is corresponding covariance to . The above algorithms constitute the recursive structure of Kalman filtering.

3.3. Pupil Tracking Based on Particle Filtering
3.3.1. Triple-Channel (HSV) Color Map

In order to make the pupil region more apparent in the tracking process, we propose the employment of a synthesized triple-channel color map (STCCM) based on bright pupil image, dark pupil image, and corresponding difference images in which particle filtering pupil tracking can be achieved. STCCM is defined as follows: chrominance of STCCM is equivalent to that of difference image in order to highlight the pupil region color; saturation of dark pupil image is representative of saturation of STCCM; and STCCM brightness is equal to the average brightness of bright pupil and dark pupil images to confirm that the brightness of STCCM is between that of bright pupil and dark pupil images, which prevents significant change in brightness caused by the external light source. Synthesized triple-channel color map is shown in Figure 5. We know that tracking in bright pupil or dark pupil image is difficult, where the pupil feature is not obvious (pupil gray is close to the rest of face). However, STCCM takes full advantage of the triple-channel information, making the color feature of pupil apparent and significantly different from the rest of face. The purpose of the proposed STCCM is to improve the accuracy and stability of pupil tracking.

3.3.2. Pupil Target Model

Pupil target model is established for particle filtering tracking; namely the state of pupil target is described by the vector as follows: where represent image coordinates of the center of the pupil ellipse.

3.3.3. Particle Initialization

The initial state of particle [27] is where represents the sample set, is the state and weight of each particle of the set, respectively, and is noise vector. , that is, state of time , is shown in Figure 6(a).

3.3.4. Particle Selection

According to the weight distribution, randomly select status from . During filtering, the particles with higher weight have a greater chance of being selected and are chosen more often. Hence, the particles with low weight are eliminated during the selection process:

3.3.5. Pupil Motion Model

Taking into account the changes of pupil state in a variety of situations, it is possible to establish a simple linear model to describe pupil motion. The law of pupil state movement is as follows: the elliptical center position of pupil at time is representative of the sum of the elliptical center position of pupil at time and displacement of the pupil movement. In addition, when the state of particles is predicted, noise vector should be considered. Then we can obtain the equation of the pupil motion:

By predicting the state of particles according to pupil motion equation, we can get the particle collection at current moment as shown in Figure 6(b).

3.3.6. Particle Optimization

After iterations for many times, the weights of many particles are smaller, which means that the weight variance of the particle is bigger. The target tracking may be inaccurate when the target is estimated by these particles with very small weights. Furthermore, we can see the phenomenon of particle degeneration. Currently, there are three methods to solve this problem: choose the appropriate sampling density function; use the important resample method [28]; and increase the particle number. Reference [20] has proved that the best sampling density function is the posterior probability density function, which is not easy to sample. Generally the sampling density function is represented by the prior probability density which is notably different from the real probability density. The important resampling method that Gordon et al. [29] have proposed can be described as follows: the particles with high weight reproduce the new particle many times, while the smaller weighted particles reproduce fewer particles. As a result, particle variety is reduced. Although the important resampling method can avoid the particle degeneration, it may cause sampling impoverishment. In addition, the method of increasing particle number can increase the computational burden and influence the real-time tracking.

In this paper, we use the Camshift [30] algorithm to deal with the problem of particle degeneration. The Camshift algorithm (continuously adaptive mean shift) is a nonparameter iteration technology based on a probability distribution model and uses color histogram to represent the object’s observation model. The purpose of the Camshift method used in particle filtering is to accomplish particle aggregation. In other words, iterated particles will move to the region of the target, which reduce the amount of particles with smaller weight. Even if the particle number is small, it can reflect the target observation model. An experiment in which 20 particles are used to track pupil in 600 image frames is applied to verify the effectiveness of the Camshift based particle optimization method.

RMSE (root of mean square error) of tracking results shows that the Camshift based particle optimization method can improve the tracking performance.

Figure 7 has shown the tracking results of frames 15, 177, 310, and 544. Table 1 is the experimental results of RMSE, which is the average coordinates of particle states.

3.3.7. Pupil Observation Model

Color distribution is suitable to be used as target model since it is robust and maintains the persistence of nonrigid target deformation, rotation, and scale changes. Thus, we consider the color weighted histogram of pupil region as our observation model. Epanechnikov kernel function highlights the contributions of different pixel to color histogram. The greatest contribution to the color histogram comes from the existence of pixels located at target center, while the smallest contribution occurs at edge region which may contain background.

Here we use as distance measurement [31], which is known as “gray contour curve distance” and can measure the important weight of the pupil pixel. This means that the smaller the is, the greater the pixel weights are. The ellipse curve is composed of pixels of the same . That is to say, the points of this curve have the same gray value. The of pupil ellipse contour is 1, which gradually decreases towards the pupil center until is equal to 0, where the “gray contour curve” changes to the center of pupil ellipse. The initial state of pupil and pupil histogram are shown in Figures 8 and 9.

During the update procession, we calculate the weighted histogram for each particle and discover that when there are similar histograms, there is higher weight. The weight of each particle can be calculated based on the weighted histogram of each particle [32]:

3.3.8. Pupil State Estimation

Pupil target state is estimated by

Firstly, the histogram of the estimated target is calculated. Then, we get the similarity , and finally, we obtain the probability weight . If probability weight of the estimated target is larger than threshold , the tracking is considered to be successful, and target state is updated by ratio . Contrarily, if the tracking is deemed a failure, then the target state cannot be updated. The tracking result is shown in Figure 10.

3.3.9. The Estimation Process of Particle Filtering

The flow chart of the pupil tracking based on particle filtering is shown in Figure 11.

Synthesize triple-channel color image.

Establish pupil model:

Initialize the state of each particle of the set

Select particle.

According to the weight distribution, select the higher weight particles from ; then constitute :

Build pupil motion model.

According to pupil motion model, particle state is predicted; it can also describe pupil motion state by using a simple linear model:

Establish pupil observation model.

Calculate the histogram of each particle and then calculate weight of each particle based on histogram:

(7) Estimate pupil state:

3.4. Pupil Tracking Based on Kalman Filtering

After pupil tracking based on particle filtering, we can determine the location of pupil. As shown in Figure 12, the rectangle describes the location of pupil. Next, we segment the pupil in the corresponding pupil region in difference image and attain pupil parameters to be used as observation of next image frame. Kalman filtering [33] is then used to precisely estimate the pupil parameters based on pupil parameters of the previous frame image and the observation of the next frame image. Hence, Kalman filtering tracking is implemented in the difference images.

According to the theory of Kalman filtering, we assume that the pupil model in the difference image at time can be represented as follows: where are representative of the pupil center coordinates at time ; then are the major axis and minor axis of the pupil ellipse, respectively. is the angle between the major axis of pupil ellipse and the vertical direction. represent their respective velocity at time in direction. signify the rate of change of pupil scale and angle , respectively. Pupil location estimated by particle filtering at time is . Assuming pupil position parameters estimated in previous image frame being , we can get and .

The state vector at the next time frame is linearly related to previous state by the system motion model as follows:

Here, is the state transition matrix, represents system perturbation which is normally distributed as , and represents the process noise covariance.

If we assume that observation vector of pupil target is which is the pupil parameters detected at time , we can see that the observation model of the Kalman filtering is

Here matrix is called “observation matrix,” which represents the relationship between current state and current measurement, while represents measurement uncertainty. is normally distributed as , and is the measurement noise covariance. Here is defined as follows:

Given the state transition model (see (20)), the observation model (see (21)) and the initial condition, the state vector and its covariance matrix can be updated as follows.

Let be the estimated state at time , resulting from the transition of system state model. is often referred to as the “prior state estimate.” is estimated using both the system model (see (20)) and the measurement model (see (21)) and is usually referred to as the “posterior state estimation.” Let and be the covariance matrices for the state estimates of and , respectively. These matrices determine the uncertainties associated with the prior and posterior state estimates. The Kalman filtering algorithm for state prediction and updating may be summarized below.

3.4.1. State Prediction

According to estimation state and its covariance at time, and its covariance at time can be predicted:

Here, represents state transition matrix. In Section 3.3 of this paper, we can determine the location of pupil by the method of particle filtering. Then, in the corresponding pupil position in the difference image of next image frame, we segment pupil target into binary image and calculate the circularity of the pupil target:

Here, is the area of the pupil target, while is the perimeter. Both of them are described by pixel number. The range of value is , being that the rounder the target is, the larger the is, and the value of a standard circle is 1. The shape of the pupil ellipse can be described by circularity.

During pupil motion process, both its shape and scale change. Assuming that is the target circularity of the region that Particle filtering tracked at time and at time , the ratio of and would be defined as follows:

According to different value of , we can get the different state transition matrix:

3.4.2. State Updating

Based on particle filtering tracking results (Section 3.3), the prior state estimate and its covariance matrix , we can determine the region of the pupil, and therefore we segment the pupil and attain pupil parameters . We can then perform state updating to derive the final state and its covariance matrix. The first task during state updating is to compute the Kalman gain :

The gain can be regarded as a weighing factor when determining the contribution of measurement and prediction to the posterior state estimate . is computed as follows:

The a posteriori error covariance estimate is computed as follows:

Kalman filtering always produces complete recursive state estimation through the processes of state prediction and measurement updating. Consequently, the posterior estimation at current moment is a priori estimation of next moment.

4. Experimental Results and Analysis

4.1. Results of the Pupil Tracking

In order to prove the validity of the tracking procedure proposed in this paper, we chose 1000 frames consecutive image sequences offered by 3 users for our experiments. An Intel Pentium Dual, CPU E2200 with 2.20 GHz frequency hardware configuration was used to create image of which resolution is 760 by 576 pixels. The experimental circumstances included indoor, cloudy, and normal fluorescent lamp illumination without strong light interference. 1000 frames tracking results of every user are recorded each time.

During tracking process, the main purpose is to validate the tracking ability of pupil under various conditions. Figure 12(a) shows the tracking results of frames 169, 468, 715, and 856 under significant facial pose changes. Clearly, it is seen that the proposed method can stably track pupil under the conditions of different head poses. Figure 12(b) shows the tracking results of frames 167, 465, 744, and 955 when there are significant face pose changes with glasses. In these situations, if the light glaring on the glasses does not completely occlude the pupils, our method will effectively track pupil. Figure 12(c) shows the tracking effect of frames 166, 310, 519, 767, and 989 under significant face pose changes when blinking. If there is no long time eye closure, or there is no sudden head movement, the proposed method in gaze tracking system can accurately track pupil and meet real-time requirement. Figure 12(d) shows the tracking effect of frames 82, 454, 684, and 946 under normal conditions. These results demonstrate that this method of combining the two tracking techniques produces much better tracking results than using the direct detection method and particle filtering method individually.

4.2. Analysis of Tracking Performance

To illustrate the differences among particle filtering method, direct detection method, and our proposed method, we show the tracking results of these methods under different conditions. As can be seen from Figure 13(a), particle filtering can be used to track the eye during the whole sequence under different conditions, but it is not always successful in detecting the correct eye position. In Figure 13(a), we also can see that particle filtering can only recognize the center of the rectangle range as pupil center, which leads to the center deviating from the pupil center. As shown in Figure 13(b), the direct detection method cannot correctly detect pupil when one is blinking or wearing glasses. In contrast, the proposed method then shown in Figure 13(c) can accurately obtain the pupil center under different conditions.

Table 2 shows experimental results of RMSE for the detection of pupil center using three methods (particle filtering, direct detection, and the proposed method). In the experiment, we are able to locate the true center of pupil frame by frame manually and determine the real image coordinate of pupil center. From above testing sequences, RMSE between the center of the tracking result and the true center of pupil target can be calculated as shown in Figure 14. In Figure 14, particle filtering and direct detection may be able to track pupil target, but the results are not accurate. In the entire tracking process, the proposed method usually maintains stable and accurate tracking and has a smaller margin of error, which meets the accuracy requirement of the pupil tracking. Figure 15 shows the final gaze point estimation results on the screen of user 1 based on the proposed method in gaze tracking system described in Figure 2.

This paper proposes a framework for target tracking by using particle filter combining with Kalman filtering. It not only meets requirement of the rapid target location with particle filter, but also estimates the parameters of the target with Kalman filtering. The entire framework of this algorithm achieves the rapid target location and accurate detection of pupil parameters. Currently, many literatures also proposed some portfolio tracking methods. In the literature [17], pupil is located with mean shift method firstly, and then the algorithm updates the tracking parameters with Kalman filtering. The literature [34] tracks the target with particle filter combined with mean shift. In order to verify the effectiveness of the method of particle filter combining with Kalman filtering proposed in this paper, we compare our method with the method of the literature [17], the literature [34], and the method of particle filtering. We capture 2 groups of image sequences which includes about 500 frames in each group. We choose 235 frames from each group including normal image and blinking image. The four methods mentioned above are used for tracking pupil target. The tracking results of frames 127, 147, 173, 179 and 61, 88, 110, 129 from each image sequence are shown in Figures 1, 2, 3, and 4. In Figures 1 and 2, the green rectangle is the final results of our method, which is updated by Kalman filtering. The red rectangle is the results before Kalman filtering updating. Figures 3 and 4 are the results by using particle filtering and using the method of particle filtering combined with mean shift, respectively. Figures 1 and 3 show that when the tracking error is relatively larger by using particle filter, our method can obtain the more accurate results. During the tracking process, the robustness of our method is also better. Figure 2 shows the results of Kalman filtering combined with mean shift, which use the method mean shift firstly for target location and then use Kalman filtering to update the parameters of the tracking target. For the pupil tracking, the results of the method of Kalman filtering combined with mean shift are not better than the method we proposed but are better than the other two methods. For more complex target tracking problem, the effect of the proposed tracking method where particle filtering is combined with Kalman filtering will be far superior to the method where Kalman filtering is combined with mean shift (see Figures 16, 17, 18, and 19). The results of the method of particle filtering combinedwith mean shift are shown in Figure 4. In this method the local peaks of all particles are gotten by using mean shift; then the tracking process is calculated together with the original particles and processed particles. But the experimental results show that the tracking accuracy of this method is not very nice. This is because some particles have offset the original particle states, which make the calculated location farther away from true position. But the number of particles used in this method is less than that of other methods, in which only 50 particles can achieve better results while the number of other methods is 200. The results of these tracking methods are shown in Table 3. We can see that the results of the proposed method are better than those of other combination methods [17, 34].

5. Conclusions

In this paper, we proposed a method of pupil tracking in a gaze tracking system based on the combination of particle filtering and Kalman filtering when we oriented to fast and accurate pupil target detection using an active infrared source gaze tracking system. In order to make the color feature of the pupil more prominent, we synthesize triple-channel color image by using the bright pupil image, dark pupil image, and difference pupil image. Particle filtering is applied for fast pupil tracking by using triple-channel color image. On the basis of the tracking results of particle filtering, we use Kalman filtering to attain more accurate pupil parameters in difference image. Experimental results and analysis verify and illustrate the effectiveness of the proposed method.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.


The work is supported by Beijing Key Discipline Development Program (no. XK100080537), the Beijing Natural Science Foundation (4122050), and the Opening Project of Key Laboratory of Operation Safety Technology on Transport Vehicle, Ministry of Transport, China.