Recent Advances in Information TechnologyView this Special Issue
Research Article | Open Access
Design of Jitter Compensation Algorithm for Robot Vision Based on Optical Flow and Kalman Filter
Image jitters occur in the video of the autonomous robot moving on bricks road, which will reduce robot operation precision based on vision. In order to compensate the image jitters, the affine transformation kinematics were established for obtaining the six image motion parameters. The feature point pair detecting method was designed based on Eigen-value of the feature windows gradient matrix, and the motion parameters equation was solved using the least square method and the matching point pairs got based on the optical flow. The condition number of coefficient matrix was proposed to quantificationally analyse the effect of matching errors on parameters solving errors. Kalman filter was adopted to smooth image motion parameters. Computing cases show that more point pairs are beneficial for getting more precise motion parameters. The integrated jitters compensation software was developed with feature points detecting in subwindow. And practical experiments were conducted on two mobile robots. Results show that the compensation costing time is less than frame sample time and Kalman filter is valid for robot vision jitters compensation.
Computer vision is the most important sensor of intelligent moving robots. In real environment, the surface evenness always causes the camera jitters to affect the precision of operation. Electronic vision stabilization has been widely used in the autonomous robot vision [1–4], which includes motion estimation, smoothing and compensation processes . The method of BMA (Blocks Matching Algorithm) was provided with the exhaustive search method in . BMA can get high precision with large amount of calculation. However, its real time capability is bad and just fit for the simple motion vision . Reference  used the circular block matching to improve the estimation for the rotational motion vision. Reference  researched a matching algorithm based on splitting and merging of block and representative points to improve the calculation speed. Projection algorithm (PA) was put forward for the vision motion estimation to gain the displacement vector based on the grayscale change of images. However, PA is valid for these images with the obvious grayscale change. Reference  researched the feature-tracking algorithm (FTA). Reference  used the optical flow constraints equations to solve the motion parameters of images. Optical flow is the 2D instantaneous velocity field of moving points in focal plane array. The aim of motion error compensation is to reconstruct images based on the smoothed parameters. The mean filter was used for smoothing motion vector in . Reference  put motion parameters into the finite impulse response (FIR) filter and filtered them with product of input sequences and interruptive function. To distinguish the independent movement from the jitters, Kalman filter was used in image stabilization . Kalman algorithm can predict the image motion and adjust observation data based on error covariance. Moreover, authors had compared the translation jitters to rotation jitters, FIR filter to Kalman filter, and relative parameters filter to absolute parameters filter, and the advantages and disadvantages of various algorithms were given out clearly . From the above literatures, we found that the analysis of images motion equations solving error is less, especially in view of feature point pairs number and matching error, and jitters compensation test is absent on the autonomous robot moving on bricks road in outroom.
The rest of this paper is organized as follows. Firstly, an image kinematics model is established and the feature points detecting and matching methods are designed based on the gradient matrix Eigen-value and the optical flow, and image motion parameters solving method is given in Section 2. A jitters compensation process based on filter is described in Section 3. Then the condition number of the equation coefficient is used to analyse the parameters error. A compensation software is developed and experiments are implemented based on the platform of two autonomous robots moving in outroom in Section 4, and curves are compared between before and after Kalman filter. Finally, results are given in Section 5.
2. Kinematics Modelling and Solving
2.1. Kinematics Modelling of Images
Coordination of the pixel is defined as at time in the image coordinate system. denotes the coordination of in the given frame and in the adjacent frame. According to the imaging producing principle, coordination moving equations are described as where is the focal length of camera and is the position of in direction of optical axis of camera coordinate system. , , and are the amount of coordination increment, and is the rotation amount. According to (1), images motion is relative to 6 parameters. So the image motion kinematics can be established as where , , , and indicate the scale and rotation amount, and and the translation amount.
2.2. Feature Point Detecting
The feature window is defined as a ( is odd number) square, and the center point of this feature window is, namely, the feature point. Gradient matrix of the feature window is where represents the feature window scope and and represent the gradients in horizontal and vertical direction, respectively, which can be got by numerical difference.
Using () subwindow to scan the feature window along horizontal and vertical direction, we can get scanning windows. Each gradient matrix of the scanning window has two real Eigen-values, and the lesser Eigen-value is denoted by . The maximum value of the lesser Eigen-value of all scanning windows can be expressed as
This paper adopts to describe the center point characteristics quantity. If is larger than the given threshold, this point will be selected as the useful feature point.
2.3. Feature Points Matching Based on Optical Flow
When robot is moving in the continuous surface, the adjacent points have homothetic motions, constant brightness, and a tiny small motion in continuous time.
The frames constraint equation can be transformed using Taylor formula ; the following equation can be got where and are the derivatives in horizontal and vertical direction, respectively, is the derivative of time, and and are the coordination difference of the feature point, namely, the gradient of optical flow.
We establish (6) of all points in square and use the least square method to solve the and . Based on the and , the feature point’s corresponding pixels coordinate in adjacent frame can be got.
2.4. Kinematics Parameters Solving
The aim of kinematics parameters solving is to get the image motion parameters , , , , , and . According to (3), there are 6 parameters needing to be solved, so at least 3 pairs of matching points are necessary. To ensure precise solving and stability, the number of pairs, denoted by , is always more than 3. The solving equation of the kinematics parameters is established as where and are the matching point coordinate position of the feature point, respectively.
When , (7) is an overdetermined equation and can also be solved using the least square method.
3. Jitter Compensation and Errors Analysis
Using and to denote the corresponding pixels coordinates in the frames, of and separated by frames respectively, we can transform (3) as the affine transformation kinematics model through the recursive method where , , , , and , , and where is the largest frame count of the video.
According to the coordinate relations of the matching feature points, and can be got through the same solving method as (7).
Due to the jitters, the motion parameters are not smooth. This paper used Kalman filter to smooth the parameters and then get the new and .
In motion compensation process, (8) was solved again with the new and to get the smoother and . Whole continuous and post compensation stable video can be obtained using the first frame image and recursion algorithm.
The motion parameters are the basics of compensation, so the parameters solving errors will obviously affect jitters compensation. For the purpose of errors analysis, we adopted the condition number of matrix to quantificationally illustrate effect of the feature points quantity on parameters solving errors.
In mathematics, the matrix condition number is defined as the product of matrix norm and its inverse matrix norm and expresses the sensitivity of matrix calculation to error.
The condition number of the matrix of (7) was given by where is a symbol of matrix norm.
In view of (7), if the matrix has a big condition number, the slim errors of feature point matching will cause a huge change on kinematics parameters. In order to analyse the effect of the numbers of the feature points on the solve precision, two computing examples were carried out.
Firstly, 3 pairs of the feature points, , , and , were used for solving (7).
Then designedly add 1 pixel error in vertical direction to the 3rd feature matching points, such as , and (7) was solved again.
Similarly, 30 pairs of the feature point, including the above 3 pairs, were used for solving (7), and also designedly add 1 pixel error to the 3rd point. Kinematics parameters solving results are shown in Table 1. The bold numbers in Table 1 refer to the parameters with errors. Table 1 shows that solving with 30 pairs matching points is robust to the matching error, and the individual point matching error would not produce a large change on results. The reason is that the condition number of with 30 pairs is much lesser than with 3 pairs. So the solving method just using 3 pairs matching points is more sensitive to the matching error.
|The bold numbers refer to the parameters with errors.|
In view of (3), using parameters got by (7) to compute the condition number of , solve the corresponding point of the compensation processing. Results are shown in Table 2. The bold numbers in Table 2 refer to the parameters with errors.
|The bold numbers refer to the parameters with errors.|
4. Vision Stabilization Software Testing
4.1. Test Setup
Experiments were conducted on two autonomous moving robots. Robot.1 (large) is the Voyager-IIA autonomous robot made in China. Robot.2 (small) is the X80-H robot made in Canada. Robot.1 has many sensors such as vision camera, ultrasonic, infrared ray, and gyroscope. Robot.2 is equipped with wireless communication equipment. The physical experiment scene is shown in Figure 1. In experiment processing of Robot.1 following Robot.2, the moving speed of Robot.1 is set to 0.11 m/s, while of Robot.2 to 0.07 m/s and linear forward.
The two autonomous moving robots are controlled by the personal computer (PC) through wireless network. Autonomous navigation software on PC controls motion of the autonomous mobile robot, such as move forward, turn back, speed up, and slow down. The CMOS camera is fixed on Robot.1 and connected with PC by USB line, and it transfers the real-time images to PC.
4.2. Jitter Compensation Software Design
Software was developed using the Visual C++6.0 programming language on the Windows XP operating system. And the central processing unit (CPU) is an Intel Core2Duo 2 GHz system with 1 GB of RAM. The whole software is composed of three parts, the control software of Robot.1, the control software of Robot.2, and the jitters compensation software. Video was captured based on DirectShow. After the image stabilization, the smooth video is displayed on the screen of PC. The compensation software procedure is illustrated in Figure 2(a) and software visual interface in Figure 2(b).
(a) Software procedure
(b) Software interface
The video sampling frequency in the mobile robot moving is 20 Hz; namely, the time interval of the adjacent frame is 50 ms. All jitters compensation time was tested through GetTickCount() and cvGetTickFrequency() functions provided by LIB files. And test result is about 24 ms, greatly less than 50 ms. So the proposed jitters compensation algorithm is real-time.
4.3. Subwindow Feature Point Detecting Experiment
The feature point detecting algorithm based on the gradient matrix Eigen-value always gets the feature points collected on the some objects. In order to uniformly distribute the feature points and accelerate the detecting speed, we divided whole image into many nonoverlapping domains with square size. Then we scanned subwindow to get feature points. Figure 3 shows the feature points detecting of one frame in the video sequence based on the conventional feature extraction and the improved feature extraction.
(a) Result of whole window scans
(b) Result of subwindow scan
Feature points may concentrate on some objects in Figure 3(a), such as Robot.2 and the tree in background, which will easily make wrong matching and is disadvantageous to the parameters solving. Figure 3(b) shows that scan in subwindow makes the feature points equably distributed in the whole image. And the dispersed feature points are beneficial for the kinematics parameters solving using the least square method.
4.4. The Parameters Smooth Results Using Kalman Filter
Two robots moved linearly forward, respectively, apart by about 1.5 m. There is the same size blocks paved on robot moving road. The length and width of blocks are 19 cm and 9.4 cm, respectively, and slot between blocks is of 0.7 cm width and depth 0.3 cm. Robot.2 is forward, while Robot.1 is behind and its motion velocity is more than that of Robot.2. So Robot.1 is continuously getting closer to Robot.2. The test time of jitters compensation is 16 s.
The process state variance of Kalman filter has key effect on parameters smooth degree of intended movement. Meanwhile, the observed variance of Kalman filter decides the changeability of unintended jitters movement. If observed variance is zero, it will cause no motion compensation effect. So the process state variance and observed variance values must be set according to intended movement and jitters motion quantity, respectively. The initial process state variance in Kalman filter is , the state square error is 0.00001, and the observed variance is 0.01. Filter was implemented with relative motion parameters of matrix . Then, in order to show motion parameters clearly, the corresponding absolute parameters were got by successively adding relative parameters and are illustrated in Figure 4.
Mean square errors (MSE) comparisons between before and after filter are shown in Table 3.
Figure 4 shows that the curves after filter are smoother than before filter. The variation is more complicated than other parameters. The reason is that the video is obtained during the mobile robot is moving on the road paved with bricks, and the interval slots between bricks mainly produce vibration of robot wheel in vertical direction. Table 3 shows the mean square error after filter is less than before evidently.
4.5. The Effect of Video Stabilization
The series frames of before and after the images stabilization are as shown in Figure 5.
(a) Original series
(b) Series after the image stabilization
Figure 5 shows the frames 132, 136, 140, 144, and 148 in turn. With Robot.1 moving, the left corner blacker brick should be continuously near to the bottom line of images. According to comparison of 132th frame to 136th frame in Figure 5(a), the blacker brick is almost motionless. According to comparison of 140th frame to 144th frame in Figure 5(a), the blacker brick is away from the bottom line instead. It is caused by the jitters of the robot wheels moving on bricks seam. And after image stabilization, the video sequence is clearly reflecting the motion of the blacker brick smoothly close to the bottom line of images.
Moving on the bricks seam will cause the video bidirectional shake, so Figure 5(a) shows the left-up corner aslant pillar is around swing. But this swing is slight in the video sequence after the image stabilization in Figure 5(b).
Based on comparative analysis, the following can be got.(1)The number of feature point pairs has great effect on the parameters solving precision, and this effect can be quantificationally analyzed by the condition number of matrices and . The condition number of is far larger than . Equation (7) is very sensitive to errors; kinematics parameters must be solved using the feature point pairs as many as possible to reduce the solving errors.(2)Subwindow feature point detecting can avoid the feature points gathering on some objects.(3)The visual jitters compensation algorithm based on optical flow and Kalman filter, developed based on PC, USB camera, Microsoft Windows operating system, and VC++, meets the requirements of precision and real-time demand of robot vision.
But the proposed method cannot compensate the migration jitters caused during the exposure time of the camera. Further study will focus on how to make the parameters of Kalman filter adaptively change with the different jitters amplitude and frequency.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
This research was funded by a Grant (no. LQ13E050004) from the Natural Science Foundation of Zhejiang province and a Grant (no. 201210076) from the Research Project of General Administration of Quality Supervision, Inspection and Quarantine of China.
- T. Proscevicius, A. Bukis, V. Raudonis, and M. Eidukeviciute, “Hierarchical control approach for autonomous mobile robots,” Elektronika ir Elektrotechnika, no. 4, pp. 101–104, 2011.
- L. Lingqiao, F. Zhizhong, X. Jingjing, and Q. Wei, “Edge mapping: a new motion estimation method for video stabilization,” in Proceedings of the International Symposium on Computer Science and Computational Technology (ISCSCT '08), pp. 440–444, Shanghai, China, December 2008.
- S. Battiato, G. Puglisi, and A. R. Bruna, “A robust video stabilization system by adaptive motion vectors filtering,” in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME '08), pp. 373–376, Hannover, Germany, June 2008.
- G. Corsini, M. Diani, and A. Masini, “Video sequence stabilization for real-time remote sensing applications,” in Proceedings of the IEEE International Geoscience and Remote Sensing Symposium (IGARSS '06), pp. 988–991, Denver, Colo, USA, August 2006.
- T. Marius, A. Sakari, and V. Markku, “Method of motion estimation for image stabilization,” Acoustics, Speech and Signal Processing, vol. 2, no. 2, pp. 277–280, 2006.
- F. Vella, A. Castorina, M. Mancuso, and G. Messina, “Digital image stabilization by adaptive block motion vectors filtering,” IEEE Transactions on Consumer Electronics, vol. 48, no. 3, pp. 796–801, 2002.
- S. Niauronis, V. Laurutis, and R. Zemblys, “Eye-hand coordination during self-moved target guiding along labyrinth path,” Elektronika ir Elektrotechnika, no. 10, pp. 71–74, 2011.
- L. D. Xu, F. W. Fu, and X. G. Lin, “Circular block matching based video stabilization,” in Proceedings of the Visual Communications and Image Processing, pp. 1307–1314, July 2005.
- J. L. Shi, W. J. Zhang, and S. Y. Yu, “Motion estimation based on block-splitting-and-merging and its criteria,” Journal of Shanghai Jiaotong University, vol. 32, no. 4, pp. 49–53, 1998.
- J. C. Huang and W. S. Hsieh, “Automatic feature-based global motion estimation in video sequences,” IEEE Transactions on Consumer Electronics, vol. 50, no. 3, pp. 911–915, 2004.
- B. Feng, C. H. Zhao, T. Yang, H. C. Zhang, and Y. M. Chen, “Real-time human action recognition based on optical-flow feature and sequence alignment,” Application Research of Computers, vol. 24, no. 3, pp. 194–198, 2007.
- S. J. Ko, S. H. Lee, and S. W. Jeon, “Digital image stabilizing algorithm based on bit-plane matching,” IEEE Transactions on Consumer Electronics, vol. 44, no. 3, pp. 32–39, 1998.
- W. H. Zhao, T. X. Yao, X. Q. Ye, and W. K. Gu, “RANSAC algorithm in video stabilization,” Journal of Circuits and Systems, vol. 10, no. 4, pp. 91–94, 2005.
- K. K. Lee, K. H. Wong, M. M. Y. Chang, Y. K. Yu, and M. K. Leung, “Extended Kalman filtering approach to stereo video stabilization,” in Proceedings of the 19th International Conference on Pattern Recognition (ICPR '08), pp. 1–4, Tampa, Fla, USA, December 2008.
- Y. Xu, B. R. Wang, and Y. L. Jin, “Robot vision image stabilization based on feature matching and Kalman filtering,” Computer Engineering of China, vol. 37, no. 20, pp. 194–199, 2011.
Copyright © 2014 B. R. Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.