Design of an Efficient Real-Time Algorithm Using Reduced Feature Dimension for Recognition of Speed Limit Signs

Cho, Hanmin; Han, Seungwha; Hwang, Sun-Young

doi:https://doi.org/10.1155/2013/135614

The Scientific World Journal

On this page

Abstract Introduction Background Experimental Results Conclusion References Copyright Related Articles

Research Article | Open Access

Volume 2013 | Article ID 135614 | https://doi.org/10.1155/2013/135614

Design of an Efficient Real-Time Algorithm Using Reduced Feature Dimension for Recognition of Speed Limit Signs

Hanmin Cho,¹Seungwha Han,²and Sun-Young Hwang¹

Academic Editor: N. Nishchal, M. Nappi, P. Daponte

Received28 Aug 2013

Accepted01 Oct 2013

Published25 Dec 2013

Abstract

We propose a real-time algorithm for recognition of speed limit signs from a moving vehicle. Linear Discriminant Analysis (LDA) required for classification is performed by using Discrete Cosine Transform (DCT) coefficients. To reduce feature dimension in LDA, DCT coefficients are selected by a devised discriminant function derived from information obtained by training. Binarization and thinning are performed on a Region of Interest (ROI) obtained by preprocessing a detected ROI prior to DCT for further reduction of computation time in DCT. This process is performed on a sequence of image frames to increase the hit rate of recognition. Experimental results show that arithmetic operations are reduced by about 60%, while hit rates reach about 100% compared to previous works.

1. Introduction

Driver safety is the main concern of the advanced vehicle system which became implementable due to the development of the autonomous driving, automatic control, and imaging technology. An advanced vehicle system gives driver information related to safety by sensing the surroundings automatically [1]. Speed limit signs recognition is regarded to be helpful in safety for drivers using advanced vehicle system. The system needs to recognize the speed limit sign in the distance quickly and accurately in order to give the driver precaution in time since vehicle is moving fast. But existing algorithms perform recognition by using many features extracted from captured image, requiring a large amount of arithmetic operations for classification [2].

Several classification algorithms have been proposed, which include Neural Networks [2, 3], Support Vector Machine (SVM) [2], and Linear Discriminant Analysis (LDA) [2, 4]. Among these, SVM has relatively higher recognition rate, and LDA is used in many classification applications due to its low computational complexity. However, its computational complexity needs to be further reduced to be used in real-time application. It can be achieved by reducing the number of inputs of LDA.

This paper proposes an efficient real-time algorithm for recognition of speed limit signs by using reduced feature dimension. In this research study, DCT is employed and parts of Discrete Cosine Transform (DCT) coefficients are used as inputs to LDA instead of features extracted from image. DCT coefficients are selected by a devised discriminant function. To further reduce DCT computation time, binarization and thinning are applied to the detected Region of Interest (ROI). Image of speed limit sign in the distance obtained from camera has a low resolution and it gives poor rate of recognition. To resolve this problem, this paper proposes a recognition system using classification results on a sequence of frames. It can enhance hit rate of recognition by accumulating the probability of single frame recognition.

2. Background

In this section, LDA is briefly described, which is popularly employed for classification. LDA is a classical statistical approach for dimensionality reduction [2]. It projects high-dimensional data onto a lower dimensional space by maximizing the scatter of data points from different classes and minimizing the scatter of data belonging to the same class simultaneously, thus achieving maximum class discrimination in the dimensionality-reduced space [5]. For example, Figure 1 shows how points in 2-dimensional space can be projected onto 1-dimensional space.

(a)

(b)

The projection shown in Figure 1(b) shows more efficient separation of data than that of Figure 1(a). This concept can be expanded to n-dimensional space. Equations to be followed are derived to find the most efficient axis . Let x be data points belonging to a certain class C_i and y the projection points of x onto axis . Equation (1) shows the average of y, , where is mean of x and n_i is number of data:

It is required to find the axis which maximizes the ratio of distance between and to sum of within-class scatter. This ratio can be represented as (2), where , are within-class scatters of projected data in class 1 and class 2, respectively:

Within-class scatter of class i, , can be represented as in the following equation:

From (3), the denominator of (2) is derived as in the following equation:

The numerator in (2), , is shown in the following equation:

From (4) and (5), r() can be written as in the following equation:

Optimal , , can be obtained as in the following equation, which becomes a conventional eigenvalue problem:

Even though LDA is one of the most popular mathematical models used for classification, it is difficult to be directly used. S_w term in (7) becomes singular when the number of samples is much smaller than dimension of features as can be observed in many practical classification applications, which is called small sample size problem [6]. Also, high dimension of features makes LDA difficult to be directly applied to classification due to its computational complexity. To solve the problem, a method which applies Principal Component Analysis (PCA) before LDA was proposed [7, 8]. The purpose of PCA is to reduce the dimensionality while preserving the variance information as much as possible. However, it is suboptimal due to its ignorance of class information associated with patterns [9]. Direct LDA (DLDA) method [10–12] was also proposed. It directly processes data in the original high-dimensional vectors. The performance of the DLDA algorithm heavily depends on the control scheme that determines the number of features [13].

In this paper, a method which can reduce feature dimension effectively without increasing computational complexity is proposed for real-time algorithm for classification of speed limit signs.

3. Proposed Algorithm

As the number of operations in classification process is proportional to the number of data inputs, it is desirable to remove less significant inputs for classification [14]. By using DCT coefficients instead of features extracted from an ROI image, much less inputs are forwarded to classification process.

Figure 2 shows the overall flow of the proposed algorithm. After preprocessing a detected ROI, binarization and thinning are performed so that DCT computation time can be reduced. For further reduction of arithmetic operations in classification process, parts of DCT coefficients are selected by a devised discriminant function. To increase hit rate of recognition, the proposed algorithm performs classification for a sequence of images.

3.1. Preprocessing

Since the size of ROI varies with the distance between vehicle and speed limit sign, bicubic interpolation is employed to normalize the size of ROI into a predetermined one. Normalized ROI is converted into gray image to reduce bit width of each pixel, and the area indicating a speed limit is cropped by separating foreground from background. Then, white balancing is performed to reduce brightness variance of obtained image. To improve the resultant quality of auto white balancing, the proposed algorithm uses the white area of speed limit sign as a reference. Figure 3 shows an example. Figure 3(a) shows an acquired image, and Figures 3(b)–3(e) show the results of preprocessing for the image of Figure 3(a).

(a)

(b)

(c)

(d)

(e)

(f)

(g)

3.2. Binarization and Thinning

Prior to DCT computation, binarization and thinning are performed in the proposed algorithm. DCT computation uses each of pixel values to obtain coefficients, which require a large amount of operations for usage in real-time recognition. By using 1-bit pixels obtained by binarization, the time for multiplication can be significantly reduced. The threshold of binarization is set to 128, middle value of grayscale image, since the brightness variance has been compensated by applying white balance in advance. Figure 3(f) shows binarized image of preprocessed ROI. Even though the feature of an image is degraded by binarization, experimental results show that the hit rate of recognition has not decreased significantly.

For further reduction of DCT computation time, thinning [15] is applied to generate more 0’s in the binarized image. Thinning also removes noises remaining after binarization. The noise removal will improve classification performance. In thinning process, each pixel value is calculated by using the values of its 8 neighbors. For thinning, lookup table is used for binarized image instead of complicated operations required for gray image. Figure 3(g) shows the image after thinning.

3.3. DCT Computation

2D DCT computation can be replaced by two 1D DCT computations using the row-column decomposition [16]. The time for the first 1D DCT computation can be significantly reduced due to increased number of 0-valued pixels after binarization and thinning. In the second 1D DCT computation, parts of DCT coefficients are generated, which are selected using a devised discriminant function, for reduction of computation time.

3.4. DCT Coefficient Selection

Classifier’s performance increases dependently on the number of features. However, computational complexity and memory requirements are proportional to the number of the features both in the learning and in the classification processes. To reduce these burdens we need to remove less significant features [17]. The selected DCT coefficients are used as features in the proposed algorithm, and the performance of the classification is not degraded by using reduced amount of DCT coefficients. As mentioned in the previous section, parts of DCT coefficients are selected by using a devised discriminant function obtained through intensive analysis on the attributes of object class. The function is defined through a training process performed off-line on the classified database. The procedure to obtain discriminant function is as follows.

First, mean of DCT coefficients , ), is calculated for images per class c. Then, intraclass variance, intra_var_c(), is calculated for every D() by (8) and interclass variance for all the classes, inter_var() is obtained by (9):

Here, D() is coefficient of 2D DCT and D^k() is D() of the kth training image. From the equations above, discriminant factor for each DCT coefficient D() can be calculated as in the following equation:

Classification is more efficient when samples in the same class are clustered together and samples belonging to different classes are scattered in the feature space. The larger the discriminant factors are, the greater the impact on classification is in the field. The devised discriminant function selects a number of indices of 2D DCT coefficients in descending order which have large DF values. Those selected indices are used as reference positions whose corresponding DCT coefficients will be applied in classification process.

3.5. Classification of Speed Limit Signs

Classification is performed using the Linear Discriminant Analysis (LDA) and Mahalanobis distances [2]. LDA is performed to transform DCT coefficients into the format suitable for matching with the classes in database, and Mahalanobis distance is used as a metric for matching. Classification results for a sequence of images are used for recognition of speed limit signs. Equation (11) expresses the probability of matching with class c after classification for N image inputs: where ’s are the weights determined experimentally. They are inversely proportional to the distance between vehicle and object. MD_c(k) is Mahalanobis distance between captured image k and class c. The image is classified as class c whose probability P_c is the highest from P₁ to P_N.

4. Experimental Results

Images used for training and classification were captured on road using a mirrorless camera (MOS sensor, 4/3 inch) mounted with a 20 mm lens at resolution and 30 frames/s in normal daytime. Classification is started with an image of speed limit sign captured about 30 meters away. ROI detected from the image consists of pixels. We used 200 images captured at different distances for training purpose per class. Table 1 shows the hit rates of recognition for the speed limit signs captured 1,000 times on the road.

The hit rates of recognition are about 100% when classification is performed for 7~9 consecutive images. Table 2 compares the number of arithmetic operations with LDA and SVM [2]. The numbers in parentheses represent the reduction percentages. The numbers of arithmetic operations are reduced by about 60% and 80% when compared with LDA and SVM, respectively. In the experiments 57 DCT coefficients are selected out of 400 using the proposed discriminant function.

5. Conclusion

A real-time algorithm for speed limit sign recognition has been proposed with reduced amount of operations using DCT. The number of arithmetic operations was reduced by using lookup table on binarized image, which was obtained through binarization and thinning. To reduce feature dimension, discriminant function which selects parts of DCT coefficients was devised. Selection of DCT coefficients makes it possible to reduce runtime for recognition.

Accurate recognition of speed limit signs in low resolutions or in the distance is achievable by applying the proposed algorithm.

Acknowledgment

This research was supported by the MEST (Ministry of Education, Science and Technology), through NRF (National Research Foundation) of Korea under Grant no. 2012-0002586.

References

R. Bishop, “A survey of intelligent vehicle applications worldwide,” in Proceeding of the Intelligent Vehicles Symposium, pp. 25–30, 2000.
View at: Google Scholar
R. Duda, P. Hart, and D. Stork, Pattern Classification, Wiley-Interscience, NewYork, NY, USA, 2nd edition, 2001.
C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, Oxford, UK, 1995.
K. Fukunaga, Introduction to Statistical Pattern Recognition, Academic Press, New York, NY, USA, 2nd edition, 1990.
S. Ji and J. Ye, “Generalized linear discriminant analysis: a unified framework and efficient model selection,” IEEE Transactions on Neural Networks, vol. 19, no. 10, pp. 1768–1782, 2008.
View at: Publisher Site | Google Scholar
R. Huang, Q. Liu, H. Lu, and S. Ma, “Solving the small sample size problem of LDA,” in Proceedings of the 16th International Conference on Pattern Recognition, vol. 3, pp. 29–32, 2002.
View at: Google Scholar
J. Li, B. Zhao, and H. Zhang, “Face recognition based on PCA and LDA combination feature extraction,” in Proceedings of the 1st International Conference on Information Science and Engineering (ICISE '09), pp. 1240–1243, December 2009.
View at: Publisher Site | Google Scholar
P. N. Belhumeur, J. P. Hespanha, and D. J. Kriegman, “Eigenfaces vs. fisherfaces: recognition using class specific linear projection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 711–720, 1997.
View at: Publisher Site | Google Scholar
H. Li, T. Jiang, and K. Zhang, “Efficient and robust feature extraction by maximum margin criterion,” in Proceedings of the Conference on Neural Information Processing Systems (NIPS '03), pp. 97–104, 2003.
View at: Google Scholar
L. F. Chen, H. Y. M. Liao, M. T. Ko, J. C. Lin, and G. J. Yu, “New LDA-based face recognition system which can solve the small sample size problem,” Pattern Recognition, vol. 33, no. 10, pp. 1713–1726, 2000.
View at: Publisher Site | Google Scholar
H. Yu and J. Yang, “A direct LDA algorithm for high-dimensional data with application to face recognition,” Pattern Recognition, vol. 34, no. 10, pp. 2067–2070, 2001.
View at: Publisher Site | Google Scholar
J. Yang, Y. Yu, and W. Kunz, “An efficient LDA algorithm for face recognition,” in Proceedings of the 6th International Conference on Control, Automation, Robotics and Vision, 2000.
View at: Google Scholar
X. Wu, J. Kittler, J. Yang, K. Messer, and S. Wang, “A new direct LDA (D-LDA) algorithm for feature extraction in face recognition,” in Proceedings of the 17th International Conference on Pattern Recognition (ICPR '04), pp. 545–548, August 2004.
View at: Publisher Site | Google Scholar
C. Bahlmann, Y. Zhu, V. Ramesh, M. Pellkofer, and T. Koehler, “A system for traffic sign detection, tracking, and recognition using color, shape, and motion information,” in Proceedings of the IEEE Intelligent Vehicles Symposium, pp. 255–260, June 2005.
View at: Publisher Site | Google Scholar
T. Zhang and C. Suen, “A fast parallel algorithm for thinning digital patterns,” Communications of the ACM, vol. 27, no. 3, pp. 236–239, 1984.
View at: Publisher Site | Google Scholar
D. Slawecki and W. Li, “DCT/IDCT processor design for high data rate image coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 2, no. 2, pp. 135–146, 1992.
View at: Publisher Site | Google Scholar
E. Alpaydin, Introduction to Machine Learning, MIT Press, Cambridge, Mass, USA, 2nd edition, 2004.

Copyright

Copyright © 2013 Hanmin Cho et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

1007

Downloads

747

Citations