Obstacle Detection in Hybrid Cross-Country Environment Based on Markov Random Field for Unmanned Ground Vehicle
In order to detect the obstacle from the large amount of 3D LIDAR data in hybrid cross-country environment for unmanned ground vehicle, a new graph approach based on Markov random field was presented. Firstly, the preprocessing method based on the maximum blurred line is applied to segment the projection of every laser scan line in x-y plane. Then, based on -means clustering algorithm, the same properties of the line are combined. Secondly, line segment nodes are precisely positioned by using corner detection method, and the next step is to take advantage of line segment nodes to build an undirected graph for Markov random field. Lastly, the energy function is calculated by means of analyzing line segment features and solved by graph cut. Two types of line mark are finally classified into two categories: ground and obstacle. Experiments prove the feasibility of the approach and show that it has better performance and runs in real time.
Accurately environmental awareness is the key point for unmanned ground vehicle. Obstacle detection is a critical perception-requirement for UGV autonomous navigation. In order to ensure the safety of driving, autonomous vehicles are usually equipped with sensors including camera, laser radar, and microwave sensor. All of those are used to detect obstacles and the ground area [1, 2]. There are many types of nontraversable obstacles that UGV can encounter during autonomous navigation in hybrid cross-country environment, including nontraversable rocks, tree trunks too wide to push over, logs, stumps, ditches, holes, low overhangs or large shrub, water bodies, mud bodies, and steep terrain . The ability to detect and avoid obstacles poses a current challenge to the acceptance and proliferation of tactical UGVs .
In the early stage, unmanned ground vehicle generally used the single laser radar as the distance sensor. The small amount of data and the quick response correspond to its advantage but has the detection limits. With the Velodyne 64-line laser radar technology, the three-dimension laser radar began to replace the single laser radar, widely used for autonomous vehicle platform [5, 6]. HIMMELBACH describes a quick method to segment the large-size long-range three-dimension point clouds that especially used to classify the objects. Efficiency is improved in his approach, splitting the problem into two simpler subproblems of complexity reduction: local ground plane is estimated by the two-dimension labeling components . Guo et al. used the gradient signaling of the road geometry to construct a Markov random field (MRF) and implement an effective belief-propagation (BP) algorithm to put the road environment into four categories: the reachable area, the driving area, the obstacle region, and the unknown region . Although the above detection method based on the grid is relatively stable, it has inherent defects. Detection accuracy depends on the grid size and tends to be pretty lower. The range of the grid map is not large and the large amounts of original data are discarded, because the computation speed is limited . The three-dimension data distribution is of nonhomogeneity and lots of the grids have no data, which leads to the storage and processing wasted.
A simple algorithm for detecting obstacles in Velodyne scans would be to find points with similar x-y coordinates whose vertical displacement exceeds a given threshold . Indeed, this algorithm can be used to detect large obstacles such as pedestrians, signposts, and cars. However, the range is big and calibration error is great. So, it is difficult for this sensor to set the displacement threshold to detect the curb-sized objects on the condition of lots of the positives . Powers and Davis observe that the point-cloud images can be used to identify objects based on combined spatial and spectral features in three dimensions and at long standoff range. The spectral LADAR can distinguish between dangerous terrain (mud) and good driving roads (dry) . Grigore-Müler and so forth observe that the procedures based on the neural network solution are allowed to definite and implement the computational algorithm. That can solve complex problems in terms of detecting and identifying the nature-vegetation targets by concrete countermeasures . Compared to the segmentation algorithm based on grid type, the algorithm based on the graph has higher accuracy and can process all the radar data . The shortcoming is that all the existing algorithms use the single three-dimension points as the nodes of a graph and are easily affected by noise, so its robustness is not satisfactory in the off-road environment.
This paper puts forward a new algorithm based on the graph. The laser scan’s lines are projected in the x-y plane. Then, the fuzzy-fragment algorithm and -means clustering are applied to process the original laser data. Finally, the line segments are used as nodes of Markov random field. The analysis of line segment feature aims to establish the energy function and get the global optimum solution by using the image segmentation algorithm.
2. Summary on Algorithm
The significant differences can be found between the ground data and the obstacle data, through observing the projection of radar data on the x-y plane. As is shown in Figure 1(a), the background stands for the ground and the foreground stands for the obstacle. It is shown in Figure 1(b) that the surface area enlarged assumes the circular pattern. It is shown in Figure 1(c) that the single scan line of obstacle area is enlarged and the scan line shape on the same line is broken or angular. Based on the above analysis, this paper uses the segment-clustering feature to detect the obstacles.
It is shown in Figure 2 that the flowchart of the obstacle detection algorithm includes four steps. First, project the scan line clustering. Second, employ K-means clustering to gather the same line classifications. Third, use the corner detection method to precisely position segment nodes. Fourth, establish the Markov random field exporting the data of obstacles and ground.
Taking into account the projection of every scan line on the x-y plane, the adjacent point in the same class has the same property. So, firstly rapidly split every scan line based on the max blurred line method and then based on the analysis method of the K-means clustering combine the same properties’ lines. Due to the noise, the wrong positioning of adjacent cluster-segment endpoint (nodes or corner points) may occur, so corner detection is used to accurately select endpoint and increase the accuracy of the algorithm. Interclass endpoint is selected as graph nodes, so the processing speed of the graph cut is increased due to reducing the number of nodes, and more than one feature from class property can be gained, which improve the robustness of the algorithm. Finally, using the graph cut algorithm can obtain the overall optimization results and further improve the algorithm’s performance.
3. The Clustering Segmentation of Scan Line Projection
Due to the ground roughness and laser radar itself having the detection noise, the projection of scan line on the x-y plane contains noise. This paper adopts the method based on the maximal blur-line segments (BS) to quickly divide the scan line projection. Then, based on K-means clustering, get the data collection of the similar lines and reclassify the combined lines having the same attribute.
3.1. Preprocessing Based on the Max Blurred Line Segments
The blur-segment is defined as the two-dimensional point set which is finite and discrete between straight lines : and : , as shown in Figure 3. The width of BS is as follows: Count the convex envelope’s support lines in order to obtain the blur-segment parameters of the two-dimensional point set . And have a linear relationship between the number of points and the computing complexity.
Steps of the segmentation algorithm are as follows.(1)Sequentially import the mapping points of scan lines. If the distance between the current point and the previous point is greater than the threshold , regard the previous point as the endpoint of the current line segment and the current point as the starting point of a new line. The two adjacent points are signed as the separation point.(2)If the distance between the current point and the previous point is less than the threshold , calculate the blur-segment parameters. If the width of BS is greater than , then regard the previous point as the endpoint of the current line segment and the current point as the starting point of a new line. The two adjacent points are signed as the connection points.The three-dimension laser radar under different distances has different resolutions. Both and are adaptive parameters related to the distances. Consider is the distance between the current point and the original point. and are the scaling factor.
The segmentation result of the single scan line is shown in Figure 4, and the adjacent segments are represented by the two different colors.
3.2. Merging Segments Based on K-Means Clustering
First, randomly select points from the line segment data set as the initial cluster centers, which can be any data contained in the line segment data set. Then, calculate the distance between every point of the whole line segment data set and the initial cluster centers, which is the basis of merging the data to the nearest class. The new cluster centers are every class’s average. If the new cluster centers are consistent with the previous ones, the cluster-criterion function is convergent and regulating the data set class is over. If not, repeat the previous step until the cluster-criterion function is convergent, and then the algorithm ends (Figure 5).
The steps of -means clustering are as follows.
Set the data set of the line segments as .(1)Initialization: randomly select data from the data set as the initial cluster centers.(2)Distribute and : calculate the distance , , from , , to every cluster center , and then divide to its nearest cluster center’s class.(3) Revise : calculate the average of data and regard it as the new cluster center.(4)Calculate the deviation : (5)Judge whether the is convergent: if is convergent, the whole algorithm is over. If not, return to step until is convergent.As is shown in Figure 6, after the processing of -means clustering, the line segments belonging to the background have been merged together. So, the number of line segments participating in the subsequent operation is significantly decreased, which will reduce the complexity of the corner detection and improve the real time and robustness.
4. Corner Detection of Between-Cluster Line Segment
Since the certain width of fuzzy lines existing has been reclassified or merged, the connection point cannot be accurately determined on the maximum curvature. Here, put forward corner detection algorithm to reset the connection point.
Considering the integral operator , is defined as follows: The curve of two dimensions is defined as the parametric equation , and the kernel function is defined as the Gaussian function: is spread near by the Taylor formula: Then, The value of the integral operator on the normal direction is proportional to the curvature , and choose the maximum-curvature point as the angle point.
5. Construct the Undirected Graph on the Markov Random Field
The global sign-optimization model is established by the frame of Markov random field and is often used to solve the problem of the scene classification . is defined as the undirected graph, and the node belongs to . Edge belonging to corresponds to the adjacent node pair. The nonnegative weight is assigned to every edge belonging to , which is used to describe the dissimilarity of the adjacent nodes and .
The mark-speculation algorithm can find the tag with the highest probability for each node, for example, the graph cut . Define as a finite mark set, is the mark-assignment function, and marks every node. Assume that the marker changes slowly in the region but changes dramatically on the edge of this region. The mark’s accuracy is measured by the energy equation: The first item on the right side is the data item, which restrains the node. The second item is the smooth item, which restrains the smoothness for energy equation .
In this paper, every segment may be labeled as a node. All of the adjacent three-dimension points aligning the current node form a set of the adjacent segments. The side-neighbor segment (SN) is defined as the same scan line segments, which is adjacent to the node. The up-neighbor segment (UN) is defined as the previous scan line segment that is adjacent to the node. The down-neighbor segment (UN) is defined as the next scan line segments that are adjacent to the node. According to the definition, every line segment may have two sides of the adjacent segments but have multiple adjacent segments up and down.
The number of segment nodes based on the threshold can be divided into two categories, the long segment () and the short segment ().
When the ground is smooth, the long segments mostly belong to the high probability’s ground area (). But under the following conditions, the long segment is defined as the high probability’s obstacle area .(1)The long segment endpoint is marked as the separation point. And the difference between the point to the radar’s origin and the adjacent segment’s separation point to the origin is larger than the threshold.(2)The long segment endpoint is signed as the connection point. And the angle difference between the angle of the connection point and adjacent segments and right angle is less than the threshold.(3)The gradient between long line segment and up-neighbor segment is larger than the threshold. The gradient is defined as the ratio between the average height of three-dimension points contained in the line segment and the average distance of segments on the x-y plane.The short segments mostly belong to the high probability’s obstacle area. But under the following conditions, the short segment is defined as the high probability’s ground area .(1)The short segment endpoint is marked as the connection point. And it is greater than the threshold that the angle difference is between the endpoint and the long-adjacent segment defined in the high probability’s ground area and the right angle.(2)It is less than the threshold that the gradient is between the short segment and the long up segment defined as the high probability’s ground area.According to the above discussion, In order to ensure the consistency of the local area, define the smoothing term: is the function, which measures the similarity according to the average height difference of the adjacent segment in the three-dimension space. In addition, is theexperiential coefficient: Eventually, the graph cut is employed to optimize the energy equation and obtain the marking results.
Assume that and are the random fields of the two-dimensional plane. is the observed image, and is the original image. According to the Bayes criterion, the optimal criterion for segmentation is the following. It is obvious that is the optimal segmentation result: is the prior probability. is a conditional probability function, whose probability distribution is obeyed to the image’s distribution.
Formula (12) is equivalent to formula (13), because is known for a given picture :
6. Experiments and Results Analysis
In this paper, the information acquisition system of the unmanned ground vehicle includes a computer processing system and a monocular vision CCD and 3D laser sensor. The function is mainly for acquiring and processing sensor data in the cross-country environment. CCD and laser sensors are used for acquiring information and decide obstacles’ identity based on the pattern recognition. Figure 7 shows the data-acquisition area of the unmanned vehicle and 3D laser sensor.
Collect the sensor data in the cross-country environment, and compare the results of the algorithm put forward here with the manual-mark results, as is shown in Figures 8 and 9. (a) shows the CCD picture in the same scene. (b) shows the manual-marking results of the laser image in the same scene, and white represents the raised areas, black represents the sunken areas, and gray represents the ground area. (c) shows the results of this paper’s algorithm, and gray represents the ground area, red represents the raised areas, and blue represents the sunken areas. (d) shows the detection results based on the local convexity-concavity algorithm . These arguments can be predicted by contrastive analysis, the detecting-algorithm results of (c) are close to the manual-marking results, and the results of the local convexity-concavity algorithm (Figures 8(d) and 9(d)) are poor, which uses local plan’s normal vector direction as the criterion and selects 4 adjacent points from 3D laser data to get the fitting plan. So, in constructing the graph, it removes the sides whose length is larger than the threshold so as to avoid the long-distance point used to fit the plane, which can lead to the misjudgment on the obstacles’ edge. Because in the long distance, the adjacent points of laser data are far away from each other, which lead to turning up lots of unknown area after removing long sides that affects detecting results.
(a) CCD picture in the same scene
(b) Laser radar results of manual marking in the same scene
(c) Results based on the algorithm from this paper
(d) Detection results based on the local convexity-concavity algorithm
(a) CCD picture in the same scene
(b) Laser radar results of manual marking in the same scene
(c) Results based on the algorithm from this paper
(d) Detection results based on local convexity-concavity algorithm
This paper’s algorithm performs much better than the algorithm of the partial concave and convex. Due to this algorithm, using the segment’s gradient, the noise is reduced, having an effect on single point gradient. The characteristics’ judgment of the lines’ angle and the separation point’s distance increases the reliability of the ground detection. Finally, applying the Markov random field to the global optimization makes the results with the better robustness.
Under two different kinds of scenes, run the 100-frame data sequences, and compare the accuracy between the different algorithms’ results and the artificial-markers true value: TP is the number of ground points correctly marked. FP is the number of obstacle points marked as ground points. FN is the number of ground points marked as obstacle points. TN is the number of obstacle points correctly marked.
As is shown in Figure 10, the horizontal axis represents the frame number, and the vertical axis represents the accuracy rate. The upper curve shows the accuracy of the obstacle detection based on the algorithm we put forward, and the lower curve shows the accuracy based on the local convexity-concavity algorithm. Through this paper’s multiple characteristics of global optimization, cushion the negative impact coming from the vehicle’s posture change and have the high stability. And, in the hybrid cross-country environment, this algorithm is more applicable and has better accuracy.
This paper puts forward the obstacle detection algorithm based on laser radar data of three dimensions in the hybrid cross-country environment. Firstly, here, use the method of the maximum and blurred lines to split every laser-line projection in the x-y plane. Then, -means clustering is applied to integrate the blurred line and use the corner detection algorithm to accurately relocate the endpoint of the integration segment. The third step is related to the unidirectional graph of the Markov random field, which helps in marking and detecting the obstacle area. Lastly, the energy function is calculated by means of analyzing line features and solved by the graph cut. The line segments are finally classified into two categories: the ground and the obstacle.
The method’s performance is evaluated on real data acquired in different outdoor scenes, and the results are compared to those of the existing methods. Our method requires less run time. At the same time, the segmentation results are better applied for classifying and identifying the objects. The algorithm allows for estimating the robust of a deviation between a digital map and the real world. The experiments in the cross-country environment show that this paper’s algorithm has better performance and can provide driving area and obstacle information with better robustness for unmanned ground vehicle in the cross-country environment.
The future work consists of integrating the other segmentation method into the object-identification framework presented in [20, 21]. It will be interesting to see how object classification will benefit from the new segmentation results.
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
The paper is sponsored by Specialized Research Fund for the Doctoral Program of Higher Education (20110041120024); National Natural Science Foundation Project (51205038); and Fundamental Research Funds for the Central Universities DUT13JS14 and DUT13JS02. Finally, the authors are most grateful for all the friends they made during the research and the events and the many journalists who covered this event.
B. Z. Yao, P. Hu, X. H. Lu, J. J. Gao, and M. H. Zhang, “Transit network design based on travel time reliability,” Transportation Research Part C: Emerging Technologies, vol. 43, part 3, pp. 233–248, 2014.View at: Publisher Site | Google Scholar
B. Yao, P. Hu, M. Zhang, and S. Wang, “Artificial bee colony algorithm with scanning strategy for the periodic vehicle routing problem,” Simulation, vol. 89, no. 6, pp. 762–770, 2013.View at: Publisher Site | Google Scholar
A. Rankin, A. Huertas, and L. H. Matthies, “Stereo-vision-based terrain mapping for off-road autonomous navigation,” in Unmanned Systems Technology XI, 733210, vol. 7332 of Proceedings of SPIE, Orlando, Fla, USA, April 2009.View at: Publisher Site | Google Scholar
M. E. Mills, “Challenges to the acceptance and proliferation of tactical UGVs,” RUSI Defence Systems, 2007.View at: Google Scholar
C. Glennie, “Calibration and kinematic analysis of the velodyne HDL-64E S2 lidar sensor,” Photogrammetric Engineering and Remote Sensing, vol. 78, no. 4, pp. 339–347, 2012.View at: Publisher Site | Google Scholar
M. Haselich, R. Bing, and D. Paulus, “Calibration of multiple cameras to a 3D laser range finder,” in Proceedings of the IEEE International Conference on Emerging Signal Processing Applications (ESPA '12), pp. 25–28, IEEE, Las Vegas, Nev, USA, January 2012.View at: Publisher Site | Google Scholar
M. Himmelsbach, F. V. Hundelshausen, and H.-J. Wuensche, “Fast segmentation of 3D point clouds for ground vehicles,” in Proceedings of the IEEE Intelligent Vehicles Symposium (IV '10), pp. 560–565, San Diego, Calif, USA, June 2010.View at: Publisher Site | Google Scholar
C. Guo, W. Sato, L. Han, S. Mita, and D. McAllester, “Graph-based 2D road representation of 3D point clouds for intelligent vehicles,” in Proceedings of the IEEE Intelligent Vehicles Symposium (IV '11), pp. 715–721, June 2011.View at: Publisher Site | Google Scholar
B. Z. Yao, J. B. Yao, and M. H. Zhang, “Multi-step-ahead prediction for tunnel surrounding rock displacement using support vector machine,” Scientia Iranica, In press.View at: Google Scholar
B.-Z. Yao, C.-Y. Yang, J.-B. Yao, and J. Sun, “Tunnel surrounding rock displacement prediction using support vector machine,” International Journal of Computational Intelligence Systems, vol. 3, no. 6, pp. 843–852, 2010.View at: Publisher Site | Google Scholar
M. Montemerlo, J. Becker, S. Shat et al., “Junior: the Stanford entry in the urban challenge,” Journal of Field Robotics, vol. 25, no. 9, pp. 569–597, 2008.View at: Publisher Site | Google Scholar
M. A. Powers and C. C. Davis, “Spectral LADAR as a UGV navigation sensor,” in Laser Radar Technology and Applications XVI, vol. 8037 of Proceedings of SPIE, June 2011.View at: Publisher Site | Google Scholar
O. Grigore-Müler, M. Barbelian, J. Arhip, and M. Jurba, “Intelligent image data processing for acquiring target in a multisensor platform,” in Proceedings of the 12th International Conference on Optimization of Electrical and Electronic Equipment (OPTIM '10), pp. 981–988, Brașov, Romania, May 2010.View at: Publisher Site | Google Scholar
B. Yao, C. Yang, J. Hu, J. Yao, and J. Sun, “An improved ant colony optimization for flexible job shop scheduling problems,” Advanced Science Letters, vol. 4, no. 6-7, pp. 2127–2131, 2011.View at: Publisher Site | Google Scholar
I. Debled-Rennesson, F. Feschet, and J. Rouyer-Degli, “Optimal blurred segments decomposition in linear time,” in Discrete Geometry for Computer Imagery, vol. 3429 of Lecture Notes in Computer Scienc, pp. 371–382, Springer, Berlin, Germany, 2005.View at: Publisher Site | Google Scholar | MathSciNet
O. François, S. Ancelet, and G. Guillot, “Bayesian clustering using hidden Markov random fields in spatial population genetics,” Genetics, vol. 174, no. 2, pp. 805–816, 2006.View at: Google Scholar
R. Logeswaran, D. Kim, J. Kim, K. Jung, and B. Song, “Graph-cut energy minimization for object extraction in MRCP medical images,” Journal of Medical Systems, vol. 36, no. 1, pp. 311–320, 2012.View at: Publisher Site | Google Scholar
Y. Boykov and V. Kolmogorov, “An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 9, pp. 1124–1137, 2004.View at: Publisher Site | Google Scholar
B. Douillard, J. Underwood, N. Kuntz et al., “On the segmentation of 3D lidar point clouds,” in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA '11), pp. 2798–2805, May 2011.View at: Publisher Site | Google Scholar
Y. Zhao, J. Li, L. Li, M. Zhang, and L. Guo, “Environmental perception and sensor data fusion for unmanned ground vehicle,” Mathematical Problems in Engineering, vol. 2013, Article ID 903951, 12 pages, 2013.View at: Publisher Site | Google Scholar
Y. Zhao, H. Xu, L. Guo, L. Li, and M. Zhang, “Research of obstacle recognition technology in cross-country environment for unmanned ground vehicle,” Mathematical Problems in Engineering, vol. 2014, Article ID 531681, 13 pages, 2014.View at: Publisher Site | Google Scholar