Abstract

Surface ground-penetrating radar (GPR) is one of the radar technology that is widely used in many applications. It is nondestructive remote sensing method to detect underground buried objects. However, the output target is only hyperbolic representation. This research develops a system to identify a buried object on surface GPR based on decision tree method. GPR data of many basic objects (with circular, triangular, and rectangular cross-section) are classified and extracted to generate data training model as a unique template for each type of basic object. The pattern of object under test will be known by comparing its data with the training data using a decision tree method. A simple powerful algorithm to extract feature parameters of object which is based on linear extrapolation is proposed. The result showed that tested buried basic objects can be correctly predicted and the developed system works properly.

1. Introduction

Surface ground-penetrating radar (GPR) which has evolved as popular technology for nondestructive testing method has been widely used for the detection of underground buried objects since 1980. The interesting applications are for the detection of underground infrastructure [1] and the landmine [2]. GPR transmits short pulses of high frequency electromagnetic into the ground. These waves propagate with a velocity that depends on the dielectric property of medium. If the waves encounter a buried object with different reactive indices, some of the waves are reflected back and the receiver processes them to create a hyperbolic image as the object representation. In order to create an image, the GPR transmits electromagnetic pulses at a certain frequency for a certain time slot and samples the responses as an A-scan corresponding to a single position of the GPR. By moving the GPR in direction, a collection of these A-scans called a B-scan which represent different GPR positions are constructed. Then it is processed to become a hyperbolic image. Ideally, the imaging process of GPR data is described in Figure 1.

Automatic generation of hyperbolic image from B-scan data has been widely reported using various methods and techniques such as Hough transform [3, 4], statistical and probability [5, 6], genetic algorithm [7], neural networks [4, 8, 9], and the latest technique is compressive sensing [10].

Figure 2 shows the B-scan of GPR data for basic circular and rectangular objects. From the image, it is hard to recognize the shape of the object. Image of an object generated by GPR is not corresponding to its geometrical representation.

Regeneration of a geometric model is a much more complex procedure and is not usually attempted [11]. However, because the output target is only a hyperbolic representation, interpretation of a geometric buried object is depends on operator’s expertise and experience.

Recent developments have shown that significant efforts have been focused towards the extraction of meaningful physical interpretation from GPR data. Researches on generating geometrical representation have been reported in [1214]. Unfortunately, these reports are based on diffraction tomography or cross-hole GPR method, which is not practical to drill holes and is time consuming [15].

This research tries to interpret geometrical representation from a B-scan of GPR data based on template matching. The pattern of object under test will be interpreted by comparing its data with the training data using a simple decision tree method.

2. The Proposed System

Proposed system for identification [16] is shown in Figure 3. Firstly an output GPR (a file*a.out) is extracted to generate reflected signal (V/m) as function of time (ns). The difference of the output GPR format and the input format of the C.45 [17] software tool, needs a program to convert those formats. The program is based on Java called DataCreator, NamesCreator, TstCreator, and AhCreator which generates a file*.data, a file*.names, a file*.tst, and file*.ah, respectively. The first two files are needed to generate the training data and the last twofiles are neededto generate and to test the test data.

The C.45 software tool needsabuilt-tree of training data to predict a decision. Generation of built-tree can be done after file*.data and file*.names have been generated. Values of training data are calculated to determine entropy and gain ratio for each attribute. Entropy is used to measure how informative is a node in a tree, and gain ranks attribute to build a decision tree of the training data.

Interpretation of a test data can be done after the file*.tst is generated. Using the built-tree, a tested data record that has unknown attribute values can be classified by estimating the probability of the various possible results.

Finally the ResultShower.jar program converts the output decision of C.45 in text form to visual pattern form to show the visual of the tested unknown object. These steps of the process are summarized in algorithm 1.

As example on this experiment, training data consist of 30 samples of three types (circular, triangular, and rectangular) of object data. For each type object data has 10 different size samples (: 7, 10, 12, 13, 15, 17, 18, 19, 20, and 23). Each sample has 10180 attributes of the file*.data, so the training data has totally 305400 attributes. Figure 4 shows the built-tree of the training data.

2.1. GPR Test Range Model

The model GPR test range is a box with 3 meter length, 3 meter wide, and 2 meter high ( meter), containing a half full of sand. A 1 GHz transmitter and receiver of GPR, which has 10 centimetre fix distance and located 20 centimetre above the sand, moves to direction in 5 centimetre intervals until 1 meter length (20 scan positions). In each interval, the pulse signal with 12 ns width is transmitted. The sand’s parameter relative permittivity is 3.0, conductivity is 10−2 Siemens/meter, and relative permeability is 1. A simple buried object (with three types of patterns: circular, triangular, or rectangular cross-section) is made from perfect conductor material and is lied in the sand at 50 cm depth. Figure 5 shows the model of simulation.

For training purpose, 30 samples of three types (circular, triangular, and rectangular) of object data are generated. Each type of object data has 10 different size samples. Test data consist of 12 samples of the three types of objects in which each type has 4 different sample sizes.

Successful GPR modelling based on the finite-difference time-domain (FDTD) theory [18] has been reported by many authors [1922].

Giannopoulos [19]has developed GPR modelling facilities called GprMax2D/3D ver. 2.0 which are used to generate the GPR data. The output simulator can be a file*b.out and file*a.out to represent hyperbolic image in binary file and ASCII file, respectively. Sampling experiment by using real GPR tool will be done in GPR test range ICTR-ITB laboratory to validate and verify the GPR data simulator. An example of input and output GPR data simulator is shown in Figure 6. The research takes files*a.out as input of the proposed system to generate training data and tests data files.

2.2. Testing Scenarios

There are scenarios to cover both the same environment test condition and the different environment test condition. The test data for each scenario consist of 12 samples of the three types of object in which each type has 4 different sample sizes (: 8, 9, 21, and 22). The total samples test data for all scenarios are 96 samples. These scenarios are as follows.(1)Tested and training objects are in the same 50 centimetre depth condition.(i)Scenario number 0. The sand and the tested object have the same parameters as in the training data (the sand: , , and the object: perfect conductor). Tested and training objects have the same environmental condition.(ii)Scenario number 1. The sand has the same parameter as in training data and the tested object has different material (the sand: , , and the object: fresh water, ).(iii)Scenario number 2. The tested object has the same parameter as training data and the sand has different permittivity (sand: , , and the object: perfect conductor).(iv)Scenario number 3. The sand medium and the tested object have different parameters compared to training data (sand: , , and the object: fresh water, ).(2)Tested and training objects are in the different depth.(i)Scenario number 4. The sand and the tested object have the same parameters as in the training data (the sand: , , and the object: perfect conductor), and depth of the tested object is 40 cm.(ii)Scenario number 5. The sand and the tested object have the same parameters as in the training data (the sand: , , and the object: perfect conductor), and depth of the tested object is 55 cm.(iii)Scenario number 6. The sand has the same parameter as in training data and the object to be tested has different material (the sand: , , and the object: fresh water, ), and the depth of the tested object is 40 cm.(iv)Scenario number 7. The sand has the same parameter as in training data and the tested object has different material (the sand: , , and the object: fresh water, ), and depth of the tested object is 55 cm.

2.3. The Test Result Based on Algorithm Number 1

An example of interpretation of the test result for scenario number 6 is shown in Figure 7. This scenario in which tested and training objects are in the different environmental condition, generates the extreme result, that is, the most of interpretation is false. On the other hand, scenario number 0, in which tested andtraining objects are in the same environmental condition, generates the best result, that, is all interpretation is true.

The complete test results are summarized in Table 1. From 96 tested data samples, the system can truly well interpret only for 50 samples (52%). Algorithm 1, which represent direct comparison between test and training data with template matching method, has inaccurate interpretation for condition where the tested and the training object have different environmental condition.

3. The Feature Extraction

An effort to enhance the performance of the system is done by adding a feature extraction of the object to algorithm 1. This improved the algorithm called as algorithm 2.

Investigation of reflected signal received by receiver for 30 samples of three types of (circular, triangular, and rectangular) object of training data carries out an important information, that is, each type of object has a unique pattern signal. In the same type of object, different size will generate different amplitude signal with the same pattern signal. This phenomenon can be seen in Figure 8.

Afterward, the unique pattern signal of object is extracted as the objectfeature, that is, arepresentation of the object. To simplify the feature and to reduce the number attribute of complex extracted signal pattern, the time variable is excluded by sorting values of the from the smallest to the largest, and then a linear extrapolation is applied. Changing the testing condition (, and depth) will shift the position in time axis and change the amplitude signal without changing the pattern of object.

By this process, each reflected signal pattern of an object, which in algorithm 1 has 10180 attributes, will be transformed to a straight line that only has 3 attributes, called slope of the line (), intersection point of the line with the axis (), and standard deviation (). Furthermore, these attributes are used as the object representations are processed using template matching method by comparing attribute of tested data to training data, the same as in algorithm 1.

Figure 9 shows algorithm 2 illustrated in each step, starting from extracting the reflected signal, extracting the feature, and matching the tested and training attribute data using decision tree to generate the interpretation result.

3.1. The Test Result Based on Algorithm Number 2

The system based on algorithm 2 is tested by the test data and scenarios the same as in the system based on algorithm number 1. An example of interpretation of the partial test result for scenario number 5 is shown in Figure 10 and all the test result is summarized in Table 2. For 96 test samples in different scenarios, the result showed that the system correctly interprets all the samples. Algorithm number 2 has accurate interpretations for all environmental testing conditions.

4. Calculation of the Object Size

Based on reflected signal for 30 samples training data of three types (circular, triangular, and rectangular), the maximum and minimum signals amplitude can be determined. In the same type of object, different size will generate different amplitude signal with the same pattern signal. By linear or logarithmic extrapolation of signals amplitude, a formula to calculate another size of objects in the same object type with similar environmental condition (, , , and ) can be generated as in Figure 11. Unfortunately for the triangular object, the generated formula is constant. Constant relationship between amplitude and size of triangular shape can be due to the backscattered rays which are not directed towards the antenna.

The formula for a tested object has different condition (, , and ) with the training object, which is need to be corrected by a correction factor as can be derived from Figure 12.

Intrinsic impedance, transmission, and reflection factors are as follows: The correction factor for different environmental condition isAs example for scenario number 1, correction factor became For ,  cm and , the factor correction is

Table 3 shows that the correction factor for all scenarios has been investigated in this research.

Using the formula and the correction factor, size of tested object can be predicted. Table 4 shows the prediction result for rectangular and circular tested object. Unfortunately, due to the constant relationship between signal amplitude and object size in generated formula for triangular object, this approach did not success in predicting the triangular tested object. The overestimate or underestimate size prediction for the circular and rectangular objects occur due to the linear or logarithmic approximation of the experimental result graph as shown in Figure 11.

5. Testing Measurement in the Laboratory

Testing data based on sampling experiment using real GPR tool has been done in GPR test range at ICTR-ITB laboratory. The tool consists of test range, GPR scanner, spectrum analyser, and personal computer with the software tools for controlling scanner and processing data. Figure 13 shows the GPR test range in ICTR-ITB laboratory.

Three types objects are tested. These objects are buried objects (circular, triangular, and rectangular cross-section with diameter 9, 20, and 8 cm, resp.). The object made from perfect conductor material is buried in the sand at 35, 25, and 25 cm depth, respectively. Figure 14 shows the measurement result for the circular object.

From Figure 15 which represents the zooming and adding cursor on B-scan, the depth of buried object as the top of hyperbolic line in axis is shown by the cursor on 517, surface of the sand on 491, and position of the floor on 566. The real height of the sand is 100 cm represented by value point and the position of the object is represented by value point. So, the real position of the buried object is  cm depth.

Algorithm number 2 has been applied to the measured tested data. The reflected signal received by GPR receiver that is represented by 20 A-scan of 20 point trace experiment result is extracted to get only the reflected signal of buried object by capturing the reflected signal in the axis range where the hyperbolic line is located and then converting the signal to the binary data in ASCII file for further manipulation.

By sorting values of the binary data from the smallest to the largest and then applying a linear extrapolation, the feature parameter of the buried object can be obtained, that is, , , and .

Matching the feature parameter with the built-tree that has been generated from training data using the C.45 tool as illustrated in Figure 9 will generate a decision of buried object interpretation. The extract feature parameter and the interpretation result of this testing are shown in Figure 16.

The same process is applied to the measured tested data for triangular and rectangular objects, generating feature parameter with values of , , and , and values of , , and , respectively.

The amplitude of extracted signal for circular object is 119 mV/m. Entering this value to the equation results in the size value, of 32.5 cm. The testing experiment condition in laboratory is different from training condition, so the size value should be corrected using correction factor derives from (2) as follows: The size prediction of the test circular object can be derived by multiplying by factor , that is,  cm.

Similar process can be applied to rectangular and triangular tested object. All the test result of measured test data in the ICTR-ITB laboratory can be summarized in Figure 17.

6. Conclusions

This paper proposed a system for identifying and predicting target shape and size of buried basic object on surface ground-penetrating radar system. Feature parameters of object are extracted by using simple proposed algorithm which is based on linear extrapolation. 96 samples of basic objects which are laid underground and have both same and different condition with the data training are tested. Results show that all basic objects can be correctly interpreted and the proposed algorithm number 2 can improve the performance of the shape object interpretation from 52% to 100%. Three sampling experiments of buried object have been done in GPR test range ICTR-ITB laboratory and the result of the test showed that the developed system works properly.

The proposed method is appropriate to be implemented in the industry to enhance the capability of GPR tools. The next output GPR target is not only in a hyperbolic representation but also can directly interpret the shape and the size of the detected object. The main application is for the underground utility infrastructure detection and interpretation such as the telecommunication and electrical cable and duct which have regular shape and size of the rectangular or circular cross-section object.

However, there is a weakness in predicting the size of triangular object due to the constant relationship between amplitude and size of the triangular shape that is generated from B-scan GPR data. For further research, instead of B-scan GPR data, using C-scan GPR data as the system input might solve this problem, even though the generation of C-scan GPR data needs more time.