Abstract

Synthetic aperture Radar (SAR) uses the relative movement of the Radar and the target to pick up echoes of the detected area and image it. In contrast to optical imaging, SAR imaging systems are not affected by weather and time and can detect targets in harsh conditions. Therefore, the SAR image has important application value in military and civilian purposes. This paper introduces the classification of Gaussian process. Gaussian process classification is a probabilistic classification algorithm based on Bass frame. This is a complete probability expression. Based on Gaussian process and SAR data, Gaussian process classification algorithm for SAR images is studied in this paper. In this paper, we introduce the basic principle of Gaussian process, briefly analyze the basic theory of classification and the characteristics of SAR images, provide the evaluation index system of image classification, and give the SAR classification model of Gaussian process. Taking Laplace approximation as an example, several classification algorithms are introduced directly. Based on the two classifications, we propose an indirect multipurpose classification method and a multifunction classification method for two-pair two-Gaussian processes. The SAR image algorithm based on the two categories is relatively simple and achieves certain results.

1. Introduction

Synthetic aperture Radar (SAR) is an active modern high-resolution microwave imaging Radar [1]. SAR is an active Earth observation system, which can be installed on aircraft, satellites, spacecraft, and other flight platforms to observe the Earth all day and in all weather conditions and has a certain ability to penetrate the surface. SAR improves azimuth and range resolution by using synthetic aperture and pulse compression techniques. It breaks the boundary of Radar camera mode in the past and improves the camera ability of the observation area. Compared with other remote sensing imaging systems, the number of synthetic apertures has increased [2]. The Radar imaging system cannot be affected by light and bad weather and can take necessary ground scenes in one day and in all weather conditions. Because of the above advantages, SAR imaging technology is widely used in resource exploration, land exploration, wartime hostile target inspection, etc. and also has great application to people’s livelihood [3]. It has great application potential. When the 5.12 Wenchuan earthquake occurred, it was difficult to obtain rescue information and monitor disasters due to the harsh local climatic conditions. Synthetic aperture Radar can overcome these problems well [4].

With the continuous exploration and development of SAR imaging technology, the demand for SAR image data is getting higher and higher. However, on the other hand, SAR images are quite different from optical images and have the characteristics of correlated noise [5, 6]. This makes the readability of SAR images much stronger than that of optical images, which makes it difficult for scientific researchers to interpret them manually. Therefore, it is difficult to automatically recognize SAR images at present. Automatic Target Recognition (ATR) technology for SAR images is becoming more and more important and becomes an important research direction [7].

In military warfare, with the development of electronic information countermeasure technology, the outcome of electronic warfare has become one of the decisive factors. Especially in the complex electromagnetic environment, it is very important to identify and detect the enemy’s intentional camouflage.

Automatic recognition of conventional SAR images requires complex steps such as filter noise reduction, image optimization, and image classification [8]. For automatic recognition of conventional SAR images, complex steps such as image preprocessing, feature extraction, image classification, and recognition are required. The specific process is shown in Figure 1.

There are too many processing steps, and the speed and accuracy cannot meet the actual military needs. Because of the excellent performance of Gaussian process in image processing, SAR image ATR based on Gaussian process has become the focus of research [9]. Gaussian process is a kind of stochastic process in probability theory and mathematical statistics, which makes it easier to classify and segment images. A large number of studies show that Gaussian process technology can be used to effectively process SAR images [10].

2. SAR Image Classification Algorithm Based on Gaussian Process

2.1. Research Status of Gaussian Process

Machine learning has been studied for more than a century, and many theories have been formed, but the research on Gaussian process is relatively late. Since 1970s and 1980s, Gaussian process has been applied to geostatistics in the name of Gaussian process regression, such as mineral exploration and remote sensing monitoring [11]. The main problem to be solved is the low-dimensional problem of geoscience statistics, and there is a lack of research on the probability of models and the independence of kernel functions.

Hagan first proposed the use of Gaussian process in the regression model in 1978. Later, Devid J. C. Mackey proposed to establish empirical models by nonlinear parameterization of models, such as radial basis function and layer perceptron. Subsequently, bar Shalom and Fortmann used the Kalman filter to explain the stationary one-dimensional Gaussian process [12].

In 1990s, people made a deeper exploration of kernel function in machine learning. A famous example is support vector machine. At this stage, the application of Gaussian process in machine science has attracted some attention [13]. For example, in 1992, Mackey proposed a framework using Gaussian processes by using approximation methods. However, until 1996, Neal found that its nodes obey Gaussian distribution in the research of Bayesian neural network and Gaussian process began to attract wide attention [14].

In recent years, Gaussian process has been applied to predictive control and soft sensor modeling because of its good model reliability and can meet the requirements of industrial field application. At the same time, it has been further studied in the field of machine learning [15, 16].

2.2. Fundamentals of Gaussian Process
2.2.1. Machine Learning Theory

Machine learning breeds strong vitality [17]. This is to find out the rules between data by analyzing the characteristics and association of data and to mine all kinds of information needed by human beings at a deeper level. Several common learning methods are briefly introduced as follows:(1)Description-Based Learning [18]. In a data area, we describe a concrete example, find the main factors associated with the concrete example, and finally perform the process of forming a generalized concept.(2)Analogical Learning [19]. We compare two data regions, find the similarities between them, and explain the unsolved problems of other data regions from the related theories of one data region [20, 21].(3)Concentrate on Your Studies. We reason about the existing instances and summarize the specific hypothesis process. From the special to the ordinary, from the simple to the complex, using the local to the whole method, we summarize the learning.

As can be seen from Figure 2, the machine learning system is always on optimized learning mode. By providing back the results from the executive part to the learning part and optimizing the knowledge base all the time, the whole model is optimized. The most important thing in machine learning is to derive learning patterns through training sample data. Therefore, machine learning needs to first determine the learning objects (X, Y) and training samples, learn the sample data with corresponding learning methods, and derive the learning model. Finally, according to the model, a value Y is obtained for any input X, which is a vector and contains the hidden information in the data, that is, the necessary information. The mathematical representation of machine learning, which selects the function that best represents the training result from the given f (x, a). The selection is trained only based on the sample data and does not depend on prior knowledge of the sample data that is known in advance. Whether the learning model is suitable to be evaluated by the difference L (Y, F (X, A)) between the sample Y value and the value F (X, A) obtained under the machine learning condition is established by

The training sample consists of a probability distribution composed of n pairs of observed data (x1, y1), …, (xn, yn) under independent and identical distribution conditions. In most practical cases, F(x, y) in training is unknown.

The powerful feature extraction ability of CNN depends on the convolution operation of the target image expansion by the convolution kernel. The new value obtained by dot product and sum operation on the image is the eigenvalue of each point pixel [22]. At the level of data structure, both image and convolution kernel are essentially a numerical matrix, in which the size of image matrix is W × W × D and the size of convolution kernel square matrix is F×F×D. D above represents the depth of the image channel, W represents the side length of the image, and F represents the side length of the convolution kernel. A single channel image with depth 1 and width and height 5 and a convolution kernel with side length 3 are used for convolution operation display. The convolution kernel firstly covers the left vertex area of the feature map and carries out dot product and sum operation on the pixel values in the covered area to obtain the pixel values in the red virtual box in the output feature map, which is the first calculation step of convolution. The green virtual box is the coverage area where the convolution kernel shifts to the right by one unit length and then repeats the dot product and summation operation in the first step to obtain the value of pixels in the green virtual box in the output feature map. Here, the unit length of the convolution kernel shifts to the right is called the moving step size, which is represented by S, and this process is the second step of the convolution operation process. When the convolution kernel repeats the above steps and traverses the image matrix from left to right and from top to bottom, it completes a convolution operation process and finally obtains a 3 × 3 output feature map.

By substituting the corresponding numerical value, the value of N is 3, that is, the size of the output characteristic map is 3 × 3. It can be seen from the whole convolution operation process that the convolution kernel in each operation process has not changed, and this operation ensures the invariance of the internal parameters of the kernel, that is, weight sharing, which avoids the problem of excessive increase of parameters. In addition, the above convolution operation still brings some problems, such as the reduction of feature image size and insufficient utilization of image edge information. Facing the above problems, we can use filling strategy at the edge of the image to solve them. The edge of the image is filled with pixels with a length of 1 and a value of 0, and the feature image with a size of 5 × 5 can be obtained by recalculating with a convolution kernel with a size of 3 × 3, which makes the output image keep the same size as the input image and makes full use of the edge pixels of the original image.

2.2.2. Bayesian Theory

Bayesian theory takes the form of

2.3. Classification Decision Theory

The output of the regression model is converted into class probability by response function, and the following model is adopted:

Up to now, there is no correct answer as to which method to choose. Compared with the direct generation method, the indirect reference method has the advantage that when the modeling of is direct, the x dimension is high and the prior density of the class distribution of data is relatively easy to obtain and the practical problem may be more complicated if the direct generation method is adopted, so the indirect reference method is adopted in this paper.

Common optical images are easier to recognize, while SAR images have poor recognition ability because of their features. The synthetic aperture Radar (SAR) imaging system firstly collects the electromagnetic wave reflected from the measured object from the sensor and uses SAR imaging algorithm to reproduce the SAR image in reverse. According to the transmitting and receiving mode of the electromagnetic wave of the sensor, the imaging mode of SAR image can be divided into vertical polarization (V) and horizontal polarization (H). After the two combinations, there are four camera modes: HH, HV, VH, and VV. The color optical image has three channels: R, G, and B. SAR images have different channel numbers according to different shooting methods. SAR images are divided into three types according to the number of image channels: unipolarized, dual-polarized, and multipolarized. As the number of channels in a SAR image increases, it contains more information.

The Radar image can be obtained by processing the reflected electromagnetic wave signal of the detected object. However, SAR images have fatal shortcomings. Coherent point noise produces interference phenomenon when the target rotates, which leads to anomaly. The gray value of the image shakes violently in a certain range, which is bright and dark, and has a sense of wear. The recognition ability of SAR images is reduced, which makes it difficult to recognize SAR images.

When the surface of the photographing area is uneven, electromagnetic signals reflected from a plurality of different objects are superimposed and returned. Because the surface of each object is uneven, the distance between the SAR imaging system and receiver is random and the signal received by the receiver is related to frequency, but the phase and random change are strong. It is going to return one degree.

Assuming that the imaging area is K scatterers and has an echo reflected on each object, phase P, and amplitude A, the echo signal received by the receiver after superposition of all scatterer echoes is as follows:where A and φ represent the amplitude and phase of the echo received by the receiver. From the above formula, it is found that when the Radar imaging system moves, the echo path changes and the corresponding phase changes, which affects the whole echo signal. Because of the shortcomings in photography principle, random correlated speckle noise will be produced during photography. In this way, SAR images also produce randomness. On the other hand, SAR images are particularly sensitive to each other’s position angle. If the same detected object is shot at different angles, it will cause dramatic changes in SAR images.

There is a very complex correlation between coherent point noise and SAR image information, such as gray value and reflection intensity. K distribution, Weibull distribution, and lognormal distribution, which are often used now, explain the statistical characteristics of SAR images.

2.3.1. K Distribution

For high-resolution SAR images, we will use K distribution to describe its probability density:where b is the scale parameter, is the shape parameter, and is the modified Bassel function of the second kind.

The expectation and variance of the probability density function are as follows:

2.3.2. Weibull Distribution

Weibull distribution is more suitable for describing SAR images, and its probability density function can be expressed as

Expectations and variances are as follows:

2.3.3. Lognormal Distribution

When SAR imaging, there is a main target to be detected, which is suitable to be described by lognormal distribution. The probability density function is as follows:where µ denotes scale parameter and s denotes shape parameter. Expectations and variances can be expressed as

We should use different distribution models to fit the distribution according to the actual situation.

2.4. Classification Accuracy Evaluation System

The classification accuracy of SAR images is mainly based on the measured images on the ground. With the development of SAR remote sensing technology and the complexity of various applications, the accuracy evaluation of SAR image classification becomes more and more important.

2.4.1. Confusion Matrix

The real classification of land surface is expressed by confusion matrix. The values of each column are the same as the true aberration of the surface. The number of categories corresponds to the category image.

2.4.2. Accuracy of Users and Producers

The accuracy of a particular category classification can be determined by calculating the accuracy of a user (UserAcery, UA) and a producer (Producer Accuracy, PA). The specific formula is as follows:

2.4.3. Kappa Coefficient

The Kappa coefficient can be calculated by the following formula:

2.5. SAR Image Classification Algorithm Based on Gaussian Process

Taking the second classification as an example, this section briefly introduces the SAR classification algorithm of Gaussian process. The training data is given, the predicted output takes into account the target value of the sample, and the Gaussian noise usually including zero mean variance is specifically , where , that is,

Gaussian process extends multi-Gaussian distribution to an infinite number of random variables. Its kernel function is k:

Find the edge distribution of p(y):

When a similar function is configured as a Probit function and the Gaussian result is converted to a probability value, the target variable y follows the Bernoulli distribution:

Let be denoted as vector fN + 1. For fN + 1, the form iswhere

For the classification problem, the distribution form is

According to the two classifications, multipurpose classification is carried out. Most of them adopt a pair of multifunctional classifications.

The logarithmic likelihood function of Gaussian process is as follows:

The partial derivative of logarithmic likelihood function for hyperparameters is

3. SAR Image Classification Model Based on Multitarget Strategy Gaussian Process

Most of Gaussian process classification algorithms take two classifications as examples, and SAR images include many target categories. Firstly, this paper introduces the direct multipurpose classification and puts forward the indirect multipurpose classification according to the two classifications.

3.1. Multipurpose Classification Algorithm for Direct Gaussian Processes

Suppose there is a data set (x, y) consisting of n training samples, x is the observation data, and the corresponding category label is y, and is a potential function that divides the whole sample n into vector C. In the covariance of C, the classification is sometimes denoted by C below. The probability distribution of F is F ∼ N (0, K). It is assumed that the computation between F is independent, K belongs to block diagonal structure, Y is a vector with the same dimension as F, and for I = 1, …, N, its corresponding value is 1 and otherwise, it is 0.

The output of the Softmax function at training point i is :

At this time, a record c with the same length as f is included. The vector of i represents n. The multifunctional learning formula is a nonstandard delayed classification label.

Derivation of f on both sides of equation (26) results in

It can be seen from the extreme value formula that when formula (27) is zero, is the maximum value. Derivation is performed again, and it satisfies

We have

The Newton method is used to search ψ, which satisfies the following conditions:

If you need to predict the test point , you can get a posterior distribution . In general, use

Since and follow Gaussian distribution, follows Gaussian distribution according to the properties of Gaussian distribution. The projected average values for category C are as follows:

The final equation is obtained from equation (31). By defining the matrix of Cn × C,

Equation (32) is transformed into equation (33).

Or, we can get the following:

For both classifications, it is necessary to transform the average value of Gaussian distribution. Edge likelihood can be obtained as follows:

Suppose W is not a diagonal matrix and K is a block diagonal matrix of Cn × Cn size. By analyzing the inherent structure of the matrix, it can be modified before using K−1 + W:

The matrix inversion lemma used in this case is is inverted. First, the inverse transformation of the W matrix is applied to the matrix of equation (34).where ; the matrix of Cn × n size composed of the identity matrix of is .

Bring in

3.2. Multipurpose Classification Algorithm for Indirect Gaussian Processes

The one-to-multipurpose classification described in the above section is one of the multipurpose classifications of indirect Gaussian processes. Support vector machine (SVM) can only deal with two classification problems at first, but it cannot deal with many classification problems directly. This section refers to SVM promotion mode and is based on two classification algorithms. The two-to-two Gaussian model of class C classification is shown. This method allows application from two classifications to multiple classifications.

For a given training set D(X, Y) and test data , and . In order to train a classifier between two classes, for M class problems, two classifiers need to be constructed in the training stage. Each classifier collects any two types of data for training. The type of test data is determined by the maximum counting method. The method introduced above is fast and has high classification accuracy. The algorithm of this method is as follows:(1)Initialize the input values for each class(2)In the training stage, select any two data for training and construct two classifiers(3)In the training stage, train two classifiers finally between two classes(4)In the test phase, when the test data are classified and discriminated in the training between class i and class j, if the output result of these two classifiers is 1, it belongs to class i of test data

During the test phase, the test data are calculated. Among the various other numerical values, the largest class of numerical values is the class of .

4. Experiment

The data used in this experiment are from the website of the National University of Bus, Spain. The actual classification samples of the experimental image include 9 kinds of samples, but it can be seen that the data when the samples are discarded are external black stripes.

Figure 3(a) is an original image of Pavia University showing the distribution of the samples shown in Figure 3(b), with irregular land distribution and high degree of crushing. Specifically, the ratio of training samples to prediction samples is about 1 : 8.

Table 1 shows the specific category information and corresponding quantity of samples.

In this experiment, the comparison between indirect multipurpose classification algorithm and support vector machine verifies the direct multipurpose classification algorithm. The experimental results are shown in Figure 4. The approximation algorithms used in experiments are all Laplace approximation methods.

As can be seen from Figures 4 and 5, even if the direct or indirect Gaussian process is applied to SAR classification, the classification accuracy is better than SVM. It is better to classify more kinds of SAR than to classify SAR directly according to the indirect strategy of two classifications. The classification accuracy of SAR images is preferably one-to-many.

This paper compares the classification methods for evaluating performance in other MSTAR datasets including LNP, imported-GAN, MGAN, and triple-GAN. LNP assumes that each data point can obtain information from adjacent data and reconstruct it linearly. Triple-GAN uses three network modules, discriminator, generator, and classifier, to achieve the best results of the deep countermeasure model. Indirect Gaussian process classification algorithm achieves the best results in classification and can use the latest training model. MGAN improves images generated using dynamically tuned multirecognizer architecture and improves classification accuracy. The results of typical machine learning methods, for example, SVM and Adaboost, are compared. In the comparative experiment, the same number of labeled samples were used to randomly select samples, and the experimental results were compared with the quantitative results of the paper. The method we use is indirect Gaussian process classification algorithm. The comparison results are shown in Table 2. Figure 6 shows the classification accuracy of methods with different amounts of tag training data.

5. Conclusion

This paper describes the characteristics of SAR images and the difficulty of processing and extracts the advantages of Gaussian process classification in SAR images. Finally, the evaluation index of image classification is given, and a certain standard of Gaussian SAR classification algorithm is given to evaluate the classified image and its basic program is introduced.

Based on the analysis and introduction of machine learning theory and Bessel principle, this paper expounds the basic principle of Gaussian process. This paper introduces the basic theory of classification, then leads to the fact that classification usually needs to be solved, and discusses it in detail. This paper describes the characteristics of SAR image and the difficulty of processing and extracts the advantages of Gaussian process classification in SAR image processing. Finally, we give the evaluation index of image classification, give a certain standard of Gaussian process SAR classification algorithm to evaluate the classified image, and introduce its basic program.

Data Availability

The experimental data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding this work.

Acknowledgments

This work was sponsored in part by Henan Province Higher Education Training Program for Young Core Teachers, under Grant no. 2021GGJS192.