Abstract

Aiming at the problem of radar target recognition of High-Resolution Range Profile (HRRP) under low signal-to-noise ratio conditions, a recognition method based on the Constrained Naive Least-Squares Generative Adversarial Network (CN-LSGAN), Short-time Fourier Transform (STFT), and Convolutional Neural Network (CNN) is proposed. Combining the Least-Squares Generative Adversarial Network (LSGAN) with the Wasserstein Generative Adversarial Network with Gradient Penalty (WGAN-GP), the CN-LSGAN is presented and applied to the HRRP denoise. The frequency domain and phase features of HRRP are gained by STFT in order to facilitate feature learning and also match the input data format of the CNN. These experimental results show that the CN-LSGAN has better data augmentation performance and can effectively avoid the model collapse compared to the generative adversarial network (GAN) and LSGAN. Also, the method has better recognition performance than the one-dimensional CNN method and the Long Short-Term Memory (LSTM) network method.

1. Introduction

HRRP is the vector sum of all scatters electromagnetic echoes, which can reflect the geometric structure, scatters point distribution, and other characteristics of the target. Also, it is easy to be obtained and processed, so the radar target recognition of HRRP has become a research hotspot that has attracted much attention in the academic community [111]. In recent years, with the large-scale rise of deep neural networks, deep learning provides new ideas for the research of radar automatic target recognition (RATR) of HRRP. Compared to the traditional recognition method, the HRRP target recognition method of deep learning can avoid the excessive use of manual rules to extract the target features and acquire high-order features. The recognition rate of the Deep Belief Network (DBN) reached 92.8% [12]; the CNN was also applied to RATR [13, 14], and the recognition rate was greatly improved; the Recurrent Neural Network (RNN) [15] and LSTM [16] had both achieved good recognition rates. However, in these actual application scenarios, HRRP acquired contain noise, which affects the amplitude of HRRP. When the signal-to-noise ratio of the data is low, the amplitude of these data differs greatly from the amplitude of the real data, and when the features are extracted, the extracted features also differ greatly from the features of the real data, thus interfering with the recognition results. So, HRRP needs to be enhanced in order to improve the signal-to-noise ratio. The GAN is a deep-learning model widely applied to data augmentation, and it was used for image blind denoising [17] and speech augmentation [18, 19]. But, it has not been applied to HRRP augmentation. HRRP is denoised only by an Auto Encoder (AE) [20].

To solve the problem of radar target recognition of HRRP under low signal-to-noise ratio circumstances, we proposed a new kind of generative model, called the CN-LSGAN. The CN-LSGAN can generate data similar to the real data with low signal-to-noise data as input, thus improving the signal-to-noise ratio of the data. In addition, in order to simulate the noisy HRRP data in actual application scenarios, we added different degrees of Gaussian white noise to the real data and then used the CN-LSGAN to enhance the data to improve the SNR. Then, we used the obtained data for target recognition. In order to improve the recognition effect as much as possible, time-frequency analysis is performed on the data before the recognition, and the data are converted from one-dimensional data to two-dimensional data. The obtained time-frequency data contain more features. In the target recognition task, the more the data features, the better the recognition effect is generally. Finally, the obtained time-frequency data are recognized by the CNN, and the recognition result is obtained. In this paper, the following contributions have been achieved:(1)Improve the LSGAN to solve the model collapse problem(2)Improve the signal-to-noise ratio of HRRP data by the CN-LSGAN(3)Use the combination of STFT time-frequency analysis and shallow network CNN architecture to test the recognition rate of the enhanced data by CN-LSGAN

The rest of this paper is organized as follows: The second part introduces the proposed method in detail, mainly including the illustration of the CN-LSGAN, STFT, and CNN; the third part chiefly focuses on the experimental results and analysis; and the fourth part concludes the paper.

2. Time-Frequency Analysis and Target Recognition of HRRP Based on the CN-LSGAN, STFT, and CNN

2.1. Design of the CN-LSGAN

The GAN is a generative model designed by Goodfellow [21]. The model consists of a generator (G) and a discriminator (D), as shown in Figure 1.

The input of is real data or , and the output of is 1 or 0. The adversarial training is carried out between and , so that approaches the real data. When cannot determine the source of the input after continuous adversarial training and iterative optimization, it is considered that has learned the real data distribution.

The loss function of the GAN iswhere is the expectation of the distribution function, is the real data distribution, and is the noise distribution.

The real data are the clean HRRP, and the input of is the noisy HRRP, as is shown in Figure 2. Through the adversarial training, the data generated by are similar to the clean HRRP, and then, the denoise purpose is achieved.

The LSGAN [22] is used instead of the GAN in this paper, and the loss function of the LSGAN is shown as follows:

Compared to the GAN, the LSGAN uses least-squares loss in place of cross-entropy loss, which effectively solves the gradient disappearance problem during the model training stage. However, the LSGAN also has shortcomings; the LSGAN has a large output freedom degree and is prone to model collapse. These problems will lead to poor quality and low diversity of generated samples. In terms of these problems, a constraint item is, respectively, added to the loss function of the LSGAN's and , and the two constraint items are shown as follows:

Through the improvement of the LSGAN, formula (2) can be rewritten as follows:

In formula (5), and represent the adjustment weights, which adjust and , respectively. can solve the model collapse, and can reduce the output freedom degree of G. The LSGAN with the abovementioned constraint items is called the CN-LSGAN.

Both and of the CN-LSGAN adopt the fully connected network structure, as shown in Figures 3 and 4 .

2.2. Time-Frequency Analysis Based on STFT

Time-domain analysis of HRRP only characterizes the amplitude of the signal, which only provides limited information for feature learning. In contrast, the spectral feature is a two-dimensional (time and frequency) representation of HRRP, which includes the frequency-domain characteristics of the target and reflects the phase information. STFT is used for the time-frequency analysis of HRRP, and the window function is Hanning window. The width of the window affects the relative resolution of the time domain and frequency domain. In order to make a trade-off between the time-domain resolution and the frequency-domain resolution, the adjacent signal segments are superimposed on each other. According to the HRRP characteristics, the overlapping length between adjacent signal segments is selected to be half of the length of each segment, and Hanning windows with window lengths of 32, 16, 8, and 4 are used for STFT.

2.3. Design of the CNN

The dimension of the time-frequency data obtained by the time-frequency analysis for HRRP is (2, 64), and the CNN is designed according to the dimension. The model consists of two convolutional layers, a pooling layer, two fully connected layers, and a SoftMax classifier, and the sizes of the convolutional kernel of two convolutional layers are 3∗3, as shown in Figure 5.

3. Experimental Results and Analysis

3.1. Experimental Data

The clean HRRP are gained by the electromagnetic simulation technology, which has simulated five kinds of aircraft (Su27, F16, M2000, J8II, and J6). There are 18,000 samples for each aircraft, so there are a total of 90,000 samples. In order to verify the effectiveness of the denoise method based on the CN-LSGAN, Gaussian white noise is added to the clean HRRP, and six kinds of noisy HRRP data with signal-to-noise ratios of 0 dB, 5 dB, 10 dB, 15 dB, 20 dB, and 25 dB are obtained, where, the clean HRRP is called dataset A, the noisy HRRP is called dataset B, and the HRRP generated by the generative model is called dataset C; these three datasets are divided into a training set and test set at a ratio of 9:1.

3.2. HRRP Denoise

Figure 6 shows four samples of dataset A, and Figure 7 shows four samples of dataset B with a signal-to-noise ratio of 10 dB.

Dataset B with a signal-to-noise ratio of 10 dB is input to the GAN, LSGAN, and CN-LSGAN, respectively. Three kinds of data are generated, and they are shown in Figures 810 .

Cosine similarity and Mean Square Error (MSE) among these data samples are technical indexes which indicate the probability of model collapse. They are calculated, respectively, for three kinds of data generated by the GAN, LSGAN, and CN-LSGAN. The calculated data are shown in Table 1.

When cosine similarity is equal to 1 or the MSE is equal to 0, it means that the data samples in a certain dataset are very similar, which means that model collapse has occurred. From Table 1, it can be seen that the cosine similarity of the data generated by the GAN and LSGAN is both 1.0 and the MSE is equal to or very close to 0, which means that the data generated by the GAN and LSGAN are very similar and model collapse has occurred.

Pearson Correlation Coefficient (PCC) and Grey Relational Analysis (GRA) are used to evaluate the quality of generated data and denoise performance of different generation models. The measured data are shown between four kinds of data and the clean HRRP, as shown in Table 2.

When the PCC is equal to 1 or the GRA is equal to 1, the generated data are the same as the clean data, which means that the quality of the generated data and the denoise performance are great. The PCC value of the CN-LSGAN is slightly lower than the PCC value of the LSGAN, but is higher than the PCC value of the GAN. Also, the GRA value of the CN-LSGAN is much larger than that of the other two networks. From Tables 1 and 2, it can be seen that the CN-LSGAN not only effectively solved the model collapse problem but also generated higher data quality and better denoise performance than the GAN and LSGAN.

3.3. Time-Frequency Analysis

STFT is used for the HRRP time-frequency analysis. When the window length is large, the time resolution is poor and the frequency resolution is high. The situation is the opposite when the window length is narrow. According to the sample characteristics, the Hanning windows with window lengths of 32, 16, and 4 are used, and the obtained time-frequency data are shown in Figure 11.

Data set A performed STFT under the Hanning windows of four window lengths, and the obtained time-frequency data are recognized by the CNN; the result is shown in Figure 12.

When the window length of the window function is 4, the recognition rate of the obtained time-frequency analysis results can reach 99.8%, so the optimal STFT window function length is 4.

3.4. Target Recognition

The enhanced data by the CN-LSGAN are transformed into two-dimensional data by STFT and then recognized by the CNN. A variety of methods are used to recognize the noisy HRRP data (dataset B), and these recognition rates are shown in Figure 13.

These recognition rates of three methods are compared to the proposed method. The first method is the one-dimensional CNN method, the second method is the LSTM method, and the third method is the STFT-CNN method, that is, STFT is combined with the two-dimensional CNN. It can be seen that the proposed method based on the CN-LSGAN, STFT, and CNN has a better recognition performance than the other three recognition methods. Also, the other three recognition methods are greatly affected by the signal-to-noise ratio, and when the data signal-to-noise ratio is low, these recognition rates are low. By comparing the recognition rate curve of the first method to the curve of the third method, the recognition rate of the third method is slightly lower than that of the first method when the signal-to-noise ratio is 0 dB, while under other signal-to-noise ratios, the recognition rates are higher than the those of the first method. It shows that the third method can improve the recognition performance by combining STFT with the two-dimensional CNN.

4. Conclusions

This paper described an HRRP target recognition method based on the CN-LSGAN, STFT, and CNN, which effectively solves the recognition problem under low signal-to-noise ratio conditions. Among them, the CN-LSGAN is obtained through adding a constraint item to and of the LSGAN, respectively. The CN-LSGAN not only solves the problems of excessive freedom degrees of GAN output, poor quality of generated data, and insignificant denoise performance but also solves the model collapse problem of the GAN and LSGAN. By the means of STFT which analyzes the time-frequency characteristics of HRRP, the frequency-domain features and phase features of the target are introduced, and more features are helpful in the feature extraction and target recognition for the CNN. These experimental results show that the method based on STFT and the CNN has better recognition performance compared to the method of one-dimensional CNN and the method of LSTM.

The proposed target recognition method has achieved a high recognition performance based on low signal-to-noise ratio HRRP, but there are still some problems. For example, CN-LSGAN training is unstable, and it is difficult to achieve the trade-off between the time resolution and the frequency resolution by STFT. In future research, the network structure and parameter adjustment of the CN-LSGAN will be further optimized and the Deep Convolution Generative Adversarial Network (DCGAN) will be tried to be combined with the CN-LSGAN. Also, other time-frequency analysis methods such as wavelet transform will be utilized for HRRP.

Data Availability

Experimental data are provided in supplementary materials.

Conflicts of Interest

The authors state that there are no conflicts of interest in the publication of this article.

Acknowledgments

This work was supported by the National Natural Science Foundation (61661035), Jiangxi Natural Science Foundation (20192BAB207001), and Aviation Science Foundation (201920056001).

Supplementary Materials

See Figures 1–13 in the supplementary materials for comprehensive image analysis. For datasets A, B, and C, see text clean, 10db, and generate in the supplementary materials. In addition, for data file description, see text document description in the supplementary materials. (Supplementary Materials)