Abstract

Cognitive communication behavior is becoming a research hotspot in the field of communication confrontation. In theory, the behavioral intention of noncooperating parties can be obtained by analyzing communication signals. Considering the complexity of the actual electromagnetic environment, even when the signal-to-noise ratio (SNR) is low, a certain accuracy still needs to be guaranteed. In this paper, according to five types of physical burst waveforms defined by the shortwave radio interoperability standard, a signal feature extraction method based on autocorrelation spectrogram features is proposed, and a two-input convolutional neural network (CNN) for classification is designed to improve the identification ability of shortwave communication behavior. The experimental results illustrate that the five kinds of shortwave radio communication behaviors can be accurately identified even when the noise is large. The research in this paper can directly analyze the communication behavior through physical layer signal without demodulation, which has the ability to grasp the communication behavior of the shortwave radio station in real time.

1. Introduction

Behavior is the active response of an organism such as a person or an animal to the internal and external environment under the control of thought. Research on behavior recognition based on machine learning (ML) [13] and probabilistic methods [47] has been developing rapidly. In recent years, the proposal of cognitive electronic warfare (CEW) requires that communication countermeasures be intelligent, and the concept of cognition has begun to receive attention in the field of communication. Research on cognitive radio [8], cognitive radar [9], cognitive Internet of Things (IoT) [10], and cognitive electronic jamming [11] has developed rapidly. Cognitive communication behavior is a potentially hot area. In electronic warfare, if enemy behaviors can be analyzed in real time, we can take actions in a timely manner, which is conducive to gaining advantages.

In 2010, a project called Behavioral Learning for Adaptive Electronic Warfare (BLADE) [12] was proposed to meet the needs of intelligent electromagnetic spectrum operations. The importance of behavioral learning was emphasized in the field of electronic warfare for the first time. It is meaningful to carry out behavioral recognition research based on wireless signals from communication radiation sources (CRSs). CRS itself is a concept without biological characteristics, but because of human operation, it has observable behavioral information that reflects the behavior of the operator. Taking a radio station as an example, its operator will perform operations such as calling, data transmission, image transmission, and switching on and off. The radio station will change its working mode and send different signals according to what the operator does; therefore, it has observable behavioral information. By analyzing the wireless signals obtained by reconnaissance, a series of behavioral information can be unscrambled, and the operator’s task, situation, status, and other key information can be understood.

In recent years, machine learning (ML) has been applied in the field of signal processing, including signal modulation pattern recognition [1316], CRS individual recognition [1719], communication specific signal type recognition [20, 21], and other fields, and a series of achievements have been achieved. However, research on CRS behavior recognition is still in the beginning stage. There have been very few relevant studies in the published literature thus far.

Research on communication behavior can be divided into communication group behavior [22, 23] and communication individual behavior [2436]. The former mainly refers to a series of combat tasks coordinated by groups equipped with CRS, while the latter mainly refers to the working modes, tasks of a certain CRS, and the actions of the person or unit operating the CRS. Individual communication behavior is the focus of our research. At present, relevant research is mainly based on the following three methods. The first method is data mining for spectrum monitoring data to gain communication relationships [2429]. The second method is to obtain the communication behavior by parsing the communication protocol. [30, 31]. The third method transforms the problem of communication behavior recognition into the problem of radio signal classification recognition by analyzing the behavior connotation of intercepted radio signals [3236]. Liu et al. proposed a method to discover the communication relationship of ultrashort-wave radio stations [24]. In the literature [25], DBSCAN is improved and used to mine the hidden communication behavior of spectrum data. By mining the physical characteristics of the spectrum monitoring signal and analyzing their statistical laws, individual communication behavior can be known [26]. Mining of communication relations in the case of missing and disordered data is realized in the literature [27]. Pan et al. proposed a communication behavior structure mining algorithm, which can be used without analyzing the signal content to obtain the communication relation and the communication node of the target field communication [28]. Cheng et al. proposed a method called DECBR, which DCGAN was used for data enhancement and realized the identification of communication behaviors under small sample conditions [29]. Zhang et al. obtained the behavioral intention of the frequency hopping transmitter by analyzing the TDMA protocol [30]. You and Ge proposed a regular expression-based Fetion communication message identification method through the analysis of specific fields in the Fetion protocol, so as to achieve the purpose of inferring the Fetion communication behavior [31]. Zhou et al. observed the differences in the deep-level features of shortwave burst waveforms and demonstrated the feasibility of directly identifying shortwave communication behaviors based on physical layer signals [32]. Wu et al. used the improved LeNet to identify five communication behaviors [33]. Wu et al. [34] used the improved one-dimensional DenseNet to identify seven automatic link establishment (ALE) behaviors under the same types of burst waveform BW0. Furthermore, literature [35] conducted visualization research of the seven ALE behaviors mentioned in literature [34] and observed the deep-level feature differences of the seven ALE signals. Based on this, ACGAN was used to recognize seven types of ALE behavior under small sample conditions [36].

In this paper, the relationship between burst waveforms and communication behaviors is first introduced according to the five types of burst waveforms specified by the MIL-STD-188-141B standard, and then a feature extraction method is proposed for extracting the features of the autocorrelation spectrogram of the signal. The features extracted by this method are not easily polluted by noise, especially under the condition of low SNR, and more original features of the signal can be presented on the spectrogram. The main contributions of this paper are as follows: (i)we use the autocorrelation method to preprocess the signal and convert the noise-containing signal into an autocorrelation time series. This method is simple to calculate, can effectively reduce the interference of Gaussian white noise, and can facilitate subsequent feature extraction(ii)we perform bispectral transformation on the autocorrelation time series and extract bispectral features. The higher-order spectral domain can present some feature differences that cannot be presented in the time domain. It is convenient to save the extracted features in a format similar to a picture, which is beneficial to the processing of the CNN. Compared with traditional method, this method can significantly improve the recognition accuracy under low SNR conditions(iii)we design a two-input CNN for classification recognition. The extracted autocorrelation spectral features are input into the two branches of the neural network, and then the outputs of the two branches are processed. This two-input CNN can further extract features and reduce the possibility of incorrect decisions, meanwhile, it does not require much additional computing time

2. Background

The third-generation shortwave communication protocol standard MIL-STD-188-141B is proposed to develop adaptive short-wave communication, realize shortwave link establishment, link maintenance, shortwave network construction, and improve communication quality. Corresponding burst waveforms are defined in this standard for the various kinds of signaling required in the short-wave communication system, to meet distinctive requirements for payload, duration, time synchronization, acquisition, and demodulation performance in the presence of noise, fading, and multipath. Figure 1 shows its basic architecture.

Among them, the physical layer defines five burst waveforms to perform different tasks, including BW0, BW1, BW2, BW3, and BW4. In Figure 2, the details of the five types of burst waveforms are summarized, which contain signal parameters, application scenarios, and other information.

Further, the generation process of the five burst waveforms is shown in Figure 3.

Each type of burst waveform can achieve its own function in short-wave communication. BW0 is used for third-generation automatic link establishment (3G ALE), BW1 is used for traffic management (TM), BW2 and BW3 are used for data transmission, respectively, high-rate data link (HDL) and low-rate data link (LDL) transmission, and BW4 is used for LDL acknowledgment. To clarify what these behaviors are, we will explain them one by one.

ALE encompasses establishing a communication link for shortwave communication. TM represents coordinating traffic exchanges on connections established based on the 3G-ALE protocol, and establishing a traffic link on which traffic can be delivered. HDL consists of providing reliable high-rate point-to-point data transfer service over the established links, while LDL means providing reliable low-rate point-to-point data transfer service.

This also illustrates that different burst waveforms correspond to different communication behaviors, which is summarized in Table 1.

3. Methods

Figure 4 shows the simple technical route of our research. On the basis of the burst waveforms of the physical layer, we carry out research on the identification of the communication behavior of the shortwave radio. Since behavior is a concept that describes biology, it is necessary to explain its meaning in the field of communication. Then, the mapping relationship between communication behaviors and burst waveforms needs to be found; both steps are mentioned in the second section. After the previous steps, the problem is transformed into the problem of signal processing and classification. In this section, the method of signal preprocessing is given, while the structure of the network model is designed.

3.1. Signal Denoising by the Autocorrelation Method

Suppose is a random signal, is the autocorrelation function of , which is defined as the degree of correlation of at different times. Then the definition formula of is as follows:

Therefore, assume is the additive white Gaussian noise (AWGN), its mean is 0 and its variance is , is the transmitter signal, and are uncorrelated, and is the receiver signal, that is,

Then, according to Equation (1), the autocorrelation function of can be expressed as

Since has the following nature

Then, (Equation (3)) can be converted to (Equation (5)) where is the time difference, which can take positive or negative numbers.

Therefore, it is proven that the autocorrelation function of is not easily affected by AWGN. When the SNR is 0 dB, the time domain waveform of BW0 is shown in Figure 5. The abscissa represents the duration of the signal, and the ordinate represents the amplitude of the signal.

According to Equation (5), the autocorrelation denoising curve of BW0 is shown in Figure 6. The abscissa is the time difference, and the ordinate is the autocorrelation function.

3.2. High-Order Spectral Analysis of Signals

Compared with the power spectrum, the higher-order spectrum contains more useful information, therefore, it can be considered as the development of the power spectrum. The bispectrum of the signal, also called the third-order spectrum, is the most basic higher-order spectrum. It has the advantages of requiring a small amount of calculation, suppressing AWGN to a certain extent [37], and retaining the original information of the signal as much as possible, so it is widely used in the field of signal processing.

The bispectrum and third-order cumulant of are defined as follows:

Among them, the necessary and sufficient condition for the existence of is that the third-order cumulant is absolutely summable, that is, where and are independent variables of , representing two time differences, and are the axes of the bispectral two-dimensional plane [38]. Through bispectrum transformation, the signal is transformed from the time domain to the spatial domain, which can retain more useful information.

3.3. Feature Extraction Process

The autocorrelation spectrum of the signal can be calculated according to Equations (5), (6), and (7), and the specific process is as follows:

(a) According to Equation (5), calculate the autocorrelation function of .

(b) Calculate the third-order cumulant of , and then calculate the bispectrum of according to Equation (7). Here, we denote as the autocorrelation spectrum of .

When the SNR is 0 dB, we plot the autocorrelation spectrogram pictures of five kinds of burst waveforms, as shown in Figure 7, where and are normalized frequencies.

For the convenience of observation, in Figure 8(a), we draw the contour map of Figure 7(a). The contour map can be considered as a two-dimensional representation of the three-dimensional autocorrelation spectrogram image. As a comparison, we also draw the autocorrelation spectrogram contour map of BW0 when the SNR is 5 dB, as shown in Figure 8(b).

According to Figure 8, we know that when the SNR is 0 dB, the autocorrelation spectrogram contour map is cluttered due to the influence of noise. When the SNR increases to 5 dB, the noise part is basically invisible, which proves the effectiveness of the autocorrelation method we select in denoising. It is feasible to extract the autocorrelation spectrogram features of the signal.

The process of extracting features is shown in Table 2.

3.4. Neural Network Model

In this section, we design a two-input CNN. Compared with the traditional CNN, two independent inputs represent two different network models, which can further learn the extracted features, which further reduces the probability of misclassification. The network model we designed can be considered to have two independent model structures because of its two branches. These two independent models only have convolutional layers, pooling layers, and fully connected layers, which are simple in structure, easy to implement, and fast to compute. Figure 9 is the framework of the two-input CNN we designed, which shows the basic structure of the network in this paper.

The extracted features are subjected to convolution operations in the two models, and then two outputs are obtained. The two outputs are merged through the add layer, and then through the subsequent layers, the classification task can be completed.

Figure 10 shows the specific structure of the two branches Model 1 and Model 2. Among them, the first branch has a small number of layers, only two convolution layers and two fully connected layers. After convolution calculation, features can be learned quickly and effectively. The second branch has more convolutional and fully connected layers than the first branch. The operation time of this branch is longer, but further features can be learned. Compared with traditional CNN, the model has stronger learning ability by using two branches to learn features separately. At the same time, because the number of layers in the first branch is relatively small, the extra calculation time consumption brought by the design of the two inputs is not much.

In this paper, the inputs of both two branches are autocorrelation spectrogram feature matrix extracted in Table 2.

The training process of our neural networks is summarized in Table 3.

4. Experiments

The experiments consist of the following parts. First, the recognition accuracy of the proposed algorithm for five communication behaviors is compared under different SNR conditions, meanwhile, the effects of different parameters on the experimental results are compared. Then, the improvement in the results of our proposed algorithm is verified by comparison with the previous algorithm. Finally, the recognition results of different network models are compared.

The experimental parameters are set as follows: 5000 samples are selected. The number of training sets and test sets is divided into 4 to 1, and 25% of the training sets are randomly selected as validation sets. We use the momentum optimizer, which dynamically adjusts the learning rate. The initial learning rate of the network is 0.01; the attenuation factor of the learning rate is 1e-6; the batch size is 64, and the epoch is 500.

Experimental Environment: Windows 10 Operating System, 11th Gen Intel (R) Core (TM) I5-11260H CPU, NVIDIA GeForce RTX3050, Python 3.7, TensorFlow 2.5.0, and Keras 2.8.0.

4.1. Verify the Validity of the Algorithm

According to the experimental parameters, the algorithm should first be verified. When the SNR is 0 dB, 5 dB, 8 dB, 10 dB, and 15 dB, respectively; the recognition results on the test set are shown in Figure 11. The results illustrate that BW1 and BW2 are most easily classified into wrong classes. When the SNR is not lower than 5 dB, the recognition accuracy of each communication behavior can reach more than 80%, and with the reduction of noise, the recognition accuracy can reach close to 100%.

Figures 1214 show the comparison of different initial learning rates, different attenuation factors, and different batch sizes, respectively.

The experimental results show that when the initial learning rate is 0.01, and the attenuation factor is 1e-6; the learning ability of our designed network is the best. When the batch size is 64, 32 and 16, the learning ability of the network is not much different. However, according to Table 4, when the batch size is 64, the learning time per epoch is the shortest. As a result, it is reasonable to set the batch size to 64.

4.2. Experimental Comparison of Different Algorithms

Currently, the research on radio behavior recognition based on signals from the physical layer is in its infancy, and only the study in [33] carries out communication behavior recognition based on five types of burst waveforms. When the SNR is 15 dB, the recognition accuracy can reach 99.3%, but with the increase in noise, the recognition accuracy obviously decreases. When the SNR is lower than 8 dB, the overall recognition accuracy is lower than 80%. When the SNR is 0 dB, it is even lower than 50%. In this experiment, we compare the algorithm proposed in this paper with the algorithm in the literature [33] as well as the network model in this paper plus, the features in the literature [33], and the network model in this paper plus the features in the literature [33]. The experimental results are illustrated in Figure 15.

It can be observed that the algorithm proposed in this letter offers the best results, especially on the lower SNR conditions. Since the autocorrelation denoising method can effectively eliminate noise; our algorithm can significantly improve recognition accuracy. Even if the SNR is 0 dB, the accuracy can still reach 91%. When the SNR is 5 dB, 8 dB, and 10 dB, the accuracy of the algorithm in this letter can increase by 23.8%, 15.9%, and 3.6%, respectively, compared to the algorithm in literature [33]. When the SNR is 15 dB, the influence of noise can be ignored, and the autocorrelation method to reduce noise will cause the loss of information from the original signals, which results in the loss of features. This part of the loss offsets the influence of noise, making the accuracy slightly lower than in the literature [33], but it can still reach 99.2%.

If we extract the autocorrelation spectrogram features and select the network model in the literature [33], the recognition accuracy under low SNR conditions can still be effectively improved. When the SNR is 0 dB, 5 dB, 8 dB, and 10 dB, the recognition accuracy can increase by 43.6%,23.0%, 15.9%, and 3.4%, respectively. When the SNR is 15 dB, due to the loss of features caused by the denoising method, the recognition accuracy is lower than that of literature [33]. Compared with the method in this paper, the recognition accuracy is also lower due to the simple network.

In addition, when the extracted features are autocorrelation spectrogram features, the recognition accuracy can be further improved by using the network in this paper.

4.3. Comparison of Different Network Models

This experiment compares the recognition accuracy of several classical convolutional neural networks. First, as representatives of simple convolutional neural networks, AlexNet and LeNet are chosen as comparison objects. Classic network models such as VGG-19 and ResNet34 are also compared. When the network tends to be stable, the experimental results of different network models are shown in Figure 16.

AlexNet itself is easy to implement due to its simple structure, and its experimental results are better than those of LeNet and VGG-19, and slightly inferior to ResNet34. When the SNR ranges from 0 to 15 dB, our network model can perform better recognition, which is superior to traditional CNNs.

5. Conclusions

In this paper, we carry out research on the identification of shortwave radio communication behavior, starting from five types of burst waveforms in the physical layer. A signal feature extraction method based on autocorrelation spectrogram features is proposed. A neural network model is improved to further optimize the algorithm. The experimental results prove that the autocorrelation spectrogram features are not easily affected by noise, and the proposed network can further improve the recognition results. Our proposed algorithm can dramatically improve recognition results under low SNR conditions. Behavioral learning is increasingly important in communications, as a typical CRS, shortwave radio is just the first research object in a series of relevant works. Our subsequent work will continue to focus on behavior recognition research on the CRS. In the future, we aim to expand related research to more types of CRS and wider communication scenarios, and consider more environmental factors, which is quite meaningful for communication confrontation.

Data Availability

Each burst waveform is generated according to the third-generation shortwave communication protocol standard. The dataset we use is made according to the feature extraction process mentioned in this paper. You can also ask for the data by contacting [email protected].

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

We would like to thank the National Natural Science Foundation of China, for the research on several key technologies for the individual identification of communication radiation sources in complex electromagnetic environments. (Grant 62071479).