Abstract

Detecting increasing anomalous signals is critical to effective spectrum management due to the complexity of the use of electromagnetic spectrum. The anomaly recognition approach based on autoencoder (AE) heavily relies on the assumption that the reconstruction error of normal signals is generally lower than that of abnormal signals. Unfortunately, such an assumption does not necessarily hold due to the excessive generalization ability of the AE. The memory-augmented autoencoder model (MemAE) has been recently proposed to address this issue by introducing memory modules. Still, the standard MemAE model performs poorly on complex image datasets, and its performance for abnormal communication signals recognition is unknown. Therefore, we propose an image enhancement and improved memory-augmented autoencoder model (IIMemAE) to recognize abnormal communication signals. Specifically, we consider two key factors, i.e., the existence of redundant information in the time-frequency spectrogram and the low recognition accuracy of normal signals of the standard MemAE model. We introduce an image enhancement module and an anomaly determination module compared with the standard MemAE model. The proposed IIMemAE model can address the issue that the performance of anomaly recognition may be degraded due to the imbalanced communication signals in the real world. The simulation results show that IIMemAE can effectively recognize synthetic abnormal communication signals even at low signal-to-noise ratio (SNR) and jamming-to-signal ratio (JSR) conditions and outperform the standard MemAE model. Besides, the parametric Pauta criterion proposed can balance the recognition accuracy of normal and abnormal signals to meet the need for diverse recognition tasks.

1. Introduction

The electromagnetic spectrum is one of our most critical, widely used, and limited natural resources [1, 2]. With the advent of new wireless communication technologies, spectrum usage has become very complex, leading to radio wave congestion and other jamming issues [3]. Wireless spectrum anomalies may occur due to spectrum abuse or jamming. To effectively regulate the radio spectrum and improve the quality of service for wireless communications, it is crucial to analyze and detect anomalous behaviors in the electromagnetic spectrum.

Various communication antijamming technologies [46] and anomaly identification technologies [7, 8] have emerged to ensure the reliable transmission of information. In the wireless communication networks, the authors in [9] propose to detect interference by comparing the predicted packet delivery rate with the actual packet delivery rate. The authors in [10] focus on improving network performance under active jamming attacks to achieve jamming suppression. For spectrum anomaly identification, the characteristics of the communication signals can be exploited, as commonly done in the existing literature, such as [1113]. The work in [11] proposes a cross-layer framework, which enables to detection of anomalous spectrum usage attacks (ASUA) in radio ad hoc networks by collecting the physical and network layer information. The authors in [12] propose a spectrum anomalous usage detection method using the spatial distribution characteristics of the received signal strength. A typical spectrum is proposed based on feature extraction and a clustering algorithm to analyze spectral data in the broadcast bands [13].

However, these methods in [1113] based on feature extraction could be of high computational complexity and consume lots of manpower and time [14]. In recent years, deep learning (DL) has achieved excellent performance in the radio field, such as interference identification [15] and spectrum anomaly detection [16]. A supervised learning-based scheme for signal detection and localization is proposed in [17], where a classification model is built by manually labeling each training sample. However, due to the complex electromagnetic spectrum situation and the unpredictable characteristics of anomalous signals, it is significantly difficult to collect and label a sufficient number of abnormal signals. Furthermore, communication signals in real-world systems are generally imbalanced. Abnormal communication signals can take many forms from the presence of unwanted signals in licensed bands to the absence of expected signals [18], which makes there are more abnormal signals than normal signals. Therefore, the performance of anomaly recognition methods based on supervised learning may be greatly degraded [19].

More recently, the unsupervised autoencoder (AE) learning model has emerged and has been adopted for anomaly detection with imbalanced data [2025]. In network intrusion detection systems, the authors in [21] propose to employ a multimodal deep autoencoder (DAE) to detect abnormal traffic in the network and forward the anomaly to an attack classifier for classification. The authors in [22] use an ensemble of autoencoders to propose an online network traffic anomaly detection approach that can detect various attacks with performance comparable to offline anomaly detectors. In the anomaly recognition task, the AE is trained by minimizing the reconstruction error of normal signals, for which the reconstruction error is used as the anomaly metric. In [23], a deep autoencoder network is applied to detect the anomalies by reconstructing the spectrogram of the received signal. An unsupervised anomaly identification method based on the convolutional autoencoder (CAE) is studied in [24] to identify radio frequency interference. The authors in [25] propose a variational autoencoder- (VAE-) based spectral anomaly detection method for unlicensed bands, where the percentile (PER) score is introduced as a new measure of anomaly. Note that, the effectiveness of these AE models for anomaly recognition heavily relies on the assumption that the reconstruction error of normal signals is generally lower than that of abnormal signals.

However, this assumption does not necessarily hold as the abnormal signals may also be reconstructed well by the AE due to its strong generalization ability [2628]. For example, if abnormal and normal signals share some common compositional features, a “strong” AE may also reconstruct the anomalies well. Thus, the reconstruction errors of the normal signals and some abnormal signals could be close to each other. Further, as there are generally no training samples of abnormal signals, it is impossible to predict the reconstruction results of abnormal signals. To address this issue, introduces a memory-augmented autoencoder (MemAE) to enlarge the reconstruction error of anomalous samples, thereby improving anomaly detection performance. However, the MemAE model does not achieve satisfactory performance for the anomaly recognition with complex image datasets (e.g., CIFAR-10), and its performance remains unknown for the datasets with abnormal communication signals. Moreover, the reconstruction error is used as the only criterion in [29] to determine whether a signal sample is normal or not. This, however, does not fully exploit the statistics (e.g., the mean and the variance) of the reconstruction error, which may be applied to define a more appropriate detection metric.

In this paper, we propose an unsupervised abnormal signal recognition algorithm based on image enhancement and improved memory-augmented autoencoder (IIMemAE). The main contributions can be described as follows: (i)We consider an abnormal signal recognition model based on unsupervised learning. To the best of our knowledge, this work is the first attempt to apply the MemAE model in abnormal communication signals recognition(ii)We propose an abnormal signal recognition algorithm based on image enhancement and improved memory-augmented autoencoder (IIMemAE). Specifically, we introduce an image enhancement module and an anomaly determination module to the standard MemAE model. Instead of using the reconstruction error directly, a parametric Pauta criterion is proposed to measure the anomaly of the reconstruction error of the signal(iii)We evaluate the performance of various aspects of the IIMemAE model to verify its effectiveness and stability. The simulation results show that the average AUC (area under curve) value of the proposed method is greater than 80% even when the SNR is 2 dB and is close to 70% at the JSR is -5 dB. Compared to the standard MemAE model, the proposed IIMemAE model achieves better and more stable recognition performance, especially under low SNR and JSR conditions. Additionally, simulation results demonstrate that the anomaly determination module based on the parametric Pauta criterion proposed can balance the recognition accuracy of normal and abnormal signals and meet the need for different recognition tasks

The remainder of this paper is organized as follows. Section 2 introduces an abnormal signal recognition system model based on unsupervised learning. The function of each module of the IIMemAE model is presented in Section 3. Section 4 presents and analyzes the simulation results. Section 5 concludes the paper.

Notations: Matrices and vectors are in bold capital and bold lower cases, respectively. Number fields are in blackboard bold. The notations , , and represent absolute value, -norm, and -norm, respectively.

2. System Model

As shown in Figure 1, we consider an abnormal signal recognition model in a wireless communication system. In addition to legitimate signals, there may be malicious or illegal signals in the signals received by the receiver in this system model. Then, the anomaly recognition module on the receiver accomplishes the reconstruction and anomaly recognition of the signal received.

We consider two ways to define abnormal signals. One is for unauthorized signals, that is, signals sent from illegal transmitters without the authorization of radio regulatory agencies. The other is an authorized signal, which suffered external malicious jamming or failures that occur on the receiver during transmission. In specific, for nonstationary signals, abnormal signals can be described as the following: (i)The signal is legal, but the parameters such as the bandwidth and the center frequency of the signal are not within the specified range(ii)This is one synthetic anomaly, where legal signals coexist with jamming signals and the jamming signal parameters are legal

The primary task of abnormal communication signals recognition is to identify whether the received signal is normal or abnormal. In this paper, we focus on the second type of anomaly (the abnormal signal recognition for the first type is left to future work.), and the abnormal signal is modeled as follows: where denotes the normal signal received and denotes the jamming.

The jamming-to-signal ratio (JSR) can be expressed as follows [30]: where and denote the jamming power and the signal power, respectively.

The problem of abnormal signal recognition can be formulated as a binary hypothesis test: where denotes the received signal, and represent the channel of the normal signal and the channel of the abnormal signal, respectively, and is the additive Gaussian white noise with zero mean and variance . The null hypothesis represents the absence of abnormal signals in the communication system operation, and the alternative hypothesis represents the presence of abnormal signals.

The variation of a signal in the time and frequency domains is the most crucial feature in spectrum usage. Time-frequency analysis can extract the characteristic information of communication signals at a specific time and frequency, accurately reflecting the relationship between the signal frequency and the time change. Therefore, the smooth pseudo Wigner-Ville distribution (SPWVD) [31] is performed to obtain a two-dimensional time-frequency spectrogram of the received signal as follows: where and are real symmetric window functions and denotes the conjugate of s. In particular, allows cross-terms oscillating parallel to the time axis to perform smoothing (i.e., time smoothing), and allows smoothing of cross-terms oscillating parallel to the frequency axis (i.e., frequency smoothing). Since smoothing is performed in both the time and frequency domains, the cross-interference terms of multicomponent signals can be well suppressed.

3. IIMemAE-Based Abnormal Signal Recognition Algorithm

In this section, we introduce the structure of IIMemAE and its implementation in abnormal signal recognition.

3.1. Overview

As shown in Figure 2, the proposed IIMemAE model consists of five main components: an image enhancement module, an encoder, a decoder, a memory enhancement module, and an anomaly determination module. In specific, the image enhancement module first improves the contrast of the time-frequency spectrogram of the signal. The encoder encodes the time-frequency spectrogram of the signal and generates query terms. The memory enhancement module includes addressing operations for recording normal signal patterns and retrieving the memory items, which are then passed to the decoder for reconstruction. The IIMemAE model does not decode directly through the decoder like standard autoencoders. Instead, the encoded results are used as query terms to retrieve normal signal patterns in memory enhancement modules and update memory terms through attention-based addressing operations.

Finally, the query result is inputted into the decoder for reconstruction. In the training process, the IIMemAE model proposed is trained by minimizing the reconstruction error and the entropy loss of the query weight of memory items. In the test phase, the model performs reconstruction using only a limited number of normal signal patterns recorded in memory. The anomaly determination module quantifies the abnormality degree of the signal according to a parametric Pauta criterion.

3.2. Image Enhancement Algorithm

The time-frequency grayscale image converted from a communication signal has low contrast, and low-contrast images result from a lack of dynamic range in the imaging pixels, which may induce the model to extract incorrect features and false matches [32]. By contrast stretching, the effect of redundant pixels in image features is reduced. Hence, we introduce an image enhancement module and apply the classical image grayscale transformation to the time-frequency grayscale image. The basic principle of the image grayscale transformation is to map the gray value of the original grayscale image to a specified range of grayscale values of the output image. By saturating parts of the time-frequency grayscale image at the lowest and highest grayscales, the contrast of the output image can be improved. In this paper, the grayscale transformation of a time-frequency spectrogram can be expressed as follows: where denotes the original image, denotes the image after grayscale transformation, and denotes the grayscale transformation function. The values of points and control the shape of the transformation function. In transforming the grayscale of the original image o to the new image , the grayscale value of the pixels in whose grayscale value is lower than is assigned as in . Similarly, the gray value of the pixel in whose gray value is higher than is also assigned as when it is transformed to .

For the gray value of any pixel in the original image, the gray value of the corresponding pixel in the new image is obtained after transformation. For example, the transformation processing can be given as follows:

3.3. Encoder and Decoder

The encoder and the decoder generally have a symmetrical structure. The input time-frequency spectrogram dataset sample space is represented as . When an input is given, the encoder converts it into an encoded in the low-dimensional feature space. The decoder is trained to reverse map a latent representation to the domain . The above process can be expressed as follows: where and denote the parameters of the encoder and the decoder, respectively. Here, is used as a query to retrieve the relevant items in memory. Then, these items are collected and fed to the decoder. Note that, for the standard AE model, is set to be [33].

3.4. Memory Enhancement Module

The memory enhancement module is designed as a matrix , including real-valued vectors of dimensions . The dimension of each memory item is the same as the dimension of the encoder encoding result . The hyperparameter defines the maximum capacity of memory. where is obtained by soft addressing of weight vector and memory . is a row vector with nonnegative entries that sum to one. The memory records the various prototypical normal patterns during training. The weight vector is obtained from and calculated by the softmax operation in Equation (10), where is the similarity measure, defined as cosine similarity.

In addition, the memory module can increase sparsity by hard shrinking the weight vector in Equation (11). Sparse addressing encourages fewer but more relevant memory items to represent query results, improving the representation accuracy of memory items. where denotes the shrinkage threshold and denotes the sparse weight vector. The elements in the weight vector that are less than or equal to the threshold value are assigned as 0 to realize the sparsity of the vectors. However, it is not easy to calculate the backpropagation of discontinuous functions. To simplify the calculation, we employ the continuous ReLU function to realize the shrinking operation as follows: where is a very small constant. The standardized operation shall be carried out again when the shrinkage is completed as follows:

3.5. Anomaly Determination Module

By querying the memory module, all prototype vectors are considered instead of the nearest prototype vectors, which solves the problem of the excessive generalization ability of the AE. This means that the reconstruction error for abnormal samples is enlarged, making recognition easier. The next phase is to design an appropriate threshold to separate the two classes instead of directly using the average reconstruction error as a metric. In the testing phase, the decoder outputs the reconstruction loss for all test samples and compares it with a predefined threshold , which can be expressed as follows:

To design an appropriate threshold, the Pauta criterion [34] is introduced: where denotes the reconstruction error and and denote the mean and standard deviation of the reconstruction error of the normal signal. To make the Pauta criterion more accommodation for the abnormal signal recognition task, we modified the Pauta criterion as follows: where denotes a positive constant. If the reconstruction error of the time-frequency spectrogram of the input signal satisfies the above formula, the signal is recognized as an abnormal signal. Otherwise, the signal is recognized as a normal signal.

3.6. Training

The training loss comprehensively considers the reconstruction loss and the entropy loss. Given a training dataset containing time-frequency spectrograms of normal signals, let be the reconstruction time-frequency spectrogram corresponding the each training time-frequency spectrogram . The minimized reconstruction error for each time-frequency spectrogram can be expressed as follows: where the reconstruction error is replaced by -norm. The entropy loss improves the sparsity of the generated address weights during training. We minimize the entropy loss:

Combining Equations (17) and (18), the training loss of IIMemAE is described as follows: where denotes a hyperparameter. The memory is updated by backpropagation and gradient descent during training.

Specifically, there are three primary stages for the implementation of the abnormal communication signals recognition system: (1)Signal reception: In the communication system operating, the receiver performs time-frequency processing on all received signals to obtain time-frequency spectrograms. The time-frequency spectrograms are then processed with an image enhancement algorithm(2)Training IIMemAE: Since the normal communication signal is known to the system, the IIMemAE model can be pretrained by the time-frequency spectrogram of the normal communication signal, which is considered to memorize the characteristics of the normal communication signal. The time-frequency spectrogram used in training should also be processed via the image enhancement algorithm(3)Anomaly recognition: All the time-frequency spectrograms obtained in (1) are used as the input of the IIMemAE model, which reconstructs all the time-frequency spectrograms. The parametric Pautu criterion measures the threshold. If the reconstruction error is larger than the threshold, it is considered an abnormal signal. Otherwise, it is considered to be a normal signal

4. Simulation Results and Discussion

In this section, the performance of the proposed IIMemAE model and its variants are evaluated through extensive simulations. The standard MemAE [29] serves as the baseline under identical conditions. All models are trained with the Adam optimizer of a learning rate of 0.01 for a maximum of 100 epochs. The hyperparameter is 0.0002. All models are implemented based on the Pytorch framework and trained by using an NVIDIA GTX1080 GPU.

4.1. Simulation Setup

In this paper, the BPSK signal as the normal signal is obtained by simulation. Any remaining signal that is not a BPSK signal is considered anomalous, and we simulate six typical synthetic anomalies [35, 36]. (i) Comb-spectrum: the comb spectrum jamming consists of multiple identically modulated subjammings. It is a set of narrowband interferers modulated over a range of frequencies. Each subjamming is superimposed in the time domain and separated in the frequency domain. The jamming spectrum is comb-shaped. (ii) Multitone: the multitone jamming consists of multiple tones. Each frequency point is randomly distributed in a specific frequency band. (iii) Pulse: the pulse jamming transmitted at a random time on a fixed frequency band. (iv) Single-tone: single-tone jamming can affect the BPSK signal by generating high power. (v) Sweeping: it performs linear frequency sweep jamming on the BPSK signal, and the center frequency of the jamming is the same as the BPSK signal. (vi) Noise-FM: the noise FM jamming with a certain bandwidth and high power is added to the BPSK signal.

The time-frequency spectrograms of one normal signal are shown in Figure 3 and six abnormal signals are shown in Figure 4. The training dataset contains 2000 normal time-frequency spectrograms, each of size . The testing dataset contains 1400 time-frequency spectrograms with 200 normal time-frequency spectrograms and 1200 anomalous time-frequency spectrograms. Each abnormal signal mentioned above has 200 time-frequency spectrograms, each of which is of size . The training and testing datasets are not duplicated. The test dataset’s 200 time-frequency spectrograms of the normal signal are taken out as the validation dataset.

The implementation of the IIMemAE model to recognize abnormal signals can be described as follows: (1)The training dataset and test dataset are processed by the image enhancement algorithm. Then, the training dataset is fed into the network to train. Only the time-frequency spectrogram of normal signals is trained during training. Figure 5 shows an example of the samples processed by the image enhancement algorithm(2)Input the validation dataset into the trained network to obtain the reconstruction error of the normal signal timefrequency spectrogram. The threshold is measured by the mean and variance of the reconstruction error(3)Input the test dataset into the trained network and get the reconstruction error of each time-frequency spectrogram. Comparing the reconstruction error with the threshold , if the reconstruction error of a time-frequency spectrogram satisfies the parametric Pauta criterion in Equation (16), it is considered an abnormal signal. On the contrary, it is considered a normal signal

As commonly done in the literature [20, 29], we use the AUC value as the evaluation index of all models, which is the area under the ROC (receiver operation characteristic) curve. We also consider the -score, which represents the harmonized average of precision and recall. It is a metric primarily used to assess imbalanced data accurately [37]. Table 1 shows the average AUC values for 20 tests on the signal dataset. The memory capacity is 1000, the signal-to-noise ratio (SNR) is 10 dB, and the JSR is 5 dB.

As shown in Table 1, the proposed IIMemAE outperforms the standard MemAE. In particular, all evaluation metrics of IIMemAE at outperform the standard MemAE.

Specifically, the model with anomaly determination module significantly outperforms MemAE without parametric Pauta criterion and exhibits higher TP (true positive). From the mean TN (true negative), MemAE (IE) can produce better results than the standard MemAE. We can conclude from Table 1 that the image enhancement module can improve the recognition accuracy of abnormal signals, and the abnormal judgment module can improve the recognition accuracy of normal signals. The IIMemAE proposed in this paper comprehensively considers the effect of the two modules on the overall recognition accuracy. Therefore, IIMemAE can yield better performance than the standard MemAE.

Moreover, we empirically study the computational complexity of the proposed method on the test dataset (i.e., contains 1400 time-frequency spectrograms) using NVIDIA GTX1080 GPU. As shown in Table 2, the proposed IIMemAE () averagely takes 0.01627 seconds to recognize a time-frequency spectrogram. Compared to the standard MemAE model that takes 0.01495 seconds for each time-frequency spectrogram, our image enhancement module and anomaly determination module induce little additional computational time (i.e., per time-frequency spectrogram).

4.2. Impact of Different Memory Capacities

We conduct experiments by setting different memory capacities, and the AUC values are shown in Figure 6. For the anomalous signal datasets, the AUC of the model is always stable even as the dimension of the memory matrix increases. It indicates that IIMemAE is insensitive to memory capacity and can robustly produce credible results. In addition, simulation results of other models show that they are equally stable and robust. All models have AUC values of 80% and above, with IIMemAE () performing the best. Compared with the standard MemAE, IIMemAE and MemAE () can produce better AUC values. The MemAE (IE) does not perform well as the standard MemAE because it has worse TP values.

4.3. Impact of Different SNRs

As shown in Figure 7, with the increase of SNR, the AUC values of the five models increase. Overall, MemAE (IE) shows the worst performance, and IIMemAE () offers the best performance. However, it is worth noting that the standard MemAE performs the worst at low SNR. These models with the image enhancement module and anomaly determination module can produce good recognition results even at low SNR.

Figure 8 shows the reconstruction error distribution for each signal under the IIMemAE () model. In the boxplots, the upper and lower bounds of the box in blue are the upper and lower quartiles of one signal reconstruction error, respectively. Therefore, the width of the box reflects the degree of fluctuation in the reconstruction error of each signal. The red line segment in the middle of the box represents the median of each signal reconstruction error. SNR is -2 dB in the left boxplots, the average reconstruction error of the BPSK signal is 32.69, and the average reconstruction error of each abnormal signal is 71.69, 49.32, 66.80, 31.35, 32.75, and 42.88, respectively. SNR is 10 dB in the right boxplots, the average reconstruction error of the BPSK signal is 13.63, and the average reconstruction error of each abnormal signal is 98.57, 64.27, 110.48, 26.09, 38.34, and 56.48, individually. As the SNR increases, the average reconstruction error of the BPSK signal decreases, while the average reconstruction error of the abnormal signal increases. Therefore, the larger the SNR, the IIMemAE model can better reconstruct normal signals and enlarge the reconstruction error of abnormal signals.

4.4. Impact of Different JSRs

Figure 9 shows the experimental results of setting different JSRs. Compared with the standard MemAE model, the IIMemAE () model offers the best performance evaluated based on AUC. In particular, all models except MemAE () perform best when JSR is 0 dB. Specifically, Figure 10 shows the reconstruction error distribution of each signal through the IIMemAE () model at different JSRs. JSR is -5 dB in the left boxplots, the average reconstruction error of the BPSK signal is 11.06, and the average reconstruction error of each abnormal signal is 17.76, 25.75, 13.62, 19.86, 16.46, and 25.54, respectively. JSR is 0 dB in the middle boxplots, the average reconstruction error of the BPSK signal is 11.06, and the average reconstruction error of each abnormal signal is 26.74, 32.87, 34.71, 17.93, 22.15, and 28.71, individually. Since the parameter JSR is set based on the abnormal signal, it does not affect the reconstruction of the BPSK signal for the IIMemAE model. As the JSR increases, the IIMemAE model enlarges the reconstruction error of anomalous signals and thus can result in better performance.

However, when the JSR increases to a certain level, the performance of the IIMemAE model begins to decline. In Figure 10, JSR is 10 dB in the right boxplots, the average reconstruction error of each abnormal signal is 55.27, 27.97, 53.07, 7.47, 11.14, and 27.61, respectively. Compared with the middle boxplots at JSR of 0 dB, the reconstruction errors of abnormal signal comb-spectrum and pulse are enlarged. However, the reconstruction errors of abnormal signals multitone, single-tone, sweeping, and noise-FM are reduced instead. Particularly, the reconstruction errors of abnormal signal single-tone and sweeping are almost as same as the normal BPSK signal.

To see it more clearly, we further plot images for the experimental results, which are performed color reversion process in Figure 11. It shows the time-frequency spectrogram, the reconstructed time-frequency spectrogram, and the reconstructed error image of the abnormal signals single-tone with a JSR of 10 dB. Comparing the reconstructed error images at the JSR is 0 dB and 10 dB, the IIMemAE model can reconstruct the abnormal signal single-tone and sweeping well with the JSR of 10 dB. In Figure 11(a) and Figure 11(b), the reconstruction error is 20.65 and 11.56, respectively. Single-tone jamming and frequency sweeping jamming characteristics are similar to the BPSK signal characteristic. From the time-frequency spectrogram, they both look like a line segment. Therefore, the performance of the IIMemAE model begins to degrade at the JSR of 5 dB.

The synthetic abnormal signal in this paper is formed by the superimposition of the jamming signal on the normal recognition. Simulation results show that the IIMemAE model outperforms the standard MemAE model even at low SNR and JSR. Specifically, in the IIMemAE model, the image enhancement module can improve the recognition accuracy of abnormal signals, and the anomaly determination module can improve the recognition accuracy of normal signals. Moreover, the anomaly determination module with the parametric Pauta criterion can balance the recognition accuracy of normal and abnormal signals and adapt to diverse abnormal signal recognition tasks. Future works will investigate the image enhancement model to further improve the performance of the IIMemAE model for signals with similar features. Besides, more research work is needed to detect anomalies in real-time signal. When the JSR increases to a certain level, the time-frequency spectrogram of the synthetic abnormal signal will mainly display the characteristics of the jamming signal. If the characteristics of the jamming signal are similar to the normal signal, the IIMemAE model will reconstruct the synthetic abnormal signal well, leading to its performance degradation. This issue also exists in the standard MemAE model and two variant models. We hope to address this issue in future work further.

4.5. Impact of Different

Figure 12 shows the effect of different parameters in the IIMemAE model on the recognition performance of abnormal signals with the SNR is 10 dB and the JSR is 5 dB. The evaluation indexes TP and AUC increase with the increase of , and the -score fluctuates relatively stably within a certain range. However, the evaluation index TN decreases as increases. It dictates that the size of can balance the classification accuracy of positive samples and negative samples. In particular, TP and TN are above 94% when the parameter is set to 2. Therefore, we can select an appropriate value for the IIMemAE model to adapt to different abnormal signal recognition task requirements.

5. Conclusion

In this paper, we have proposed an IIMemAE model to improve the performance of abnormal communication signals recognition based on the MemAE model. The proposed IIMemAE model can address the issue that the performance of anomaly recognition may be degraded due to the imbalanced communication signals in the real world. We have considered two important issues, i.e., redundant information exists in the time-frequency spectrogram, and the recognition accuracy of normal signals based on the standard MemAE model is low. We have introduced an image enhancement module and an anomaly determination module. In particular, the image enhancement module can reduce the influence of redundant information in the time-frequency spectrogram, and the anomaly determination module based on the parametric Pauta criterion can be flexibly adjusted according to the need for anomaly or classify the detected anomalies.

Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work is supported by the National Key R&D Program of China (No. 2018YFB1800800) and the National Natural Science Foundation of China (No. U20B2038, No. 61901520, No. 61931011, No. 61871398, No. 61827801, and No. 62101254).