Abstract

With the development in communications, the weak pulse signal is submerged in chaotic noise, which is very common in seismic monitoring and detection of ocean clutter targets, and is very difficult to detect and extract. Based on the threshold autoregressive model, pulse linear form, Markov chain Monte Carlo (MCMC), and profile least squares (PrLS) algorithm, phase threshold autoregressive (PTAR) model and double layer threshold autoregressive (DLTAR) model are proposed for detection and extraction of weak pulse signals in chaotic noise, respectively. Firstly, based on noisy chaotic observation, phase space is reconstructed according to Takens’s delay embedding theorem, and the phase threshold autoregressive (PTAR) model is presented to detect weak pulse signals, and then the MCMC algorithm is applied to estimate parameters in the PTAR model; lastly, we obtain one-step prediction error, which is used to realize adaptively detection of weak signals with the hypothesis test. Secondly, a linear form for the pulse signal and PTAR model is fused to build a DLTAR model to extract weak pulse signals. The DLTAR model owns two kinds of parameters, which are affected mutually. Here, the PrLS algorithm is applied to estimate parameters of the DLTAR model and ultimately extract weak pulse signals. Finally, accurate rate (Acc), receiver operating characteristic (ROC) curve, and area under ROC curve (AUC) are used as the detector performance index; mean square error (MSE), mean absolute percent error (MAPE), and relative error (Re) are used as the extraction accuracy index. The presented scheme does not need prior knowledge of chaotic noise and weak pulse signals, and simulation results show that the proposed PTAR-DLTAR model is significantly effective for detection and extraction of weak pulse signals under chaotic interference. Specifically, in very low signal-to-interference ratio (SIR), weak pulse signals can be detected and extracted compared with support vector machine (SVM) class and neural network model.

1. Introduction

Weak signals are extremely weak and difficult to detect in most situations. Signal-to-noise ratio (SNR) or signal-to-interference ratio (SIR) is a measure of relative strength of signal and noise [1, 2]. Weak signal detection and extraction firstly need to analyze characteristics of noise and then use modern signal processing methods and statistical models to achieve signal detection and recovery of weak signals inundated by strong noise [3, 4]. The main methods of statistical detection and extraction of weak signals include the stochastic resonance method [58], chaotic system and control method [9, 10], neural network method [11], Duffing oscillator method [1216], and lock phase amplifier method [17]. These traditional methods can be called noise suppression methods and have weak detection capabilities for lower signal-to-noise ratio situations, especially for signal detection that is submerged under chaotic or fractal noise. With the systematic study of characteristics of chaotic or fractal signals, models and methods for characterizing chaos [18, 19] and fractals are continuously updated and iteratively, and these new theories are fully utilized to provide an effective solution to the problem of detection and extraction of weak signals under chaotic noise. The detection and extraction of weak signals under chaotic noise interference can achieve lower SNR working thresholds by fully utilizing chaotic nonlinear characteristics of observation information [20, 21]. It has a wide range of applications in the fields of wireless secure communications, marine monitoring, and bioinformatics [2228].

Pulse signals are a common signal in the fields of wireless communication, fault diagnosis, ocean monitoring, and bioinformatics [22], which are often interfered by chaotic noise other than Gaussian white noise. How to effectively detect and estimate the weak pulse signal under chaotic noise interference and improve detection limit and extraction accuracy, which are the scientific basis for reducing the cost of testing equipment and the development of high-precision testing instruments, and so it has important theoretical and practical significance. The detection and extraction of weak pulse signals under chaotic interference have become a research hotspot in academic circles, such as neural network methods [29, 30], support vector machine methods [3133], filter-based methods [34, 35], methods based on empirical mode decomposition [36], geometric methods [37], and local linear methods [38, 39]. These methods have achieved some effects, but often the detection and extraction are separated or the detection does not give an optimal threshold.

In order to realize intelligence statistical detection and improve target signal extraction accuracy of weak pulse signals under chaotic noise, this paper proposes the phase threshold autoregressive (PTAR) model and double layer threshold autoregressive (DLTAR) model for detection and extraction of weak pulse signals in the context of chaotic noise. In signal detection, phase space threshold autoregressive detection (PTAR) model is proposed, which is applied to construct one-step prediction error, hypothesis testing is used to realize detection for weak pulse signals under chaotic noise, and parameters for the PTAR model are obtained by using Markov Chain Monte Carlo (MCMC) method. This method can effectively detect weak pulse signals and increase the adaptive ability. In signal extraction, linear representation of the pulse signal is introduced and mixed into the PTAR model to construct a double layer threshold autoregressive (DLTAR) model, and these parameters are estimated using profile least squares (PrLS) algorithm. The overall framework of this paper is shown in Figure 1:

The structure of this paper is as follows: Section 2 explains weak pulse signal detection based on the PTAR model; Section 3 presents weak pulse signal extraction with the DLTAR model; Section 4 comprises simulation experiment results and analysis; Section 5 concludes the study.

2. Weak Pulse Signal Detection Based on the PTAR Model

2.1. Problem Conversion for Signal Detection

Signal detection is a binary decision problem, that is, whether there is a target signal, which is summarized below as a binary hypothesis test:where represents the observed signal, represents chaotic noise, and represents a weak pulse signal.

Since here the signal-to-noise ratio is extremely low, the target signal is often very weak, and in the case of submerging strong chaotic noise, if the hypothesis test is directly inferred by (1), it is prone to extremely high false positives. Therefore, it is considered to suppress the chaotic noise firstly; that is, if the chaotic noise is known, then (1) can be transformed into the following hypothesis test problem:

Obviously, chaotic noise is unknown, but its chaotic nonlinear characteristics can be used to obtain its estimation. So one of the key issues in signal detection is how to estimate the chaotic background signal? If the estimation of chaotic noise is obtained, then the residuals can be calculated, and the hypothesis test is used to detect the target signal. The detection model is mainly divided into four steps: (1) reconstructing one-dimensional observation signal to form a high-dimensional observation space and effectively depicting its historical similarity in high-dimensional space; (2) establishing a threshold autoregressive model in phase space for estimating the chaotic noise; (3) calculating the parameters with the MCMC algorithm and obtaining the prediction residuals; (4) using the hypothesis test and value to detect whether there is a weak pulse signal from prediction error.

2.2. Phase Threshold Autoregressive (PTAR) Model

A one-step prediction model is built in three steps: (1) reconstructing observed signals to form a high-dimensional matrix; (2) forming PTAR model; (3) estimating parameters of the PTAR model by using MCMC algorithm.

2.2.1. Reconstructing Observed Signals to Form High-Dimensional Phase Space

For received signal , its phase in high-dimensional space is

According to Takens’s theorem [40], there is a smooth mapping that makes . If you can find analytical solution of or find estimate of , you can make one-step prediction. Here, the delay time is calculated by the complex autocorrelation method, and the embedding dimension is obtained by the Cao method [41].

2.2.2. Phase Threshold Autoregressive (PTAR) Model

For reconstructed signals , based on a threshold autoregressive model [42], the PTAR model can be expressed aswhere , is the number of regions, is the delay parameter, and is divided by the real axis, , and the variance of is . The parameters in the model are recorded as .

2.2.3. Parameters Estimation for the PTAR Model

The parameters estimation of the PTAR model is based on the MCMC algorithm. Let , and . The key to the MCMC algorithm is to obtain the posterior distribution of each parameter in . The prior distributions of the parameters in the PTAR model are ; is the discrete uniform distribution; ; is the inverse gamma distribution; and is a hyperparameter.

According to Chen’s literature method [42], conditional posterior distribution of each parameter in the PTAR model is calculated:where , , , , , s satisfies , , and .where and .

According to the above posterior distribution, Gibbs sampling and random walk M-H sampling algorithm [42] are used to estimate the mean of parameters. The specific sampling plan is as follows:(1)Sampling by using the Gibbs sampling method based on the posterior distribution (5)(2)Sampling by using the Gibbs sampling method based on the posterior distribution (6)(3)Sampling by using the random walk M-H sampling method based on the posterior distribution (7)(4)Sampling by using the Gibbs sampling method based on the posterior distribution (8)

Using this sampling scheme, iteratively calculate 10,000 times, burn the first 4000 times, and use subsequent stable Markov chain to calculate the sample mean of each parameter; that is, the parameter estimation is obtained.

Bring the estimated value of into the PTAR model to obtain one-step predicted value , and then prediction error is

2.3. PTAR Statistical Detection Method for Weak Pulse Signals

Based on the prediction error obtained in Section 2.2, a hypothesis test is performed to determine whether there is a weak pulse signal in observed signal . Based on prediction error equation (9), we get statistics and (under the condition that is established), and then the value at time is , where is the sample implementation value of under assumption that the null hypothesis is established. At a significance level of 0.05, the optimal detection threshold is . The weak signal is detected by the value. If , the null hypothesis is rejected, and there is a weak pulse signal; otherwise, it does not exist.

The weak pulse detection process based on the PTAR model and hypothesis test method is shown in Figure 2.

3. Weak Pulse Signal Extraction with the DLTAR Model

3.1. Double Layer Threshold Autoregressive (DLTAR) Model

The weak pulse signal here is a multipoint skip pulse model formed by plurality of pulses at different time points, as follows:where , , and .

According to basic principle described in Section 2, combined with linear representation of the weak pulse signal and PTAR model, the following extraction model is established:

The above model is called the double layer threshold autoregressive (DLTAR) model. This model is novel, and it combines the hot spot and PTAR model of the pulse signal and contains the parametric and nonparametric parts. So it cannot be solved directly. Among them, and is the Gaussian white noise with zero mean. It can be seen from formula (11) that the DLTAR model contains parameter vector and the nonparametric part . From linear expression of the weak pulse signal, it is known that if is obtained, then signal is extracted. The parameters in the DLTAR model are estimated by using the profile least squares estimation (PrLS) method.

3.2. Estimating Parameters of the DLTAR Model with the PrLS Method

If least square estimation method is used and sum of squared errors of the DLTAR model is minimized, then the optimal estimate can be obtained, namely,

It is known from (12) that it contains two major unknown parameters and , and the form of is more complicated. Here, the profile least squares (PrLS) algorithm [43] is used to alternately update iterative estimates of and .

The procedure for estimating optimal values of parameters and by using the PrLS algorithm is divided into three steps: (1) calculating with the values of with the MCMC algorithm; (2) the value of iteratively is updated with the value of with the PrLS algorithm; and (3) the alternate iteration is repeated until the value of changes very little or the specified error is reached. The detailed steps are described as follows:(a)Given the initial value of , estimate the value of , denoted as .It can be seen from equation (11) that when a value of is given, can be calculated. The structure is the analogy of the PTAR detection model. Take the MCMC algorithm to estimate, as follows:(b)Update the value of with value of estimated in (a) and record it as .It can be seen from equation (12) that when is given, the optimal value of can be obtained by using the Newton–Raphson algorithm. The specific description is as follows:Let , where is the derivative. For a given , according to Taylor’s exhibition, there isFrom (15), there is one-step iteration formula:(c)Repeating steps (a) and (b) until converges to obtain optimal estimate .

3.3. Implementation of Weak Pulse Signal Extraction for the DLTAR Model

According to the foregoing, the DLTAR model is implemented in the form of pseudocode and process structure diagram for weak pulse signal extraction. The pseudocode (Algorithm 1) is as follows, and the extracted process structure is shown in Figure 3.

Inputs: observed signal data with chaotic noise and parameters , etc.
 Outputs: the optimal value of the parameters and and the weak pulse signal .
 Begin
 Standardize raw data and reconstruct phase space
 Initialization of parameters such as .
 Initialize the distribution, such as ; is discrete uniform distribution; ; is the inverse gamma distribution.
 While
  Check for the presence of according to Section 2;
  Calculate according to the DLTAR model constructed according to (13);
  Updating the parameter with the profile least squares algorithm;
  Bring the estimated into the pulse signal and calculate ;
 End while
 Output the optimal parameter value, and calculate the weak pulse signal.
 End
 Note: (1) The delay time and embedding dimension remain unchanged. Since is a weak signal, the delay time and embedding dimension in phase space reconstruction are basically not affected. (2) When calculating the derivative, a differential algorithm is used because all data is discretized.

4. Simulation Experiment Results and Analysis

Experimental environment is as follows: Intel CPU Core i3 processor, main frequency 3.2 GHz, memory 4 GB, 32-bit Windows 7 operating system, R 3.5.1 programming environment.

In simulation experiment, Lorenz and Rossler system is used to generate chaotic noise, and the signal-to-interference ratio (SIR) is used to measure signal and interference strength. Accuracy rate (Acc), ROC curve (receiver operating characteristic curve), and AUC (area under ROC curve) are used to evaluate performance evaluation criteria of detector. Relative error (Re), mean square error (MSE), and mean absolute percentage error (MAPE) are used as extraction accuracy.(1)(2)(3)ROC: dichotomous problem evaluation index, with FPR as the abscissa and TPR as the ordinate(4)AUC: area under ROC curve(5)(6)(7)where , , and are the mean values of and , is the variance of the white noise , TP is the number of times that there is a pulse signal and is judged to have a pulse signal, TN is the number of times that there is no pulse signal and is judged to be no pulse signal, FP is the number of times that there is a pulse signal when there is no pulse signal, and FN is the number of times when there is a pulse signal to determine that there is no pulse signal, , and .

Lorenz system equation is as follows:where is a function of time and parameters, , and sampling time is second, which is solved by the fourth-order Runge–Kutta method. Take 3000 to 7000 points to represent chaotic noise, denoted by .

Rossler system equation is as follows:where is a function of time and parameters , , and sampling time is second, which is solved by the fourth-order Runge–Kutta method. Take 3000 to 7000 points to represent chaotic noise, denoted by .

The delay time and embedding dimension of observed signals are calculated according to the complex autocorrelation method and Cao method [41] ().

4.1. Experiment 1: Detection Experiment for the PTAR Model

Take transient pulse signals at eight different times, , where ,and pulse signal data of length 4000 is generated by formula (19); at this time, SIR reaches −65.9707 dB.(1)The pulse signal is detected by the PTAR model, and the detection result is shown in Figures 4 and 5.Figures 4 and 5(a) show the original chaotic noise signal, Figures 4 and 5(b) show the superimposed chaotic noise and the observed signal of target signal, Figures 4 and 5(c) show the one-step fitting prediction error of the PTAR model, and Figures 4 and 5(d) show a plot of the value of test statistic D. Comparing Figures 4 and 5(a) with Figures 4 and 5(b), the observed signal is basically consistent with nonlinear chaotic characteristics of chaotic noise; that is to say, weak pulse signals do not fundamentally change the structure of chaotic attractor; therefore, the signal reconstruction is basically not affected, and the historical similarity of phase points in chaotic high-dimensional space can be effectively utilized to perform model fitting. From the residual value in Figures 4 and 5(c), the moment with pulse signals has a large error, which can be judged by the naked eye. However, in order to adapt to adaptive judgment of machine, the value is further calculated here, as shown in Figures 4 and 5. It can be seen from the value in Figures 4 and 5(d) that the value is lower than the critical value at the time of pulse signals, and the weak pulse signals at different times can be effectively detected. At this one time detection, Acc is 100%. So the PTAR model is effective for different chaotic noise backgrounds.(2)Comparison results of ROC curves under different models are shown in Figure 6. Here, we compare the LS-SVM model (abbreviated as LS-SVM) [26] and RBF neural network model (abbreviated as RBF-NN) [29]. It can be seen from Figure 6 that the PTAR detection model is optimal. AUC values of three detection methods are 0.9987 (PTAR), 0.9862 (LS-SVM), and 0.9571 (RBF-NN), which further quantify that performance of the detection model in this paper is superior.

4.2. Experiment 2: Extraction Experiment of Weak Pulse Signals

It is assumed that 4000 points of observation signal contain pulse signals of different amplitudes at 10 times, and the extraction results of the DLTAR model are shown in Figure 6 and Table 1.

Table 1 shows the extracted results of DLTAR models for 10 different moments and different amplitudes. From relative error (Re), the magnitude scale is 10e − 5, the maximum relative error is 0.014%, and the minimum relative error is 0.31746e − 5. Therefore, the extraction accuracy based on the DLTAR model is high, the relative error is within the acceptable range, and the extracted performance is excellent. Specific estimation results, mean square error (MSE), and mean absolute percentage error (MAPE) are as shown in Figure 6.

Since the extracted value is close to the true value, the extracted value in Figure 7(a) is offset to the right by 30 display units, and SIR is −64.2382 dB. Figures 7(b) and 7(c) show the iterative convergence curves of MSE and MAPE. MSE approximately converges to 0.1065e − 05, and MAPE approximately converges to 0.032587e − 03 and converges around 40 steps.

4.3. Experiment 3: Pulse Signal Extraction Experiment Based on the PTAR-DLTAR Model with Different SIRs

Considering weak pulse signals in the case of ten different pulse magnitudes, the pulse signals in each case are composed of four pulse signals at different time points, and the signal-to-interference ratio (SIR) is adjusted by changing the magnitude of pulse intensity. 4000 points of the observed signal is selected as sample, and the experimental results are shown in Table 2:

From Table 2, the pulse signal amplitude is from strong to weak. At this time, SIR is high to low, and extraction accuracy of the DLTAR model is degraded. When SIR is greater than −124 dB, MSE and MAPE are relatively small, and extraction of the weak pulse signal is better. However, when SIR is less than −124 dB, the accuracy of MSE and MAPE is significantly reduced. At this time, the DLTAR model gradually loses its effect probably because the pulse signal is too weak relative to the noise so that the target signal value cannot be effectively extracted and is suppressed as noise. Therefore, when SIR is higher than −124 dB, the DLTAR model is superior in recovering the weak pulse signal. At this time, both MSE and MAPE are relatively small, so DLTAR estimation model can realize the recovery of weak pulse signals under extremely low signal-to-interference ratio.

4.4. Experiment 4: Performance Comparison with SVM-Based and Neural Network-Based Estimation Models

This paper compares common machine learning estimation methods, such as dual-constrained LS-SVM model (abbreviated as D-SVM), GA-SVM model (abbreviated as GA-SVM), and LS-SVM model (abbreviated as LS-SVM) [26], and the RBF neural network model (abbreviated as RBF-NN) [29]. The pulse signal takes a value ofusing the signal-to-interference ratio and mean square error for comparative analysis of extracted performance. The specific results are shown in Figure 8 and Table 3.

It can be seen from Figure 8 and Table 3 that when SIR reaches −92.12 dB, the estimated performance is excellent, the estimated value and the true value are in controllable range, the true value is 1.5, the estimated value is 1.50023, and MSE is 0.1058e − 05. Compared with support vector machine and RBF neural network model, the estimation accuracy for the presented method is higher and signal-to-interference ratio is lower. The specific results are shown in Table 3.

5. Conclusions

Aiming at the detection and extraction of weak pulse signals under strong chaotic noise, nonlinear characteristics of chaos and linear representation of pulse signals are combined to construct the phase space threshold autoregressive detection model (PTAR detection model) and double layer threshold autoregressive extraction model (DLTAR extraction model). The extraction model built in this paper is directly related to the detection model, adopts the same model structure, and does not require additional prior knowledge. It can realize the detection and extraction of weak pulse signals under extremely low signal-to-interference ratio, which is efficient and easy. It can be known from the simulation experiment results that the PTAR detection model can effectively detect weak pulse signals from chaotic noise. The DLTAR extraction model has excellent extraction performance and high precision. Its relative error is less than 0.014%, mean square error is not higher than , and MAPE is as low as . It can be seen from the detection and extraction experiments of pulse signals under different signal-to-interference ratios that the PTAR-DLTAR model can achieve a lower signal-to-interference ratio working threshold in the background of chaotic noise and mean square error of extraction accuracy is maintained at the 10e − 4 magnitude level; that is, when SIR is −124.19 dB, MSE is . Compared with support vector machine and RBF neural network model, the detection ability of the proposed model is stronger and the extraction effect is better.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This paper was supported by the Fundamental and Advanced Research Project of CQ CSTC of China (Grant no. cstc2018jcyjX0464), National Natural Science Foundation of China (Grant nos. 11871124 and 11471060), and Chongqing University of Technology Foundation (Grant no. 2016ZD33).