Abstract

Knowing source number correctly is the precondition for most spatial spectrum estimation methods; however, many snapshots are needed when we determine number of wideband signals. Therefore, a new method based on Bootstrap resampling is proposed in this paper. First, signals are divided into some nonoverlapping subbands; apply coherent signal methods (CSM) to focus them on the single frequency. Then, fuse the eigenvalues with the corresponding eigenvectors of the focused covariance matrix. Subsequently, use Bootstrap to construct the new resampling matrix. Finally, the number of wideband signals can be calculated with obtained vector sequences according to clustering technique. The method has a high probability of success under low signal to noise ratio (SNR) and small number of snapshots.

1. Introduction

Detecting wideband source number is one of the hot and difficult problems in the area of array signal processing; meanwhile, it is widely used in many fields, such as radio monitoring [15], acoustic signal processing [610], and Internet of Things [1114]; the corresponding methods are also being studied by a lot of scholars all the time. Generally speaking, knowing number of signals is also the premier condition of estimating direction of arrival (DOA). The initial method is based on hypothesis test [15]; as the detecting threshold needs to be set artificially, it is easy to be influenced by individual subjective factor. In order to avoid this problem, Wax and Kailath introduce the information theoretic criteria [16] into the question; then minimum description length (MDL) [17] and Akaike information criterion (AIC) [18] were put forward, improving the detecting performance to some extent. Liu et al. [19] estimated signal number and directions simultaneously by spatial difference method based on uniform linear array. Han and Nehorai [20] increased the array aperture with folding technology; thus, more signals can be processed. Liu et al. [21] set the detection threshold by proposing the concept of effective source number and distinguished the signals which were close to one another. Goldstein et al. [22] proposed an effective filtering technology based on reducing dimension in the late 1990s; it does not need to compute the covariance matrix and has a preferable responsiveness to time-varying signals. However, there are few published literatures for wideband signals; one of them is based on coherent signal method (CSM) [23] proposed by Wang and Kaveh, which focuses the signals on the reference frequency, and then employs MDL or AIC to obtain the final result. All the methods above need many snapshots, or their performance will deteriorate seriously. Unfortunately, they are difficult to be implemented in actual system: firstly, sampling time is strictly confined; for example, the impulse of radar is often narrow, and the data can be limited, especially for wideband signal; secondly, the signal can not be stable for a long time. So it is difficult to acquire more effective samples, which restricts the application of the methods above.

Bootstrap is another kind of technique arising in recent years which can be used with only a few samples [2430]; some scholars highlighted the motivation for using the Bootstrap in signal processing [31, 32]. For signal number estimation, it needs to calculate the number of the minimum equal eigenvalues of the sampling covariance matrix based on the multiple hypothesis test, and it has a higher precision comparing with information theoretic criteria under the same circumstance. In 2000, Brcich used Bootstrap for estimating source number for the first time [27]; after a few years, he improved the testing threshold by the Bonferroni multiple hypothesis test, enhancing the performance further [28, 29]. Huang et al. [30] used the eigenvalues of one frequency to form the testing threshold in combination with Bootstrap resampling to estimate wideband source number; as they only exploited the information of only one frequency, the estimation accuracy is limited.

In this paper, a new method for detecting wideband source number is proposed. The approach is based on Bootstrap technique and makes full use of the information of every frequency. First, the signals are divided into some nonoverlapping subbands, following focusing them on the same frequency. Then, the eigenvalues are fused with corresponding eigenvectors of the focused covariance matrix. Subsequently, Bootstrap is employed to construct the new resampling matrix. Finally, the number of wideband signals can be calculated with obtained vector sequences according to clustering technique.

2. Array Signal Model

Consider a uniform linear array (ULA) composed by omnidirectional sensors with spacing ; the first sensor is defined as the phase reference and the array signal model is shown in Figure 1.

Assume that far-field wideband signals impinge on the array, their directions are , observed time is , and the received data by the array at time are expressed asfor , where is the output of the th sensor, is the propagation speed of the signals, and is the additive white noise subjected to Gaussian distribution .

Suppose that the frequency range of all signals is , data are collected at every frequency, and a -point discrete Fourier transform (DFT) is applied to (1); the array output of the th frequency can be expressed asHere for , , and is the sample data on the th sensor of at snapshot , and is the array steering matrix with dimension, whereThe signals , within different frequency bins, are assumed to be Gaussian distributed, and they are uncorrelated with the noise ().

3. Proposed Method

3.1. Focusing

The covariance matrix at can be solved byThen, we can acquire the covariance matrix with the single frequency by transforming the signals at each frequency into a covariance matrix at a focusing frequency based on two-sided correlation transformation (TCT) method [33], and so forth we can choose the center frequency as . Apply eigendecomposition to ; we haveThen, the eigenvalues can be obtained, and the corresponding eigenvector is , where corresponds to signal subspace and corresponds to the noise subspace.

3.2. Fusion

Since both and belong to the signal subspace, we havewhere is the corresponding coefficient, and the inner product of the eigenvector is defined byThe signal and noise subspace are orthogonal to each other, soAccording to (6) and (8), (7) can be represented asTraditional information theory criterion only exploits the information of eigenvalues, but when the snapshots are small, we should make full use of the sampling data; since the eigenvectors also carry information of the signal, let us fuse with :Here, is defined as the fusion inner product, and is the fusion inner product vector. According to [34], the optimal are chosen as

3.3. Bootstrap Resampling

We can employ Bootstrap technique to determine number of signals; on the one hand, there is no need to know the probability density distribution of the testing statistics; on the other hand, it is still effective under the circumstance of small number of snapshots. Suppose that is the Bootstrap resampling matrix of , where is a random column selected from and each selected column is uncorrelated to one another, execute the same operation to the signals of the other frequencies, and we can acquire , the corresponding focused covariance matrix , and the fusion inner product vector . Similarly, repeat the Bootstrap resampling, focusing, and fusion for times; we will calculate groups of focused covariance matrices and the corresponding inner product vector sequences ; average them to obtain the new vector , where the new element   .

3.4. Clustering

Signals and noise can be distinguished by clustering the elements in the vector ; the larger ones belong to the signals, as there are large differences among them. While the others belong to the noise, they are smaller and close to one another. Here, clustering algorithm [35] is employed to estimate the source number; it is summarized as follows:

Select the initial class mean of the signal and noise:

In the th iteration, every element is endowed with the signal or the noise class according to the criterion below, to the :where denotes the noise class and denotes the signal class; that is to say, if the ratio of a certain element and noise class mean is larger than that of the signal class mean and the certain element, then the element belongs to the signal; otherwise, it belongs to the noise; this means that we assign every element to the nearest class.

Calculate the new value of each clustering center:where is the number of the element in .

If (), there will be no new elements turning over each class, the algorithm is convergent, and the iteration is over; otherwise, turn to step and go on to next iteration; thus, the number of signals can be decided.

It is seen from the deduction above that we can estimate signals at most. Since the proposed method employs fusion, Bootstrap resampling, and clustering, it can be called FBC method for short.

Or we can omit the process of fusion, then only use Bootstrap resampling and focusing to construct the new covariance sequences , and subsequently average them; we haveThen, apply clustering above to the eigenvalues of to decide the signal number, but this method does not utilize the eigenvectors; similarly, we can call it BC for short.

4. Simulations

In order to verify the effectiveness of the method, some simulations are presented with Matlab below; consider some binary phase coded wideband signals impinging on a uniform linear array with 8 omnidirectional sensors, the center frequency is 1 GHz, the width of the band is 1 GHz × 20% = 0.2 GHz, signals are sampled with 20 frequency bins in the frequency domain, the spacing between adjacent sensors  m, FBC, BC, and MDL based on CSM (CSM-MDL) are, respectively, employed for the simulations, 200 Monte-Carlo trials have run for each variable, and, in FBC and BC, .

In the first simulation, five far-field wideband uncorrelated signals with the same power impinge on the array in ,,   , , and ; the noise is assumed to be Gaussian-distributed; the resample times ; Figure 2 presents the probability of detection as a function of SNR when the number of snapshots is 30, while Figure 3 shows that of the number of snapshots when SNR is 3 dB.

It is seen from Figure 2 that all the probabilities of success of these methods are improving with the increasing of SNR; in comparison, FBC is better than the other two methods; it is 100% when SNR reaches −6 dB and that of the BC and CSM-MDL can reach 100% when SNR is 0 dB and 3 dB, respectively. And it is seen from Figure 3 that when the snapshots are small, FBC is better than BC and CSM-MDL, and all the probabilities of success of these methods are improving with increasing of the snapshots; in comparison, the probabilities of success of FBC can reach 100% when the snapshots are 22 and that of the BC and CSM-MDL can reach 100% when the snapshots are 28 and 32, respectively.

In the second simulation, five far-field wideband uncorrelated signals with the same power impinge on the array in , ,  , , and ; the noise is assumed to be non-Gaussian-distributed; it is obtained through an autoregressive (AR) filter [36] of coefficient 0.9; the resample times ; Figure 4 presents the probability of detection as a function of SNR when the number of snapshots is 30, while Figure 5 shows that of the number of snapshots when SNR is 3 dB.

It is seen from Figure 4 that when the SNR is low, all the probabilities of success of these methods are improving with its increasing; when SNR reaches a certain extent, CSM-MDL is fluctuating and unstable, but all of them can reach 100% at last. And it is seen from Figure 5 that when the snapshots are small, all the probabilities of success of these methods are improving with the snapshots increasing. In comparison, FBC is better than BC and CSM-MDL; when snapshots reach a certain extent, CSM-MDL is fluctuating and unstable, while FBC and BC are still available relatively, but they still cannot reach 100%, so their estimations are not consistent for the non-Gaussian white noise.

In the third simulation, five far-field wideband coherent signals with the same power impinge on the array in , ,  , , and ; the noise is assumed to be Gaussian-distributed; the resample times ; Figure 6 presents the probability of detection as a function of SNR when the number of snapshots is 30, while Figure 7 shows that of the number of snapshots when SNR is 3 dB.

It is seen from Figure 6 that these methods can estimate the number of wideband coherent signals successfully when SNR is high enough, and all the probabilities of success of these methods are improving with the SNR increasing; in comparison, FBC is better than BC and CSM-MDL.

It is seen from Figure 7 that all these methods can estimate the number of wideband coherent signals successfully when snapshots are large enough; the probabilities of success of these methods are improving with the snapshots increasing. In comparison, FBC is better than BC and CSM-MDL. In general, since the course of focusing can extract the coherence, there are no distinct differences with uncorrelated signals for the result.

In the fourth simulation, five far-field wideband uncorrelated signals with the same power impinge on the array in , ,  , , and ; the noise is assumed to be Gaussian-distributed; Figure 8 presents the probability of detection as a function of resampling times when the number of snapshots is 20 and SNR is 3 dB, while we use digital signal processor ADSP-TS201S to implement these methods; the computational complexities and average calculating time are shown in Table 1.

It is seen from Figure 8 that the probabilities of success of the two methods are improving with the resampling times increasing. In comparison, as the FBC makes use of not only the eigenvalues but also the eigenvectors, it yields better detection performance than BC, so we should fully exploit the information of the eigenvectors and resampling under the circumstance of small number of snapshots.

It is not difficult to deduce that CSM-MDL needs the process of focusing and formula computation, so its computational complexity is ; BC needs focusing, resampling, and clustering, and the complexity is ; FBC needs focusing, fusion, resampling, and clustering, and the complexity is . And it can be seen from the calculating time that the resampling spends a long time; it will increase the computation time by multiples, while focusing and fusion are not obvious relatively; we can reduce the resampling times properly and employ multiple digital signal processors to increase efficiency in practical applications.

5. Conclusions

The paper proposes a method for determining the number of wideband signals based on Bootstrap resampling; after focusing the signal on the reference frequency, we make use of fusing the eigenvectors with the corresponding eigenvalues, subsequently we employ the Bootstrap to construct the resampling matrix, and then we use clustering to decide signal number. The simulations have shown that the performance of the proposed method is better than that of information theory criterion; it adapts to uncorrelated and coherent signals. Moreover, it also has an effect on the non-Gaussian noise.

Bootstrap resampling is used to estimate wideband signal number in this paper; however, due to the process of the resampling, its calculation is very complicated. Our work will be committed to optimizing the method to lower the computation complexity in future.

Competing Interests

The author declares that there are no competing interests.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (61501176) and the Education Department of Heilongjiang Province Science and Technology Research Projects (12541638).