Abstract

A spatial filtering-based relevance vector machine (RVM) is proposed in this paper to separate coherent sources and estimate their directions-of-arrival (DOA), with the filter parameters and DOA estimates initialized and refined via sparse Bayesian learning. The RVM is used to exploit the spatial sparsity of the incident signals and gain improved adaptability to much demanding scenarios, such as low signal-to-noise ratio (SNR), limited snapshots, and spatially adjacent sources, and the spatial filters are introduced to enhance global convergence of the original RVM in the case of coherent sources. The proposed method adapts to arbitrary array geometry, and simulation results show that it surpasses the existing methods in DOA estimation performance.

1. Introduction

The problem of direction-of-arrival (DOA) estimation with sensor arrays is common in various applications, such as radar, sonar, and wireless communications [14], and the requirement for finding the directions of coherent sources also widely emerges due to the factors like multipath and jamming. Most of the existing superresolution methods, for example, the subspace-based ones [1], do not adapt well to such a problem in their original form, and further research has been carried out to improve their adaptability in the coherent scenarios [2, 57]. The method in [2] aims only at CDMA signals with known codes, and those in [57] are general-purpose ones that apply to common narrowband signals. Pillai and Kwon proposed making up for the deficient rank in the array output covariance matrix via spatial smoothing [5], whose performance may deteriorate if the sources are spatially adjacent. Then Moghaddamjoo introduced the idea of spatial filtering to better decorrelate the signals [6]. After that, Delis and Papadopoulos promoted the filtering idea further by combining it with the forward and backward processing [7]. The spatial smoothing and filtering techniques require shift invariance property in the array; thus, they adapt to special array geometries such as the uniform linear arrays (ULA) only [57]. Moreover, after spatial preprocessing, the above methods exploit ordinary subspace-based techniques to realize DOA estimation; thus, they all lack adaptability to demanding scenarios, such as low signal-to-noise ratio (SNR), limited snapshots, and spatially adjacent sources, as good performances of the techniques rely heavily on high-precision covariance matrix and subspace estimates.

The recently emerging technique of sparse reconstruction has attracted much attention from various areas, and it also provides a new perspective for DOA estimation. Existing sparsity-inducing DOA estimation methods include FOCUSS [8], L1-SVD [9], JLZA-DOA [10], CMSR [11], RVM-DOA [1214], and so forth. Those methods combine the spatial sparsity of the incident signals when fitting the array output with an overcomplete model and show improved adaptation to the above mentioned demanding scenarios [814]. As they realize DOA estimation by reconstructing the array output, instead of exploiting the covariance matrix, some of them also perform well for correlated and coherent sources [810]. However, the way for different methods in describing the spatial sparsity of the incident signals diverges largely. The L0-norm cost function is canonical as it stands for the model dimension exactly, but it introduces too many local maxima in the objective function and makes the global maximum hardly attainable. Therefore, Lp-norm () cost functions are used instead to mitigate the influence of the local maxima at the expense of possibly biased global maximum [8, 9, 1215]. Another technique for combining model sparsity with data fitting is the relevance vector machine (RVM) [16], and conclusions have been obtained to indicate that the RVM has identical global maximum as the L0-norm based function and greatly decreased local maxima if the signals are independent [15]. It is even unimodal at high SNR when the basis amplitudes are highly scaled [15]. Based on those advantages of the RVM, our method of RVM-DOA surpasses the Lp-norm based ones in DOA estimation precision for independent sources [12]. Nonetheless, when the incident signals are correlated, the superiority of the RVM is no longer guaranteed [17], and unbiased DOA estimates may become unattainable.

In this paper, we extend RVM-DOA [12], which was originally designed for independent sources, to coherent source scenarios by separating the sources with spatial filters. After such separation, the original multiple-coherent-source data is decomposed into more than one single-source components, thus improved global convergence and DOA estimation precision are expected for the RVM-based methods. The filter parameters and DOA estimates are updated via sparse Bayesian learning after reasonable initialization with the independent RVM-DOA method. The new method adapts to arbitrary array geometries, which greatly relaxes the restrictions of previous methods on the shift invariance of the array that has constrained their adaptation to special arrays like ULA only. Empirical results show that the new method obtains enhanced applicability in challenging environments and improved DOA estimation precision under moderate settings.

The rest of the paper mainly consists of four parts. Section 2 reviews the coherent signal model. Section 3 introduces the new method in detail. Section 4 demonstrates the performance of the proposed method via simulations. Section 5 concludes the whole paper.

2. Model Formulation

Suppose that coherent signals impinge onto an -element array from the directions of simultaneously; the array output at time is where , is the array responding vector to the incident signal, is the phase shift of this signal when propagating from the reference point to the sensor, is the source waveform vector, and is the zero-mean white Gaussian noise with known unified covariance matrix and unknown power . In order to simplify notation, we omit the time identifier in (1) on the variables in the case of single snapshot when no confusion emerges and use and to represent the mappings from a direction and direction set to the corresponding array responding vector and matrix, respectively.

Due to the coherence between the incident signals, the source covariance matrix is singular, and the signal subspace of the array output covariance matrix is rank deficient. Thus preprocessing techniques, such as spatial smoothing and filtering, are required to make up for the deficient rank by exploiting the shift invariance property of special arrays [57]. Although those techniques help to enhance the adaptability of the ordinary subspace-based methods to coherent signal scenarios to some extent, they perform poorly when the covariance matrix and subspace estimates are greatly inaccurate in cases like low SNR, limited snapshots, and spatially adjacent sources.

Some of the sparsity-inducing DOA estimation methods apply to coherent sources directly [810]. However, in such scenarios, the snapshots can be rewritten as , where , , is the waveform of the reference signal, and contains the amplitudes and phases of the incident signals with respect to the reference one. This form of the array output implies that, when the sources are coherent, more snapshots only help increase the SNR but do not enhance global convergence of the sparsity-inducing methods by smoothing away some of the local maxima [15]. Therefore, although those methods still work in the coherent scenarios, they may not obtain unbiased and high-precision DOA estimates.

3. Spatial Filtering-Based Relevance Vector Machine (RVM) for Coherent DOA Estimation

In this section, we first review the overcomplete model for the sparsity-inducing DOA estimation methods and analyze how the previous independent RVM-based method performs in the case of coherent sources. Then in Section 3.2, we introduce the basic idea of the spatial filters to be used to separate the coherent signal components. After that, we discuss in detail how the spatial filter parameters are updated and the DOA estimates are refined iteratively via Bayesian learning. Finally, we outline the proposed method by joining the main procedures together and complementing them with reasonable initializations in Section 3.4.

3.1. Overcomplete Model and Problem Formulation

In the sparse reconstruction-based DOA estimation methods, the potential space of the incident signals is sampled discretely to obtain a direction set with cardinality of to contain all the possible signal directions. The corresponding array responding matrix is denoted by . Generally, is much larger than to meet the requirement in superresolution; thus, is overcomplete. The array output given by (1) can then be rewritten as where is the extension of from to and has nonzero amplitudes only for the -associated bases. The model mismatch between (1) and (2) due to the discretization of is skipped over to facilitate notation unless otherwise stated; such approximation does not influence the implementation of those methods. The signal directions can be estimated according to the locations of the nonzero magnitudes in if it can be estimated. However, the equation given by (2) is ill posed and it is impossible to estimate unless further sparsity constraints on , which indicates that only a few signals impinge simultaneously, are added. When multiple snapshots are collected, one needs to estimate from .

In RVM, the columns of are assumed to be independent and Gaussian distributed; that is, , with and . Such an assumption leads to the following probability function of with respect to and : where . Its logarithmic form is used to estimate and ; that is, The function of can also be rewritten as if intersource correlation exists and the diagonal structure of is deviated.

Then the expectation-maximization (EM) algorithm [18], which has already been used for wideband DOA estimation [19, 20], can be used to maximize and estimate the source directions [12]. We have shown in [12] that the RVM-based method, if well designed, surpasses its counterparts with significant margin in DOA estimation performance when the incident signals are independent. However, how it will behave in the coherent source settings still remains unknown. Fortunately, some conclusions available in the related neural source localization area have provided some clues about the behavior of the independent RVM when intersource correlation exists [17].

Denote , with standing for the row of ; then, according to Theorem  1 in [17], the following propositions can be obtained to indicate the behavior of RVM when applied to independent and correlated DOA estimation.

Proposition 1. Suppose that the SNR is high and the number of the incident sources is smaller than that of the sensors, then and own the following properties.(1)If for each two sources, then is unimodal and no local maxima exists.(2)For arbitrary , the unique global maximum of , denoted by (while is negligible), produces the true source matrix ; that is, it achieves perfect source reconstruction.

Proposition 1 is a straightforward extension of Theorem  1 in [17]. Property (1) of Proposition 1 implies that well-designed RVM is able to achieve consistent DOA estimation for independent sources, as our RVM-DOA method has done [12]. Property (2) tells that accurate source covariance matrix estimation, which is associated with accurate DOA estimates, is necessary for precise source matrix reconstruction in the case of coherent sources. However, as some of the off-diagonal elements of the coherent source covariance matrix have significant nonzero magnitudes, the original RVM should be modified to take all possible intersource correlations into consideration, so as to make global convergence achievable. Unfortunately, all the elements of should be estimated under such a modified model, and the great increase in the number of unknown parameters not only aggravates the computational burden of the RVM prohibitively, but also increases the risk of overfitting [21].

In addition, it should be noted that, although the independent RVM-based DOA estimation method is not able to perform well enough in the case of coherent sources, it still converges to the vicinity of the global maximum [22]. Therefore, we use the independent RVM-DOA method in [12] to obtain coarse DOA estimates in this paper.

3.2. Basic Ideas for Designing the Spatial Filters

According to Proposition 1 and the analysis that follows, the global maximum of can hardly be achieved by RVM in the case of coherent sources, and the corresponding DOA estimation method gets trapped in local maxima easily. In order to improve the DOA estimation precision, we design a group of spatial filters to separate the incident signals from the array output. After the separation, the original multiple-coherent-source array output decomposes into a group of single-source components, and the convergence of RVM based on each component obeys the unimodality described in Property (1) of Proposition 1.

Define the required filters by , with used to separate the source from the array output, which satisfies where is a certain function of . As and are random and unknown, (5) can be transformed into the following equations for certain : where is the indicative function that takes nonzero value of 1 only when .

Equation (6) indicates that is orthogonal to the column subspace of ; thus, the dimension of is and its explicit value is determined by . An intuitive selection of is which means reserving the array output from the first sensors. In the following, we present a way for designing such a that satisfies (6) with given by (7).

Denote the responding matrix of the subarray consisting of the to elements of the original array to sources impinging from directions of by , and also define , with its row denoted by ; then, where .

Based on the above property of the subfilters, we extend them in dimension to act on the whole array output to separate the incident signals. Define , whose row is ; then, where . It can be concluded from (9) that the signal can be extracted from the array output provided that the spatial filter is well-designed. However, the parameters of the filters, that is, the signal directions, are not available in practice. Therefore, we introduce an iterative procedure to refine the signal directions and update the filter parameters simultaneously.

3.3. Bayesian Learning for Filter Update and DOA Refinement

It has been noted at the end of Section 3.1 that the independent RVM-DOA can be used to obtain coarse DOA estimates for coherent sources, which are located in the vicinity of the true source directions with somewhat significant biases. We then refine those estimates iteratively via Bayesian learning.

To keep in line with the iterative format of the learning techniques, we use to identify the updating variables in the iteration. Denote the output of in the case of multiple snapshots by and the extension of from to by , which is zero-mean and Gaussian distributed as with , and the noise contained in , that is, , is also zero-mean and Gaussian distributed with variance of ; then, the posterior probability of with given is where the source-intrinsic noise variance of is introduced because they will be updated independently and may differ from each other, The likelihood function of with respect to and is where .

The logarithmic likelihood can then be maximized by maximizing and alternatively with respect to and . Firstly, can be maximized with the EM algorithm [18] as follows: where and are computed according to (11), , , and that represent the vector element, the matrix row, and the matrix element, respectively, and stands for the 2-norm of a vector. Based on and , the signal direction can be obtained and the noise variance is updated by projecting the filter output onto the noise subspace [12]; that is, where , and is the orthogonal subspace of . The way for estimating is presented in the following.

The obtained in the above iteration reflects the power distribution of the filter output on the predefined discrete direction set, and further efforts should be made to obtain refined DOA estimate of the source, denoted by , from it. Assume that the peak cluster (the peak corresponding to each signal generally consists of two significant magnitudes on the left and right hand side of the true signal direction; we call them a peak cluster) of is located at , and denote , that is, the direction set by removing and from , , with equaling by removing the two rows and columns that associate with ; then, can be deemed to be the covariance matrix of by removing the signal component. Complementing with the signal according to its power and direction , instead of a peak cluster, yields , and a refined direction estimate can be obtained by maximizing based on this form according to as follows [12]: where . The signal direction estimate is derived from (15) according to the peak location by scanning the scope of with required precision.

After updating the source directions with (15), the spatial filters can be redesigned with them according to and ; the filter outputs can then be recomputed to prepare for the next iteration. Such iterations are repeated until a predefined convergence criterion is satisfied.

We note here that the spatial filters used to separate the incident signals do not require any special property in the array geometry such as shift invariance [57]; hence, the new method can be applied more comprehensively than the existing counterparts.

3.4. Outline of the New Method

We name the new method spatial filtering-based relevance vector machine for DOA estimation (SF RVM-DOA for short), and outline its main procedures in Table 1.

Some supplements listed as follows should be added to complete the proposed method.

Remark 2. When turning from the Initialization procedure to the Refinement procedure, the ’s are derived from by setting the other peak clusters to zeroes.

Remark 3. The EM procedure is generally much slow in its original form and is usually speeded up with the fixed-point iteration of , where is a small positive value, such as [1214, 16].

Remark 4. The primary and final convergence criteria are application dependent. We set them as and in this paper, where and stands for the Frobenius-norm of a matrix. In the simulation section, we set and .

4. Simulation Results

In this section, we carry out simulations to demonstrate the performance of the proposed method (named SF RVM-DOA) in coherent DOA estimation. Four other existing methods are also listed for comparison, including spatial smoothing MUSIC (named SS MUSIC) [5], spatial filtering MUSIC (SF MUSIC) [7], L1-SVD [9], and independent RVM-DOA [12]. Among those methods, L1-SVD, RVM-DOA, and SF RVM-DOA adapt to arbitrary array geometries, while SS MUSIC and SF MUSIC require shift invariance property in the array geometry. Therefore, in order to facilitate comparison, we choose a 10-element ULA for the simulations, whose interelement spacing equals half wavelength. Moreover, as only RVM-DOA and SF RVM-DOA realize joint detection and estimation, while the other three methods need to know the source number beforehand, we assume that the source number is known to make the DOA estimation performance comparison fairer. The three sparsity-inducing methods, that is, L1-SVD, RVM-DOA, and SF RVM-DOA, sample the [−90°90°] space with 1° interval to obtain the direction set , and L1-SVD refines the grid to 0.01° afterwards to improve the DOA estimation precision. The SF MUSIC method is stopped after 5 repetitions, and the maximum iteration numbers of RVM-DOA and SF RVM-DOA are capped at 1000. Two equal-power sources are assumed to impinge onto the array simultaneously, and 300 independent trials are carried out in each scenario. The additive noise is assumed to be spatially white; that is, .

In the first experiment, we fix the snapshot number at 30 and the angle separation of the two sources at 10°, and their directions are and , respectively, with chosen uniformly and randomly in [−0.5°0.5°] in each trial. The differential phase shift of the two signals is set randomly within [02π] in each trial. The SNR of each signal is increased from 0 dB to 20 dB, and the averaged root-mean-square errors (RMSEs) of the DOA estimates of the two sources are given in Figure 1, with the CRLB depicted by the dash dotted line. The results in Figure 1 indicate that the three sparsity-inducing methods succeed in separating the two sources at lower SNR, but among them, L1-SVD and RVM-DOA have larger DOA estimation RMSE than the two subspace-based methods when the SNR is higher than 6 dB. SF RVM-DOA achieves high-precision DOA estimation at a SNR of about 2 dB, which is 5 dB lower than that of the two subspace-based methods, and its DOA estimation precision surpasses the other four methods above this threshold and gradually approaches the CRLB as the SNR increases.

Then fix the angle separation of the two sources at 10° and their SNR at 7 dB; the directions and phase shift of the two sources are set similarly as the first experiment. When the snapshot number increases from 1 to 100, the averaged DOA estimation RMSE of the five methods is shown in Figure 2. The results indicate that the three sparsity-inducing methods are able to separate the two sources with only 4 snapshots, while the two subspace-based methods require more than 30 snapshots. As the snapshot number increases, the DOA estimation precision of L1-SVD and RVM-DOA improves only slightly and RVM-DOA surpasses L1-SVD with a small margin. However, the RMSE of the other three methods decreases almost synchronously with the CRLB, and SF RVM-DOA obtains higher precision than the other two methods and is closer to the CRLB.

Finally, fix the SNR of the two sources at 7 dB and the snapshot number at 30, and increase the angle separation of them from 6° to 16°. The directions of the two sources are set to be and , with and the intersource phase-shift randomly chosen as that in the above two experiments. The averaged DOA estimation RMSE of the five methods in such a scenario is given in Figure 3. It can be concluded that, although L1-SVD owns enhanced superresolution than the other methods, its performance improves only slightly as the two sources are farther apart. When the two sources are separated by more than 10°, L1-SVD performs the worst among all the five methods. RVM-DOA and SF RVM-DOA surpass L1-SVD in DOA estimation precision when the angle separation is larger than 8° and improves at a higher speed than it, and SF RVM-DOA gains the highest precision. The two subspace-based methods can only separate two sources spaced by more than 10°, with their DOA precision lower than SF RVM-DOA all along, and is also exceeded by RVM-DOA when the angle separation of the two sources is larger than 14°.

5. Conclusions and Discussions

In this paper, we introduced the relevance vector machine (RVM) to estimate the directions of coherent sources and designed a group of spatial filters to separate the incident signals from the array output and guide the RVM to better convergence. The proposed method adapts to arbitrary array geometry, and simulation results show that it shares similar superiority with other sparsity-inducing methods in adaptation to low SNR, limited snapshots, and spatially adjacent sources. Moreover, it performs much better than all the subspace-based and sparsity-inducing counterparts in DOA estimation precision in moderate settings.

The spatial filters designed in this paper can also be used to replace the ordinary spatial filtering and smoothing techniques [57] and combined with the ordinary subspace-based DOA estimation methods, so as to extend them to arbitrary array geometry. However, the performance of such variants may still be constrained by the dependence of the subspace-based methods on high-precision covariance matrix and subspace estimates; thus, they may deteriorate significantly in the demanding environments of low SNR, limited snapshots, or spatially adjacent sources.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The work is supported partially by the National Natural Science Foundation of China (no. 61302141) and the Foundation of the State Key Laboratory of Complex Electromagnetic Environment Effects on Electronics and Information System (CEMEE) (no. CEMEE2014Z0202B).