Journal of Sensors

Volume 2019, Article ID 8169404, 14 pages

https://doi.org/10.1155/2019/8169404

## Signal Detection from Permutated Observations Using Distributed Sensors

^{1}Nanjing University of Information Science and Technology, China; ^{2}Nanjing Marine Radar Institute, China; ^{3}Tsinghua University, China; ^{4}Nanjing University of Aeronautics and Astronautics, China

Correspondence should be addressed to Naiti Jiang; moc.361@ee_tngnaij

Received 29 December 2018; Accepted 20 February 2019; Published 7 August 2019

Academic Editor: Antonio Lazaro

Copyright © 2019 Naiti Jiang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

In this paper, distributed constant level detection in wireless sensor networks (WSNs) is investigated. The permuted linear model with a scalar parameter and additive heteroscedastic Gaussian noise is introduced, where the associations between the observations and the sensors are assumed to be unknown. Several detectors, namely approximations of the generalized likelihood ratio test (GLRT) detector, the mean detector, and the energy detector, are proposed, and their receiver operating characteristics (ROCs) are evaluated. Numerical simulations show that the performance degradation of the GLRT detector is small compared to the Neyman-Pearson (NP) detector with known permutation.

#### 1. Introduction

Decentralized estimation and detection in wireless sensor networks (WSNs) have attracted considerable attention in the past decades [1–11]. Early on, an overall description of decentralized detection was presented in [1]. Years later, a two-part review summarized the basic results of distributed detection in more detail [2, 3]. In part I, the authors reviewed the fundamentals and discussed several important issues, including the computational complexity, network design with general topologies, and applications in different areas. In part II, further works on asymptotically optimum, nonparametric, robust, and sequential centralized detection were reviewed. Advancements in these directions provide more efficient detection schemes by optimizing more general performance criteria.

In general, observations collected from WSNs can be mathematically expressed as a noisy linear model. The most common assumption in the existing literature is that the observation matrix is perfectly known and the observations are perfectly labeled. The unknown signal can then be detected or directly recovered from the observations by least squares methods. One variant of the scenario is that there exists model uncertainty where multiplicative noise is introduced [7, 8, 12–15]. A more complex scenario corresponds to the case that the observation coefficients are partially known, which is also known as unlabeled sensing [16–22]. In this setup, the detection problem from unlabeled data is closely related to the problem of parameter estimation. To obtain the generalized likelihood ratio test (GLRT), one usually has to estimate both the permutation matrix and the unknown parameters via numerical algorithms. In [16], a branch and bound global optimization algorithm combined with the signal's sparsity information is utilized, and numerical results demonstrate that the permutation matrix can be recovered correctly under certain circumstances. In [17], it is shown that the underlying signal can be recovered correctly with probability one, given that the sensing matrix is a random matrix with i.i.d. entries and the number of measurements is twice the number of unknowns. Taking the observation noise into account, it has been shown that recovery of the permutation matrix depends sharply on the signal-to-noise ratio, the number of measurements, and the estimated parameters [18]. One important variant of the above problem is the unlabeled ordered sampling problem, in which only a subset of the measurements is kept and their relative order is preserved; for this problem, an alternating minimization algorithm was proposed and a phase transition phenomenon was revealed in [19]. In [20], the permuted linear regression problem with a scalar parameter and additive heteroscedastic Gaussian noise is studied, and an alternating minimization algorithm is proposed to jointly recover the permutation matrix and the underlying parameter. In [23], signal amplitude estimation and detection from unlabeled binary samples are studied, and the number of quantizers needed to recover the permutation matrix is provided. It is noteworthy that unlabeled sensing problems appear in many applications such as archaeological measurements, time jitter in sampling, and multitarget tracking [24–26].

In this paper, the problem of distributed constant level detection from unlabeled network observations is studied. In the big data era, accurately associating the enormous number of observations generated by heterogeneous sensors is a difficult task, which directly motivates the present work. More explicitly, we consider the case that the association between the sensors and the observed data is unknown while the time association is accurate. The goal is to detect whether the desired scalar signal is present or absent based on the permuted data. Compared to the deterministic detection problem discussed in [27], we are faced with an unknown signal contaminated by additive noise. More importantly, the detection problem is considered in a sensing communication model, where each sensor transmits the unlabeled noisy data to the fusion center (FC) for decision making. While a theoretical analysis of parameter estimation under the same model has been published in [20], detection of the unknown scalar remains an open problem. We fill this gap in this work; the main contributions of the paper are the proposed solutions to the detection problem and the methods proposed to improve the detection performance.

#### 2. Problem Setup

The system model is described in this section, under which the ML estimation with reference to the observations and the clairvoyant detector are derived. As mentioned earlier, we study the detection problem from permuted data in WSNs. Considering such a detection scenario, there are *N* sensors collecting the data of a noisy scalar parameter *θ*. Given that *i* and *j* denote the sensor and the time index, respectively, we suppose that the randomness in the unknown scalar signal **X** can be summarized in the following binary hypothesis testing problem: where is the unknown scalar signal and . Then all sensors transmit the noisy data to the fusion center (FC) through a channel. Here, we suppose that the channel between the *i*th sensor and the FC is time-invariant, and the channel coefficient is denoted by *h*_{i}. Then the observations after the transmission *Y*_{ij} can be expressed as follows: where *υ*_{ij} is an *i.i.d.* noise sequence satisfying and is independent of *W*_{ij}.

Assume that the FC knows only that each *Y*_{ij} belongs to some sensor, i.e., the time association is accurate, and that it applies the following preprocessing procedure to the raw data:

Under hypothesis , equation (3) can be simplified as follows: where **e** is defined as follows:

Let **e** = [*e*_{1}, ⋯ , *e*_{N}]^{T}. It follows that , where **W** is a diagonal matrix whose diagonal element is . Since data is unlabeled and it follows that where is the unlabeled observations of **y**, and **Π** is an unknown *N* × *N* permutation matrix which disorders the rows of the labeled observations. As a consequence, the detection problem can be formulated as follows:

It is noted that if the permutation matrix is recovered as , then the ML estimation of *θ* is as follows:

##### 2.1. Clairvoyant Detector from Labeled Observations with the Knowledge of *θ*

If the parameters *θ* and **Π** are both known, the optimal detection statistic in the Neyman-Pearson sense is as follows: where the threshold *γ* is chosen such that the false alarm probability equals a desired level. This detector is usually referred to as the clairvoyant detector, and the corresponding detection probability is as follows: where is the tail probability of the standard normal distribution, *Q*^{−1}(·) denotes its inverse, and *P*_{FA} is the false alarm probability. Because the test statistic has the same PDF as in the classical mean-shifted Gauss-Gauss problem, the deflection coefficient *d*^{2} characterizes the detection performance and it can be defined as follows:

Here, the deflection coefficient *d*^{2} of the clairvoyant detector is as follows:

If *N* is large and , by Jensen’s inequality, we have the following:

Note that the upper bound approximation is tight in the case of being zero.
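For a mean-shifted Gauss-Gauss problem, the detection probability is commonly written as P_D = Q(Q^{−1}(P_FA) − √(d²)). The following is a minimal sketch that evaluates this standard expression for a given deflection coefficient, assuming this is the form that applies to the clairvoyant detector above. The function names are illustrative and not taken from the paper.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def q_function(x: float) -> float:
    """Tail probability Q(x) = P(Z > x) of the standard normal distribution."""
    return 1.0 - _STD_NORMAL.cdf(x)


def q_inverse(p: float) -> float:
    """Inverse of Q: the value x with Q(x) = p, valid for 0 < p < 1."""
    return _STD_NORMAL.inv_cdf(1.0 - p)


def detection_probability(p_fa: float, d2: float) -> float:
    """P_D of a mean-shifted Gauss-Gauss test (assumed standard form).

    p_fa is the false alarm probability and d2 >= 0 is the deflection
    coefficient d^2.
    """
    return q_function(q_inverse(p_fa) - d2 ** 0.5)
```

With `d2 = 0` the expression returns `p_fa` itself, and the detection probability grows monotonically with the deflection coefficient, which is why *d*^{2} alone characterizes the performance.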

##### 2.2. Signal Detection from Labeled Observations without the Knowledge of *θ*

For the case of unknown amplitude *θ* and known permutation matrix **Π**, it can be shown that a uniformly most powerful (UMP) test does not exist. As a consequence, we resort to a suboptimal GLRT detector. The GLRT decides if where is the MLE of *θ* assuming is true. Consequently, we obtain the GLRT as follows: which is a correlator that accounts for the unknown sign of *θ* by taking the absolute value of its output. The detection probability can be calculated as follows [27]:

#### 3. Detection with Permuted Observations

In this section, we investigate the detection problem (7) under the circumstance that both the desired parameter *θ* and the nuisance parameter **Π** are unknown. In order to detect the presence of the signal, the recovery of the permutation matrix is indispensable. However, the joint optimization over both unknowns is difficult because the set of permutation matrices is discrete and hence nonconvex. To circumvent this difficulty, the joint optimization problem is decomposed into two subproblems. More specifically, the problem of detection with labeled observations was analyzed in Sections 2.1 and 2.2. In Subsection 3.1, we solve the detection problem from permuted observations with the knowledge of *θ*. Finally, in Subsection 3.2, the complete detection problem (7) is studied.
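To make the setting of problem (7) concrete, the sketch below simulates permuted observations at the FC. Everything about the model form here is an assumption for illustration: the signal at each sensor is taken to be *θ* plus Gaussian noise under the signal-present hypothesis, the channel is taken to act as *Y*_{ij} = *h*_{i}*X*_{ij} + *υ*_{ij}, and the preprocessing is taken to be an average over the *K* time samples. The function name and arguments are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def simulate_permuted_observations(
    h: Sequence[float],
    theta: float,
    sigma_w: float,
    sigma_v: float,
    K: int,
    signal_present: bool,
    rng: random.Random,
) -> Tuple[List[float], List[int]]:
    """Return (y_perm, perm) with y_perm[k] = y[perm[k]].

    Assumed model (not confirmed by the source):
      X_ij = theta + W_ij under H1, X_ij = W_ij under H0,
      Y_ij = h_i * X_ij + v_ij, and y_i = time average of Y_ij over K samples.
    """
    y = []
    for hi in h:
        total = 0.0
        for _ in range(K):
            x = (theta if signal_present else 0.0) + rng.gauss(0.0, sigma_w)
            total += hi * x + rng.gauss(0.0, sigma_v)
        y.append(total / K)
    # The FC receives the time-averaged values without sensor labels.
    perm = list(range(len(h)))
    rng.shuffle(perm)
    y_perm = [y[i] for i in perm]
    return y_perm, perm
```

The returned `perm` plays the role of the unknown permutation matrix **Π**; a detector under test would see only `y_perm` (and possibly `h`), never `perm`.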

##### 3.1. Signal Detection from Permuted Observations with Knowledge of *θ*

Now we consider the case that the signal *θ* and the channel coefficients are known while the observations collected from the sensors are permuted. Under this assumption, we first propose an approximated GLRT as follows [27]: where the search space, i.e., the set of possible permutation matrices, has size *N*!. Under either hypothesis or , or can be decomposed as . For problem it can be easily shown that the permutation matrix can be found by sorting (element-wise square) according to **h**^{2}. For problem which is decomposed as , one can swap the elements of to reduce the objective function value for a given *θ*, which involves *O*(*N*^{2}) computational complexity.
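The sorting idea can be illustrated on a simplified version of the problem. The sketch below assumes an unweighted least squares criterion, i.e., it ignores the heteroscedastic noise weights that the swapping procedure above is designed to handle. In that simplified case, pairing the *k*th smallest observation with the *k*th smallest reference value minimizes the sum of squared differences over all *N*! permutations, by the rearrangement inequality. The function name is illustrative.

```python
from typing import List, Sequence


def match_by_sorting(observed: Sequence[float],
                     reference: Sequence[float]) -> List[int]:
    """Return assign such that observed[k] is matched to reference[assign[k]].

    Minimizes sum_k (observed[k] - reference[assign[k]])**2 over all
    permutations (unweighted criterion, an assumption of this sketch).
    """
    obs_order = sorted(range(len(observed)), key=lambda k: observed[k])
    ref_order = sorted(range(len(reference)), key=lambda i: reference[i])
    assign = [0] * len(observed)
    for k, i in zip(obs_order, ref_order):
        assign[k] = i  # k-th smallest observation <-> k-th smallest reference
    return assign


# Example with known theta and h: the reference values are h_i * theta.
h = [0.5, 2.0, -1.0, 1.2]
theta = 1.0
reference = [hi * theta for hi in h]
# Permuted, slightly perturbed observations (sensor order 2, 0, 3, 1).
observed = [-1.02, 0.47, 1.25, 1.9]
assignment = match_by_sorting(observed, reference)
```

The cost is dominated by the two sorts, so this step takes *O*(*N* log *N*) time rather than a search over *N*! candidates.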

In order to construct a more computationally efficient detector, the linear approach is further incorporated to eliminate the effects of the permutation matrix. Observe that given any two permutation matrices **Π**_{1}, **Π**_{2} and observation , **A****Π**_{1} = **A****Π**_{2} holds if the rows of the matrix **A** have constant entries. As a consequence, the linear approach reduces to the simple mean detector as follows: where

The false alarm and detection probabilities are, respectively, as follows: Moreover, the receiver operating characteristic (ROC) of the detector can be expressed as follows: Define From (13) and (25), the SNR loss incurred by the mean detector w.r.t. the clairvoyant detector is obtained as follows:

If , i.e., **h** is a constant vector, the loss is zero, which implies that the detector using the mean statistic performs as well as the optimal clairvoyant detector. This is obvious because the permutation has no effect on the observations given a constant **h**. For a zero-mean vector **h** (*μ*_{h} = 0), the loss achieves its maximum of 1. This is because the mean detector cannot detect the signal when the mean of **h** is zero. Before ending the present subsection, we slightly digress to emphasize that the result presented in (26) is accurate for , while it holds only approximately for a general small nonzero .
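The two properties discussed above can be checked numerically. The sketch below uses the plain sample mean as the statistic (an assumption about the exact form of the mean detector) and noise-free observations of the assumed form *h*_{i}*θ*; the vectors are illustrative values only.

```python
import math
import random


def mean_statistic(y_perm):
    """Sample mean of the (possibly permuted) observations."""
    return sum(y_perm) / len(y_perm)


theta = 1.0                            # illustrative signal level
h_generic = [0.5, 2.0, -1.0, 1.2]      # arbitrary channel vector
h_const = [1.0, 1.0, 1.0, 1.0]         # constant channel vector
h_zero_mean = [1.5, -0.5, 0.5, -1.5]   # channel vector with zero mean

# 1) The statistic does not depend on the order of the observations.
y = [hi * theta for hi in h_generic]
y_shuffled = random.sample(y, k=len(y))   # random relabeling of y
invariant = math.isclose(mean_statistic(y), mean_statistic(y_shuffled))

# 2) The signal part of the statistic is mean(h) * theta: it equals theta
#    for a constant unit h and vanishes when h has zero mean.
signal_part_const = mean_statistic([hi * theta for hi in h_const])
signal_part_zero_mean = mean_statistic([hi * theta for hi in h_zero_mean])
```

Since the signal contribution to the statistic scales with the mean of **h**, a zero-mean channel vector leaves only noise in the statistic, which matches the maximum loss described above.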

##### 3.2. Signal Detection from Permuted Observations without Knowledge of *θ*

For the complete detection problem (7), we should jointly optimize the parameters *θ* and **Π**. In [20], an alternating minimization algorithm is proposed to find the maximum likelihood estimates of *θ* and **Π**. The alternating minimization algorithm works as follows: Given , we update the permutation matrix and obtain . Then we fix the permutation matrix as and update *θ*. The alternating steps are repeated until the algorithm converges. In this paper, we use the above approach to evaluate the performance of an approximation of the GLRT as follows: Notice that although the optimality of the proposed algorithm is not guaranteed, the algorithm appears to work well when the desired parameter *θ* is accurately estimated. As the number of sensors *N* increases, the probability of perfect permutation matrix recovery decreases [18]. On the other hand, numerical simulations have shown that the accuracy of the estimation of *θ* improves with increasing *N* [20]. As a consequence, it is interesting to evaluate the performance in terms of *N*, which will be addressed by performing numerical experiments in the following section.

In [28], it has been proved that in the absence of additive noise **V**, the permutation matrix can be recovered exactly. Taking this into consideration, a reasonable initial point of *θ* can be chosen as follows: We then use the initial estimate *θ*_{init} to iteratively update the permutation matrix and the unknown *θ*.
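The alternating scheme described in this subsection can be sketched as follows. This is a simplified illustration, not the algorithm of [20]: it assumes an unweighted least squares criterion (the heteroscedastic noise weights are ignored), so that the permutation step reduces to sorting and the *θ* step to an ordinary least squares fit. The initial value `theta_init` is supplied by the caller, and all names are hypothetical.

```python
from typing import List, Sequence, Tuple


def sort_match(observed: Sequence[float],
               reference: Sequence[float]) -> List[int]:
    """Match the k-th smallest observation to the k-th smallest reference."""
    obs_order = sorted(range(len(observed)), key=lambda k: observed[k])
    ref_order = sorted(range(len(reference)), key=lambda i: reference[i])
    assign = [0] * len(observed)
    for k, i in zip(obs_order, ref_order):
        assign[k] = i
    return assign


def alternating_minimization(
    y_perm: Sequence[float],
    h: Sequence[float],
    theta_init: float,
    max_iter: int = 50,
    tol: float = 1e-10,
) -> Tuple[float, List[int]]:
    """Alternate between the permutation update and the theta update.

    Returns (theta_hat, assign), where y_perm[k] is attributed to sensor
    assign[k]. Assumes h is not the all-zero vector.
    """
    theta = theta_init
    assign: List[int] = []
    h_energy = sum(hi * hi for hi in h)
    for _ in range(max_iter):
        # Step 1: fix theta and update the permutation by sorting.
        new_assign = sort_match(y_perm, [hi * theta for hi in h])
        # Step 2: fix the permutation and update theta by least squares.
        new_theta = sum(h[i] * yk for i, yk in zip(new_assign, y_perm)) / h_energy
        converged = new_assign == assign and abs(new_theta - theta) <= tol
        theta, assign = new_theta, new_assign
        if converged:
            break
    return theta, assign
```

As with any alternating scheme on a nonconvex problem, the result depends on the starting point; in this sketch a `theta_init` with the wrong sign reverses the sorted matching in the first step, which illustrates why the choice of initial point matters.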

In addition, if the channel information is also missing, the test statistic has to be a function of the observations **y** alone, as in the previous mean detector. Utilizing the second order information yields the following energy detector: The performance of the energy detector will also be evaluated via Monte Carlo (MC) simulations.
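A minimal sketch of an energy-type statistic is given below. The exact form of the detector is not reproduced here, so the sum of squared observations is an assumption; the threshold `gamma` is a free parameter that would in practice be set from the desired false alarm probability.

```python
def energy_statistic(y_perm):
    """Sum of squared observations; needs neither labels nor channel values."""
    return sum(v * v for v in y_perm)


def energy_detector(y_perm, gamma):
    """Declare the signal present when the energy exceeds the threshold gamma."""
    return energy_statistic(y_perm) > gamma
```

Like the mean statistic, the sum of squares is unchanged by any reordering of the observations, so the unknown permutation does not need to be estimated.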

#### 4. Numerical Simulation

In this section, various numerical simulations are conducted to evaluate the performances of the proposed detectors; the first three experiments are performed in both the high noise level and low noise level scenarios, separately. For the first simulation, the results are presented in the form of ROC curves, i.e., *P*_{D} versus *P*_{FA}. Next, the relationship between the detection probability *P*_{D} and the observation time *K* is investigated, where the false alarm probability is set to a desired level. Then, the detection probability *P*_{D} versus the number of the sensors *N* is studied, where the false alarm probability is also set to a desired level. Finally, the probability *P*_{Per} of successfully recovering the permutation matrix **Π** versus the noise levels SNR_{w} and SNR_{υ} is demonstrated, where , , and *P*_{Per} is defined as follows: where is the estimated permutation matrix, and **Π**_{0} denotes the true one. We set *θ* = 1 unless stated otherwise. The channel coefficients *h*_{i} are drawn from the i.i.d. Gaussian distribution and is averaged such that . The performances of various detectors except the clairvoyant and the labeled GLRT detectors are evaluated via MC simulations, and the trial number of MC simulations is 10,000. For simplicity, the approximations of the detectors are referred to without the prefix "approximation" in this section.
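The Monte Carlo quantities used in this section can be computed from lists of simulated test statistics. The helpers below are a sketch under two assumptions: the threshold is taken as an empirical quantile of the statistics simulated without the signal, and *P*_{Per} is estimated as the fraction of trials in which the estimated permutation equals the true one exactly. The function names are illustrative.

```python
from typing import List, Sequence


def threshold_for_pfa(h0_stats: Sequence[float], p_fa: float) -> float:
    """Threshold that about a fraction p_fa of the H0 statistics exceed.

    Requires 0 <= p_fa < 1 and a non-empty list of statistics.
    """
    ordered = sorted(h0_stats)
    n_exceed = int(p_fa * len(ordered))  # number of allowed false alarms
    return ordered[len(ordered) - n_exceed - 1]


def exceedance_rate(stats: Sequence[float], threshold: float) -> float:
    """Fraction of statistics strictly above the threshold.

    Gives the empirical P_FA for H0 statistics and P_D for H1 statistics.
    """
    return sum(s > threshold for s in stats) / len(stats)


def recovery_rate(estimated: Sequence[Sequence[int]],
                  true: Sequence[Sequence[int]]) -> float:
    """Fraction of trials with an exactly recovered permutation."""
    hits = sum(list(e) == list(t) for e, t in zip(estimated, true))
    return hits / len(true)


def roc_points(h0_stats: Sequence[float], h1_stats: Sequence[float],
               p_fa_grid: Sequence[float]) -> List[tuple]:
    """Empirical (P_FA, P_D) pairs for a grid of target false alarm levels."""
    points = []
    for p_fa in p_fa_grid:
        gamma = threshold_for_pfa(h0_stats, p_fa)
        points.append((exceedance_rate(h0_stats, gamma),
                       exceedance_rate(h1_stats, gamma)))
    return points
```

With 10,000 trials, the empirical false alarm level can only be resolved in steps of 10^{−4}, so very small target values of *P*_{FA} would need more trials for a reliable threshold.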

##### 4.1. ROC Curves

###### 4.1.1. High Noise Level

The results are presented in Figure 1 with various values of *μ*_{h} and under the high noise level scenario. The parameters are set as follows: *N* = 20, *K* = 5, , and . The four subgraphs in Figure 1 refer to , , , and , respectively. From Figure 1(a), it can be seen that the labeled GLRT detector performs similarly to the unlabeled detector with the knowledge of *θ*; both detectors perform better than the unlabeled detector without the knowledge of *θ*, which demonstrates that the label information plays an important role in reliable signal detection when the variance is comparable to the mean of **h**. Among the unlabeled detectors, the mean detector is very attractive because of its simple implementation and competitive performance. It is also noted that the unlabeled detector with known *θ* has a detection performance similar to that of the mean detector. From Figure 1(b), it can be seen that the performance of the energy detector is comparable to that of the other unlabeled data-based detectors; all three of these detectors perform worse than the labeled detector. Another observation gleaned from Figure 1(a) is that the mean detector is the worst detector for zero mean channel coefficients, which coincides well with the theoretical analysis presented in Section 3.1. In the case of constant channel coefficients, the clairvoyant detector, the mean detector, and the unlabeled GLRT detector with known *θ* have equal performance, and the labeled GLRT detector exhibits the same performance as the unlabeled GLRT detector with unknown *θ*, as can be seen from Figure 1(c). What we can conclude from Figure 1(d) is that the unlabeled GLRT detector without the knowledge of *θ* has performance similar to that of the mean detector, and both detectors perform worse than the unlabeled detector with known *θ*.