ICTEAM Institute, Université Catholique de Louvain, 1348 Louvain-la-Neuve, Belgium
We address the problem of power allocation in a wireless sensor network where distributed sensors amplify and forward their partial and noisy observations of a Gaussian random source to a remote fusion center (FC). The FC reconstructs the source based on linear minimum mean-squared error (LMMSE) estimation rule. Motivated by the availability of limited energy in the sensor networks, we undertake the design of power allocation based on minimization
of the reconstruction distortion subject to a constraint on the network transmit power. The design is based on the following two cases: (i) exact knowledge of the channel gains and (ii) the estimates of the channel gains. We show that the distortion can be represented as a convex function of the transmit powers of the sensors. Moreover, we show that the power allocation based on
this distortion function does not bear any closed form solution. To this end, we propose a novel design based on the successive
approximation of the LMMSE distortion, which turns out to be simple, computationally efficient, and exhibits excellent convergence
properties. The simulation examples illustrate that the proposed design holds considerable performance gain compared to a uniform power allocation scheme.
1. Introduction
Wireless sensor networking is an emerging technology which finds application in many fields including environment and habitat monitoring, health care, automation, military applications such as battlefield monitoring and surveillance, and underwater wireless sensor networks (UWSNs) for marine environment monitoring [1, 2]. A wireless sensor network (WSN) consists of spatially distributed sensors that cooperatively monitor physical or environmental conditions, for example, temperature, vibration, pressure, motion, or pollutants.
We consider a system in star topology where sensors amplify and transmit their noisy observations of a common source, via some orthogonal multiple access scheme such as frequency division multiple-access (FDMA), to a central processing unit called fusion center (FC) which reconstructs the source in a way that the overall distortion (e.g., mean squared error) be minimized. Conceptually, the system is similar to the CEO problem [3, 4].
The sensors in the network have partial and spatially correlated observations of the underlying source. The correlation exists where sensors measure data in same geographical location, for example, acoustic sensors that are sensing a common event produce measurements that are correlated. In addition, observation noise and communication channel may not have same conditions across the sensors. Therefore, transmission of the observations based on uniform power allocation is not an optimal strategy.
In this paper, we study the problem of adaptive power allocation given a network power constraint with the objective to minimize the reconstruction MSE. The optimal power allocations are jointly determined at the FC which are then conveyed to the individual sensors via feedback channels. The communication channels from the sensors to the FC experience independent flat fading. The channel from the sensor to the FC is usually estimated using some training sequence. The receiver noise and the limited available power means that the channel estimation always incurs some estimation error. Consequently, the design of power allocation scheme should also take into account the channel estimation errors [5, 6]. In this paper, first we design the power allocation scheme based on perfect knowledge of the channel state information (CSI) and subsequently, in the design, we incorporate the effect of imperfect CSI.
In a sensor network measuring a memoryless Gaussian source uncoded transmission, that is, amplify and forward (AF), outperforms the separate coding and transmission over the multiple-access channel [7–9]. Motivated by this result, Vuran et al. in [10] considered the estimation of a random source with distributed sensors and suggested a sensor selection procedure which exploits the spatial correlation to minimize the estimation error (based on the LMMSE estimation criterion). The sensor selection procedure suggests that the sensors with high correlation with the source and low cross-correlations should be selected. The procedure does not take into account the fact that even if a sensor has high correlation with the source and low cross-correlations with the other sensors, it can still be a bad selection in terms of energy efficiency if its observation noise is high and/or the communication channel to the FC is bad. A recent related work appears in [11]. Bahceci and Khandani in [12] proposed a power allocation scheme where each sensor observes a separate source albeit correlated. Reference [13] presented a power scheduling scheme for sensor networks to detect a source based on the binary hypothesis testing rule which exploits the correlation in the observation noises at the sensors. Other works like [14–18] proposed power allocation schemes for parameter estimation in wireless sensor networks without considering the spatial correlation. In this paper, we present a novel framework which incorporates adaptive power allocation (APA) in the network by taking into account the spatial correlation and cross-correlations of the observations, observation quality, and communication channel to the FC. The power allocation design also takes into account the channel estimation errors.
We assume that the FC reconstructs the underlying source using linear minimum mean squared error (LMMSE) estimation rule. The power allocation design is based on minimization of the reconstruction distortion subject to a constraint on the total transmit power of the sensors. Due to the spatial correlation among the sensor observations, the design of the power allocation scheme based on the given optimization problem presents a unique challenge because the LMMSE estimation/reconstruction error of the underlying source contains nonlinearly coupled optimization variables. Herein, first we prove that the estimation distortion can be represented as a convex function of the sensor transmit powers, then we show that the power allocation design based on this distortion function turns out to be complicated and does not bear a closed solution. Subsequently, we propose a novel design based on the successive approximation of the LMMSE estimation distortion. The resulting power allocation algorithm is simple, computationally efficient, and exhibits excellent convergence properties. The proposed designs hold considerable performance gain compared to a uniform power allocation scheme. To the best of our knowledge, in the present literature, there is no such work on the design of power allocation for the sensor network under consideration which jointly exploits spatial correlation, observation noises, channel gains, and their estimation errors.
The rest of the paper is organized as follows. Section 2 describes the system set-up. The power allocation problems and their solutions are presented in Sections 3 and 4, respectively for perfect and imperfect knowledge of the CSI. Section 5 evaluates performance of the power allocation designs. Section 6 concludes the work.
2. System Model
Consider the system model shown in Figure 1 in which spatially distributed sensors observe an unknown zero-mean real Gaussian random source , and communicate with the fusion center (FC) via orthogonal multiple-access channels. Each sensor has a partial and noisy observation of the source, and sends an amplified version of it to the FC. The FC collects the signals from all sensors and reconstructs the source according to a given fidelity criterion, for example, minimum mean-squared estimation error. The and respectively, denote the partial observation of the source and the noise corrupting this observation such that the noisy observation at sensor is
The estimation of the source is done on a sample by sample basis, and its procedure is same for all samples. Therefore, for clarity, in the subsequent formulation we drop the time index. We assume that the sensors amplify and forward their observations to the FC via orthogonal channels where each channel experiences flat fading independent over time and across sensors.
Figure 1: Block diagram of the system.
The optimality of the AF scheme is established for the Gaussian network with nonorthogonal multiple-access channel from the sensors to the FC [7]. However, for the network with orthogonal multiple-access channel it has been shown in [19, 20] that the separate source channel coding outperforms the AF scheme. The optimality of the coded source-channel communication in general requires coding over long block lengths and will require some data processing at the sensors. This will increase the power consumption at the sensors and will lead to longer processing delays which may not be tolerable in many applications. Therefore, due to simplicity, low latency, and ease of implementation, in this paper we adopt the AF transmission strategy.
The received signal at FC from sensor is
where is a amplifying factor and is a circularly-symmetric Gaussian receiver noise. The fading channels between the sensors and the FC are with gain factors which are Rayleigh distributed. Noting that , we can write (2) as
where the exponential term can be merged into the variable without changing its statistical properties—due to the circular-symmetry property of [21]. Since the underlying source and the noisy observation are real-valued, therefore, we only need to consider the component of the noise which is in-phase with the observation , that is,
where and .
For the analysis in this work, we assume that the observation noise , (similarly the receiver noise , ) is independent across the sensors and is also independent of , (). Moreover, we assume that the source , the observation at sensor , the observation at sensor , the observation noise at the sensor, and the receiver noise at the FC are jointly Gaussian across sensors ( and ) with zero mean and covariance () specified by
We also assume that the samples of , , and are individually independent in time.
In (5), the correlation coefficient represents the correlation between and and the coefficient denotes the correlation between and . The values of these correlation coefficients depend on the distance of the sensors w.r.t. the position of the source and w.r.t. each other, respectively, and can be characterized as follows:
which is a power exponential model for correlation [10, 22]. In (6), is the distance between the source and sensor , and is the distance between the sensors and . The parameter controls how fast the correlation decays with distance and is called range parameter. The other parameter is called a smoothness or roughness parameter which is . Equation (6) shows that the correlation decays with distance with limiting values of 1 and 0 as and respectively. Therefore, the correlation changes with the change in the elative positions of the source and the sensors. The change may happen due to movement of either the sensors or the source or both, for example, an animal may kick a sensor node to a different location. We assume that the relative positions of the sensors with respect to each other and the underlying source are perfectly known. Moreover, we assume that the positions remain unchanged for at least one estimation cycle.
Based on the correlation model, therefore, we can say that the FC in essence is interested to reconstruct the source which is located at a specific location by collecting observations from spatially distributed sensors where correlation of the observations with the source and among the sensors, respectively, depends on the spatial location of the sensors w.r.t. the source and w.r.t. each other. Note that the location of the source and the sensors can be in two or three dimensional space.
The equation (4) can be written equivalently in matrix-vector notation as follows:
where denotes the matrix-vector transpose operation. At the FC, the optimal estimator in minimum mean-squared error (MMSE) sense is the conditional mean of given the observation , that is, , where denotes the mathematical expectation. Under the jointly Gaussian assumption of the and , the conditional mean estimator turns out to be linear and is called linear minimum mean-squared error estimator (LMMSEE). Therefore, we seek the estimate of the source like which can be written as [23]
where is a row-vector of LMMSEE weighting coefficients. The resultant distortion of the estimate in comparison to the original signal is measured by the mean-squared error and is given by [23]
where , , , and . The estimation distortion can also be written as follows:
which is obtained by using the Woodbury identity for matrix inversion [24] (see Appendix A). Let , where . Now we can write (10) as
Theorem 1. The estimation distortion function in (11) is convex over for .
Let be a vector of the transmit power of the sensors. The proof of the theorem consists in showing that the Hessian of the distortion function in (11) with respect to is positive semidefinite. To this end, a detailed proof is given in Appendix B.
Remark 1. The reconstruction distortion is upper bounded by the variance of the source , and lower bounded by the variances of the observation noises and spatial correlation and cross-correlation values as given by
The lower-bound distortion is achieved when the observations of the sensors are received at the FC via ideal communication channels which can be verified by setting , in (11). Note that it is not possible to achieve distortion less than . Moreover, when observation noise variances are then the lower bound distortion reduces to . The achieved distortion is equal to the upper bound value (i.e., ) when either no signal is received at the FC from the sensors or the observations are uncorrelated with the source or both, which can be verified from (11).
3. Power Allocation with Perfect CSI
In this section, we assume that the channel state information (CSI), that is, the channel gains are perfectly known at the FC. The case of imperfect CSI is considered in the next section.
3.1. Minimization of the Distortion Subject to the Power Constraint
We base our adaptive power allocation design on the following optimization problem.
Prob
Minimize the distortion subject to
where denotes the total power in the transmitted signal of sensor . The sum power constraint in (13) enables a fair comparison between the networks of different sizes. Moreover, for a sensor network which forms part of a bigger network where each subnetwork performs different sensing task but share the same frequency band to transmit observations, to limit the interference between the subnetworks, the total power emitted from each subnetwork is upper bounded. Furthermore, recent studies have shown that the ICT (Information and Communication Technology) power consumption is a significant contributor to the global warming [25]. Therefore, in the context of sensor networks, putting cap on the total power consumption conserves energy and limits the contribution to the global warming.
Since the optimization problem in (13) is convex (the objective is convex and the constraints are linear), therefore we can use the Lagrangian method of multipliers to find the optimal 's [26]. The Lagrangian cost function is
where and are dual variables or Lagrange multipliers. The associated Karush-Kuhn-Tucker (KKT) conditions are
where is a diagonal matrix with unity at place and all other elements are equal to zero.
The expression in (15) is a complicated function of the optimization variables. Therefore, a closed form solution for this problem is not tractable. However, we can resort to numerical methods (e.g., bisectional search over 's in (15) and gradient method to update ) to find the optimal in an iterative manner as outlined under Algorithm 1. From (17), note that the active sensors have corresponding Lagrangian multipliers . The sensors with are removed from the system. The parameter in * denotes the step-size. Since the objective function of the optimization problem is convex and bounded and the constraints are linear, therefore the algorithm can achieve convergence to the absolute minimum (the KKT point) of the problem provided the step-size is selected properly [27]. Unfortunately, Algorithm 1 will be computationally quite expensive (unless the network size is small) due to (i)a number of matrix inversions involved in (15) while numerically searching for in each iteration; (ii)the dependence of the convergence properties on the step-size [27, 28].
In the sequel, based on a successive approximation (SA) principle, we present a novel quasianalytical solution of the optimization problem which is simple and the associated algorithm is computationally efficient compared to Algorithm 1, exhibits remarkable convergence properties, and achieves distortion very close to the global optimum of the Algorithm 1 with no appreciable performance gap. This so-called SA-based design can be viewed as the joint optimization of the transmit powers and the modified LMMSE coefficients as we will see in the subsequent development.
According to the idea of successive approximation, a modified function is constructed from the given function in some special way [29–31]. Then that modified function is solved iteratively/successively to find the solution for the underlying problem. The solution obtained by the SA approach can be viewed as quasianalytical solution. We apply the idea of successive approximation to the reconstruction distortion function and solve the problem of power allocation in the sensor network. To this end, at the FC, to form the estimate of the source and to characterize the resultant mean-squared distortion , we proceed as follows. We can write the distortion as
and by solving we get the following expression for the LMMSE weighting coefficients:
The variables and are, respectively, defined as follows:
where . With (20) and (21), the distortion in (18) simplifies to
Equation (21) forms a set of coupled equations which constitute the Wiener-Hopf equation for the LMMSE filter coefficients (, ). If we know the transmit powers then for given covariance and the channel gains , we can find the coefficients by solving (21). For the solution, it is convenient to employ the matrix-vector form as follows:
where , , for and for . Note that denotes the element of the matrix . However, the point here is that we do not know the transmit powers . The following subsection presents an alternative solution to the problem of optimizing the transmit powers of the sensors under the network-wide power constraint such that the distortion be minimized. Therein, to derive an algorithm for the solution of , the underlying idea is to assume as constant. Based on this assumption, we derive an iterative algorithm which computes using the values of from the previous iteration. This successive approximation (SA) of the distortion function in (22) makes the solution of the power allocation problem simple and easy to compute as will be seen in the ensuing development. Note that the resulting design for power allocation can be viewed as a joint optimization of and .
3.2. Minimization of the Distortion Subject to the Power Constraint-SA
Herein, we solve the optimization problem in (13) based on the distortion function in (22) and using the successive-approximation principle outlined in the preceding subsection. For given , it is easy to verify that the distortion function is convex with respect to the optimization variables . Therefore, the KKT conditions are sufficient for optimality [26] which are given as follows:
Solving (24) for active sensor (i.e., , from (26)), we get
for , where denotes the channel SNR for sensor and . Based on (27), following observations are in order. (1)There exists a cut-off value such that for , the power allocation policy follows waterfilling on channel SNR, that is, increases with increasing ; and for , the power allocation is according to inversion in the channel SNR, that is, increasing decreases . (2)The sensors with higher observation noise variances are given less power. For sensor , in the limiting case (i.e., ) then . (3)The sensors with weak correlation with the source are allotted less power. For instance, if then .
Combining the aforementioned points we can see that the final power allocation policy for the sensors depends on the spatial correlations, variance of the observation noises, and the channel SNRs. Moreover, we see that depending on the values of these system parameters some of the sensors may be switched-off altogether. For sensor to be active, following condition must hold:
which stems from the fact that if sensor is active. Let denotes the set of active sensors defined as follows:
Since the problem is convex, the minimum of the objective function occurs at the sum power constraint boundary, that is, the constraint is active. Therefore, the transmit powers must satisfy the power constraint with equality, that is, , which gives
Based on the solution from (27) through (30), Algorithm 2 can be proposed which iteratively optimizes the transmit powers and the variables , while minimizing the reconstruction distortion subject to the power constraint. If during iterations any sensor does not fulfill the condition in (28), it is switched off and the algorithm continues with the remaining sensors until the convergence criterion is fulfilled. Regarding the convergence properties of the algorithm, consider the following. (i)Since in each iteration (successive approximation) we are minimizing a convex function over the convex-set of the transmit powers , therefore the optimality of the transmit powers in each approximation (for given ) combined with the optimality of for given as per (23) means that the algorithm achieves monotonic decrease in the distortion, that is,
and consequently it does converge to a unique minimum point.(ii)The algorithm consistently arrives at the same combination of the transmit power tuple and achieves the same minimum distortion for a wide range of different initialization points. In other words, we can say that the algorithm exhibits start point independence (for a wide range of initialization points). Moreover, the algorithm asymptotically achieves the lower-bound distortion with increasing transmit power . In Section 5, we illustrate the monotonic decrease, start point insensitivity, and the asymptotic convergence to with several simulation examples. There we also show that the convergence may be achieved in as few as two or three iterations.(iii)We have shown that the original problem is convex and therefore the objective function has a global minima under the power constraint which can be achieved by Algorithm 1. Now the question is how closely does the successive approximation-based algorithm converge to the global minimum value? The simulation examples in Section 5 show that the distortion achieved by both algorithms are extremely close and the performance gap between the full-optimization and the successive approximation based algorithms is virtually negligible.
It is a quite remarkable that the algorithm exhibits such excellent convergence properties which illustrates that the proposed successive approximation strategy works quite well. Finally, compared to the power allocation Algorithm 1, the ease of computation and simplicity of the design based on the successive approximation principle can be appreciated from the simple and elegant structure of (27)–(30).
4. Power Allocation with Imperfect CSI
Heretofore, we have assumed perfect knowledge of the channel gains . However, in practice, we have estimates of the actual channel gains. One way to estimate the channel is by a training sequence whereby each sensor transmits a known sequence of data symbols called pilots. Then based on the received data, the FC estimates the channel. Let denote the pilot symbol transmitted by sensor in the channel estimation phase. The corresponding received signal is and based on which the LMMSE estimate of is
where denotes the complex conjugate operation. The variance of the estimation error is
wherein is power of the transmitted pilot. Note that the variance of channel estimation error is finite for finite and . The actual channel can be represented as a sum of the estimate and the estimation error, that is,
where . Such an approach to model the channel estimation error can be viewed as the Bayesian approach [6].
One way to design the power-scheduling scheme is by replacing and respectively, by and in the formulations of the foregoing section. This constitutes a naive-approach because it ignores the error in the channel estimate. An alternative design originates by substituting (34) in (2) as follows:
in which can be viewed as total receiver noise corresponding to sensor with , , and , . Noting that , we can write
where the exponential term can be absorbed into , that is, into the Gaussian variables and without changing their statistical properties—thanks to their circular symmetry. Since the underlying source and the observation are real-valued, as a consequence only the part of the noise in-phase with the sensor observation is relevant for estimation of the source . Therefore, we can write
where , and . Following a procedure similar to Section 3, it can be shown that the mean-squared reconstruction distortion of the estimate with respect to is given by
where
The solution of the optimization problem in (13) with the objective to minimize the distortion defined in (38) subject to the constraint on the total network power can be obtained by using the method of Lagrangian multipliers and is outlined as follows.
(1)The power allotted to sensor is for , where defines the channel SNR based on the channel estimate.(2)For sensor to be active, that is, , the following condition:
must hold, otherwise it is switched-off.(3)The index-set of the active sensors is