Abstract

We introduce the superimposed training strategy into the multiple-input multiple-output (MIMO) amplify-and-forward (AF) one-way relay network (OWRN) to perform the individual channel estimation at the destination. Through the superposition of a group of additional training vectors at the relay subject to power allocation, the separated estimates of the source-relay and relay-destination channels can be obtained directly at the destination, and the accordance with the two-hop AF strategy can be guaranteed at the same time. The closed-form Bayesian Cramér-Rao lower bound (CRLB) is derived for the estimation of two sets of flat-fading MIMO channel under random channel parameters and further exploited to design the optimal training vectors. A specific suboptimal channel estimation algorithm is applied in the MIMO AF OWRN using the optimal training sequences, and the normalized mean square error performance for the estimation is provided to verify the Bayesian CRLB results.

1. Introduction

In recent years, the use of relays has gained significant interest for the advantage of enhancing link reliability and increasing channel capacity in a wireless network. By transmitting the signal through one or more relays located at geographically separated nodes, the effect of signal fading due to multipath propagation and strong shadowing can be compensated through the exploitation of spatial diversity provided by the network nodes, as well as the efficient use of power resources, which can be achieved by a scheme that simply receives and forwards a given information, yet designed under certain optimality criterion. In particular, the source can cooperate with the relay to forward the replicas of the source signal to the destination thereby providing additional diversity and improved signal quality at the destination. There has been several relaying strategies, of which the most dominant are amplify-and-forward (AF) and decode-and-forward (DF).

Multiple-input multiple-output (MIMO) technology, by providing significant improvements in terms of spectral efficiency and link reliability, has been introduced to achieve high data rates required by the next-generation wireless communication systems. Since MIMO systems are able to support high-data rates by combating fading and interference, coupling relay networks with MIMO techniques is a natural extension to the state of the art. The basic idea is to introduce relays that forward the data to the destination, which is otherwise out of the reach of the source. With two-hop relaying, we can increase the rank and consequently the capacity of ill-conditioned (rank deficient) MIMO channels. Relaying information on two-hops also decreases the need for high power at the transmitter. For MIMO relay channels where every terminal in the wireless network can be deployed with multiple antennas, studies are mainly concentrated on spatial multiplexing (SM) systems. The potentiality of MIMO relay channels in wireless networks has been discussed and several results on capacity bounds have been found in previous works [13], and the optimal relay beamforming design and the corresponding transceivers design have been studied in MIMO scenarios [48].

Before enjoying all the benefits brought by the MIMO relay network, an accurate channel state information (CSI) is required at the destination (for AF) or at both relay and destination (for DF). However, most of the existing works are based on the assumption that all nodes have perfect CSI, which is actually the perfect knowledge of backward (source-relay) and forward (relay-destination) channels. Recently, the estimation algorithms for the cascaded channel of source-relay-destination link have been proposed in [912] for AF-based MIMO relay systems, as data detection at the destination only requires the knowledge of the cascaded channel.

However, only the cascaded channel information is not enough in some other applications, and the individual information of both the backward and forward channels is needed for the destination to perform certain operations. For example, in relay beamforming schemes [13, 14], especially in noncoherent MIMO relay networks, where the optimization process is performed at the destination [7], the individual CSIs are utilized to design the optimal amplifying matrix at the relay and the power allocation between the source and relay so as to maximize the instantaneous capacity of the two-hop MIMO AF relay system. Another application is the subcarrier pairing, which is performed at the relay to allocate power over subcarriers on the two-hops such that the instantaneous rate of the relay link is maximized [15, 16], the destination also needs the individual CSIs in order to know the pairing strategy used at the relay.

Conventionally, the backward and forward channels can be estimated directly at the relay and destination, respectively, and then the relay spends additional time slots to send the backward channel estimation to the destination through the feedback channel. Apart from the additional energy and time slots consumed on the feedback transmission, the feedback information are subject to further distortion such as the quantization errors of the channel estimates and the errors in the communication through the feedback channel. An alternate way to estimate the individual channels at the destination is to apply a three-phase training strategy, that is, after the two phase training as in the estimation of the cascaded channel, the relay sends its own training to the destination so that the individual channels can be estimated [17, 18]. However, a critical drawback of this strategy is its incompatibility with the two-phase data transmission. Moreover, as demonstrated in [14, 19, 20], the performance of a two-hop MIMO AF relay system can be improved if the individual CSIs are exploited.

Therefore, in a practical scenario, it is important to design a MIMO relay scheme that provides both in accordance with the two-hop data transmission and the estimates of individual channels at the destination. In [21], a least square (LS) method is developed to estimate the individual channels from the singular value decomposition (SVD) of the cascaded channel.

Inspired by the superimposed training in point-to-point communications, [22, 23] designed a superimposed training strategy in AF OWRN, where all the nodes are equipped with single antenna. We extend the research to multiantenna scenario and introduce the superimposed training strategy into MIMO AF OWRN in this work. The individual channel estimation is accomplished at the destination, and the closed-form Bayesian Cramér-Rao lower bound (CRLB) is derived for the estimation of flat-fading MIMO channels, which is further used to guide the optimal training design. Specifically, the relay superimposes its own training signal over the received signal before forwarding it out, which provides the separated channel information, and the total relay power is allocated between the two parts reasonably. The simulation is provided to verify the Bayesian CRLB results by the normalized mean square error (NMSE) performance of a specific suboptimal channel estimation algorithm. The extracted individual channel information can be used to help predict the actions of the relay for data transmission, such as power allocation and carrier permutation, without the need of feedback from the relay.

The rest of the paper is organized as follows. The system model of superimposed training in the MIMO AF OWRN is given in Section 2. In Section 3, the Bayesian CRLB of superimposed training-based channel estimation is derived and used to design the optimal training sequences. A suboptimal channel estimation algorithm is described in Section 4. Finally, the simulation results are provided in Section 5 and conclusions are drawn in Section 6.

Notations. Vectors and matrices are boldface small and capital letters. The Hermitian, inverse, and pseudoinverse of the matrix are denoted by , , and , respectively. and are the trace and the Frobenius norm of . is the entry of , and denotes a diagonal matrix with the diagonal element constructed from . is the identity matrix, and is the basis vector with the element 1 and 0 otherwise. denotes the statistical expectation, and denotes the Kronecker product.

2. System Model

Consider a nonregenerative MIMO relay system as in Figure 1, there is one source with antennas, one destination with antennas, and one relay with receive antennas and transmit antennas. It is assumed that there is no spatial correlation at the relay, and the direct link between the source and destination is ignored due to the larger distance and additional pathloss compared to the relay link. When SM is applied at the source, it requires and such that the compound channel can support independent substreams. For expositional purpose, we assume without loss of generality in the following. However, our results below can be extended easily to the case . Through this paper, we assume perfect synchronization among all terminals, and the total transmit power for the source and the relay is the same.

We restrict our discussion to the case of a slow, frequency-flat block fading model. The data transmission takes place in two time slots using two-hops. During the first time slot, the source transmits the signal to the relay and yields the received signal as where is the received signal vector at the relay, is the transmit signal vector with covariance matrix , and denotes the transmit power of the source. denotes the backward channel matrix and its entries are assumed to be zero-mean circular symmetric complex Gaussian (ZMCSCG) random variables of unit variance. is the complex circular additive white Gaussian noise (AWGN) vector at the relay with zero mean and covariance matrix .

The relay processes the received signal and superimposes a new training vector over it. The forwarded signal at the relay can be written as where the scalar controls the power allocation between the received signal and the superimposed training, is a suboptimal but convenient forwarding scaling matrix with fixed as 1, and corresponds to the transmit power of the relay. The following constraint should be satisfied for the training block: For the power allocation scheme at the relay, define as the average power assigned for the superimposed training at the relay, and then we have

The received signal at the destination is represented as where denotes the forward channel matrix, and its entries are assumed to be ZMCSCG random variables of unit variance. is the complex circular AWGN vector at the destination with zero mean and covariance matrix . is defined as the cascaded channel from the source to the destination, and is the overall noise at the destination.

For channel estimation purposes, we assume that two sets of known training sequences and of length is transmitted, where . The optimal training sequences are orthogonal across all transmitting antennas [23], that is,

Then the data model during one training period can be expressed as where is the compound noise matrix during the training block. The task of channel estimation in the MIMO OWRN is to find and from at the destination.

3. Bayesian Cramér-Rao Bound and the Training Design

For many practical estimation problems, popular estimators such as the maximum likelihood (ML) estimator or the maximum a posteriori (MAP) estimator are infeasible, so one has to resort to suboptimal estimators, which are typically evaluated by determining mean square error (MSE) through simulations and by comparing this error to theoretical performance bounds. In particular, the family of Cramér-Rao bound (CRB) has been shown to give tight estimation lower bounds in a number of practical scenarios.

3.1. Bayesian Cramér-Rao Bound

The CRB for the estimation of deterministic parameters is given by the inverse of the FIM, and Van Trees derived an analogous bound to the CRB for random variables, referred to as “Bayesian CRB” (BCRB). Unlike the standard and modified CRBs, the statistical dependence is naturally considered within the BCRB framework. With the assist of BCRB, the performance of the suboptimal estimators in the MIMO OWRN can be assessed, and the optimal training design can be obtained. Given the parameter vector , the Gaussian random vector to be estimated, where and are the vector forms of and , and the Bayesian FIM is defined as where the expectation is taken over the joint probability density function (pdf) .

Lemma 1. The Bayesian FIM is computed as where and are the covariance matrices of and , respectively, and

Proof. See Appendix A.

Before proceeding, we give the Bayesian FIM results for two special cases: and . Fix the parameters , , , and corresponds to the case when all the relay power is allocated to the superimposed training, then we have

The results for are in line with our expectation, since the superimposed training occupies all the relay power, which means the original training from the source is discarded by the relay, the direct outcome is that the training power only works on the estimation of .

As for , all the relay power is used to amplify and forward the received signal from the source, and , then we have where

The results for are also expectable since the training power is concentrated in the original training sequences from the source, which works on the estimation of both and at the same time.

Since and , the BCRBs for and can be separately expressed as and the channel error covariances are lower bounded by where means that is positive semidefinite. Furthermore, the channel estimation MSEs are lower bounded by

3.2. Training Design from Bayesian Cramér-Rao Lower Bound

For the purpose of minimizing the MSEs in the channel estimation, the two optimizations on the training sequences in the MIMO AF OWRN are formulated as

Lemma 2. The training sequences satisfying are optimal solutions to and .

Proof. See Appendix B.

The BCRBs become diagonal matrices with the optimal training in and , and the bounds for MSEs of individual channel taps can be obtained as

4. Suboptimal Channel Estimation Algorithm

Using the optimal training vectors, we refer to the suboptimal estimators to verify the Bayesian Cramér-Rao Lower Bound (CRLB) results in the MIMO AF OWRN, since the channel statistics are assumed known. A linear minimum mean square error (LMMSE) estimator is designed to estimate the two sets of MIMO channels. The corresponding covariances and are given as below, and the calculations are listed in Appendix C,

Recalling , and and are mutually independent, we have

The LMMSE estimate of is expressed as and the corresponding estimate error can be computed as

Similarly, the LMMSE estimate of is expressed as and the corresponding estimate error can be computed as

With and , the estimate of can be computed as

Other estimation algorithms can always be applied to achieve better MSE performance with additional computational complexity accompanied, and here we give an example of modified LMMSE algorithm. Since can be directly obtained from , we can further exploit for the estimation of , that is, remove the impact of over from according to the interference cancellation principle, and take into account the fact that the overall noise component at the destination contains certain information about .

The modified estimation process is as follows, after obtaining the LMMSE estimate , the received signal can be rewritten as where , then the LMMSE estimate of can be computed as where .

5. Simulations

In this section, we provide numerical results to verify our studies. For the sake of simplicity and without loss of generality, we set , , , and . The optimal training sequences designed from the Bayesian CRLBs are used, and the total training power is the same in all the scenarios. The NMSE is defined as the channel estimation MSE divided by the product of the number of transmit antennas and receive antennas, for example, .

The theoretical Bayesian CRLBs are then calculated for and as the function of and SNR, as in Figures 2 and 3, respectively. It is observed that the Bayesian CRLB of is always smaller than that of for any given , which is expected because the superimposed training is devoted to the estimation of while the training sequences from the source contribute the same power to the estimation of both and . Moreover, as increases, the power allocated for the original training from the source becomes greater, which means that the training power for the estimation of becomes larger, thus the NMSE performance is improved and the Bayesian CRLB of gets lower accordingly. Meanwhile, the training for the estimation of comes from both the original training from the source and the superimposed training from the relay, so the training power is always balanced automatically, and thus the Bayesian CRLB of maintains at a stable level regardless of the values of .

We then examine the NMSE performance of the suboptimal estimator, as shown together with the theoretical Bayesian CRLBs in Figures 4, 5, and 6 for different values of . Since is obtained by removing the effect of , where the error in the estimated should be taken into account, the NMSE performance of channel estimation for is always better than that for , which is consistent with the comparison between the two Bayesian CRLBs. Moreover, as increases, the power allocated for the superimposed training from the relay becomes smaller; thus, the direct estimate of is deteriorated accordingly, which can be seen from the increasing gap between the channel estimation NMSEs and the Bayesian CRLBs of . As for the estimation of , although the power allocated for the original training from the source becomes greater as increases, the error effect from the estimation of also becomes greater, and thus the gap between the channel estimation NMSEs and the Bayesian CRLBs of remains basically unchanged for different values of .

We also provide the simulation results considering different numbers of antennas at the relay and destination, for example, same , different and in Figure 7 with fixed as 0.5. It is seen that when the same transmit power is applied for the source and relay, the antenna number affects the channel estimation in MIMO AF OWRN a lot. The channel estimation NMSE decreases when , the number of receive antennas at the destination, increases, especially the NMSE of , which is a consistent phenomenon as in the traditional MIMO channel estimation. On the other hand, the channel estimation NMSE increases when , the number of transmit antennas at the relay, increases. Note that, in the traditional multiple-input single-output systems, the channel estimation MSE is only related with the transmit power but is not related with the number of antennas. However, the reason for the degrading performance in MIMO AF relay networks is because the relay forwards additional noise to the destination too, while the total training power remains the same.

The NMSE performance of the modified LMMSE algorithm is depicted in Figure 8 with fixed as 0.5. As we can see, the estimated could directly benefit from the interference cancellation and the information of contained in the noise covariance, especially in low SNR region. The gap between the estimation NMSEs and their corresponding CRLBs is caused by the utilization of low-complexity LMMSE estimator instead of the optimal MMSE or MAP estimator.

The simulation results verify our proposal that with a fraction of relay power devoted to the superimposed training in the MIMO AF OWRN, the individual channel information of and can be directly obtained at the destination, and the channel estimation of and can be simplified while the corresponding NMSE performance maintains the same level as the conventional channel estimation in the case of the same power. Furthermore, we can expect that, the estimates in our proposal can be further improved, especially the estimate of , if the relay power in the MIMO AF OWRN can be increased, since more power can be allocated to the training sequences at the relay.

6. Conclusion

In this paper, we have investigated the superimposed training strategy in MIMO AF TWRN, which superimposes a new set of training vectors at the relay and provides the individual channel information at the destination to accomplish and simplify the channel estimation. The closed-form Bayesian CRLB has been derived and then used to guide the optimal training design in MIMO AF OWRN. The simulation results have been provided to verify the Bayesian CRLB results by the practical NMSE performance of a suboptimal channel estimator. With the individual channel information, the joint optimization of the throughout, as well as the tradeoff between rate and detection, can be achieved at the destination.

Appendices

A. Proof of Lemma 1

The FIM can be modified as where the first and the second expectations are over and , respectively.

Since and are mutually independent, we have , and the second term in (A.1) becomes

Denote , and . Conditioned on , the mean value of can be defined as or equivalently

Then the received signal at the destination after vectorizing can be rewritten as or equivalently

For computational simplicity, we further define which is the FIM for the deterministic .

The log-likelihood function of the received signal at the destination can be written as The element of the deterministic FIM is computed as Conditioned on a specific channel realization, the noise term in is AWGN with variance , where

It can be obtained that where . Define a new matrix whose entry is the second term on the RHS of , Let , where is the length of the vector , and then could be expressed in a compact form Note that the columns of are orthogonal to each other.

can be obtained as

Note that . Following a similar approach, we obtain The Bayesian FIM can be found from averaging over .

It is assumed that there is no spatial correlation at the relay. According to the widely used Kronecker correlation model [24, 25], let and be the spatial correlation matrices (positive semidefinite and Hermitian) at the source and destination, then and can be expressed as and , respectively, where and are and independent and identically distributed (i.i.d.) ZMCSCG matrices. Note that and for normalization and that and are independent mutually which means that the backward link is independent from the forward link . Moreover, every Hermitian matrix is a normal matrix, and the finite-dimensional spectral theorem applies. It says that any Hermitian matrix can be diagonalized by a unitary matrix, and that the resulting diagonal matrix has only real entries. This implies that all eigenvalues of a Hermitian matrix are real, and that has linearly independent eigenvectors. The EVD of is denoted as , where , and is a unitary matrix.

From the above properties, it can then be calculated that and is an matrix. and is an matrix.

B. Proof of Lemma 2

We first derive the optimal solutions for . First observe that is a constant diagonal matrix, and the diagonal elements of are constant . We know that for any positive-definite matrix , there is where the equality holds when is diagonal. Applying this inequality, we obtain where the RHS is not related with and can immediately serve as the lower bound of . The lower bound is achieved when . Obviously, the choice satisfies the requirement.

Similarly we can obtain the optimal solution for . As for , is a constant diagonal matrix, and the diagonal elements of are cosnstant ( . Applying the inequality, we obtain where the RHS is not related with or and can immediately serve as the lower bound of . The lower bound is achieved when . Obviously, the choice satisfies the requirement.

Combing both proofs yields the conditions of and .

C. Calculation of Correlations

is computed as where is given as Thus, is computed as

Acknowledgment

This research was supported by the National Science Foundation of China (Grant no. NFSC 61071083).