Abstract

This paper proposes a novel multiuser (MU) multiplexing scheme for temporally correlated multiple-input, multiple-output (MIMO) channels, suitable for systems employing low-rate feedback links. A decentralized solution is obtained by the mobile receivers, which employ interference rejection combiner (IRC) linear filters and command the update of the corresponding per-user antenna transmit weights through compact feedback messages, thus avoiding explicit transmission of channel information. The proposed limited-feedback algorithm outperforms existing MU-MIMO solutions employing quantized matrices, operating at the same feedback overhead. A compensation mechanism is presented, which enables the proposed solution to operate under moderate probabilities of feedback errors, at the expense of a small downlink overhead.

1. Introduction

Limited feedback techniques are required in MIMO systems employing frequency division duplex (FDD), because the transmitter cannot directly acquire knowledge of the forward wireless channels of the mobile receivers. These techniques are used to enable different resource management tasks, namely, user scheduling, adaptive modulation, transmit precoding for single user, and transmit precoding for multiuser multiplexing. The work presented in this paper relates to MU-MIMO with linear precoding that is based on limited feedback information.

MU-MIMO systems employing linear precoding have been studied under the assumption of full channel side information (CSI), see, for example, [1, 2]. These solutions can be implemented in time division duplex (TDD) systems, where the CSI can be obtained directly from uplink measurements. In FDD systems, however, the feedback links impose a limitation on the amount of CSI that can be obtained by the transmitter, thus motivating the study of limited feedback MU-MIMO techniques. Throughout this paper, we employ the term centralized MU-MIMO solutions to refer to schemes where the multiplexing solution is computed by the transmitter. Although centralized solutions are commonly derived under the assumption of full-CSI, their practical FDD implementations may still employ quantized channel matrices provided by the feedback links, see, for example, [3, 4].

This work, in contrast, presents a novel decentralized solution based on the classical interference rejection combiner (IRC) receiver [5] and the limited-feedback eigenbeamforming algorithm [6]. In the proposed scheme, the receivers optimize their respective access point-transmit antenna weights iteratively, to maximize the received signal-to-noise ratio in the presence of intracell interference. The feedback messages thus carry weight update commands, as opposed to the channel matrices themselves, and the role of the transmitter is to apply the changes to the corresponding antenna weights. The weight adaptation feedback features a recursion based on unitary coordinate plane transformations, also known as complex-valued Givens rotors. A minimum of one rotor per update is employed, each rotor requiring the feedback of two quantized angles. If allowed by the feedback bit budget, more rotors per update increase the tracking capabilities of the algorithm. The proposed scheme is tailored for the case of single stream transmission per user, where each receiver must have more than one antenna in order to be able to use an IRC structure. Furthermore, the performance of the proposed decentralized solution under weight recursions other than [6] is not treated here.

We compare the performance of the proposed algorithm to that of two variants of the (centralized) regularized block diagonalization (RBD) [1], when quantized channel matrices are provided by dominant eigenvector Linde-Buzo-Gray (DE-LBG) codebooks [4], which have been specifically designed for use with the RBD. Additionally, two recursive channel feedback techniques are also benchmarked. These schemes are based on the limited-feedback eigenbeamforming algorithms D-JAC [6] and ALE-PUB-S [7]. Since the simulation study considers only spatially uncorrelated channels, multiplexing solutions based on channel distribution information (CDI) such as [8] have not been considered.

2. System Model

We consider an MU-MIMO system using FDD, where users with antennas each move slowly with respect to a fixed transmitter with antennas. The transmission is organized into slots of symbol periods, each representing the transmission of a different symbol vector , where every user is allocated one data stream. The users are assumed to experience slowly fading frequency-flat channels, represented by matrices , . The transmitter employs a linear precoding matrix , with the per-user beamforming vectors stacked as .

At the end of the slot, each user can transmit an bits feedback message , upon which the transmit weights are updated. We assume that the feedback channel is delay- and error-free, therefore, the information sent at the end of the slot produces updated weights that are used from the beginning of the next slot. The frequency of the feedback message is related to the slot length and the symbol vector frequency by , .

Let be the data symbol for user at symbol period . The received signal for user reads where is additive circular Gaussian noise such that , and denotes Hermitian transpose. In (1), denotes the slot index and the update index of .

Let be the covariance of the total disturbance suffered by user , conditioned on the channel matrices , the weight vectors and the white noise power . This can be written as where the same power is allocated to each user.

The receivers employ linear IRC combiners based on the receive-diversity combiner [5]. The input to the detector is given by which is implemented at the receiver through a short-term structured estimate of defined in (2). It is assumed that the receiver acquires perfect knowledge of from common pilot signals.

3. Decentralized Closed-Loop MU-MIMO

This section presents a novel approach to multiuser multiplexing in closed-loop MIMO systems with temporally correlated channels. The mobile users employ the MIMO version of the classical IRC filter [5], as defined in (3). This allows a spatial whitening of the intracell interference produced by the other users. The weight adaptation is based on a decentralized alternating optimization (DAO) procedure. The per-user metric is the received signal-to-interference-plus-noise ratio (SINR).

It can be shown that given and , the combiner (3) produces an SINR Equations (2) and (4) determine how the transmit weight vectors affect the user SINRs, which in turn determine their link-level performance. The DAO approach to maximize the users SINRs would execute the steps.(1)Given estimates of , each user optimizes (4) by computing and feeding back the dominant eigenvector of . (2)The transmitter uses the transmit weights specified by each user, without attempting to jointly optimize the transmit weights. If the weights change, the covariance matrices change as well, according to (2). (3)Go to Step .

The DAO procedure resembles the alternating optimization (AO) technique [9], but differs in that the DAO does not use a single objective function. This comes from the fact that each receiver has no information about the performance of the others. Therefore, it is difficult to make general claims about convergence, and we resort to static channel simulations to illustrate some convergence properties of the DAO solution. This is given in Section 4.3.

Assuming that the channel coherence time allows for convergence and tracking of the optimal solution, it remains to specify a limited-feedback mechanism to transmit the vectors computed on each iteration, using a small number of feedback bits. This is described next.

One possibility to implement the DAO for MU-MIMO concept is to use the distributed Jacobi eigenbeamforming (D-JAC) algorithm [6], as means to track the optimal beamforming vectors with limited feedback. The D-JAC algorithm features an update based on unitary coordinate plane transformations, also referred to as complex-valued Givens rotors or Jacobi transformations. This method is designed to track the eigenbasis of a general Hermitian matrix according to its descending-order sorted eigenvalues. It is also possible to track only the dominant eigenvector of the matrix, which makes the method suitable for the proposed DAO. In the following, both transmitter and receiver keep a copy of a unitary matrix which contains as the first column, and the update of is done synchronously through the feedback message . Upon arrival of , the transmitter updates by doing where is an identity matrix, except for the entries , , , and , which contain the nontrivial terms and and is the first column of the identity matrix of size . The angle pair is retrieved from and the coordinate plane is selected sequentially with . Each rotor angle is quantized independently and uniformly, and the ranges for and are and , respectively [6]. Note that one can apply (5) more than once per update, provided that the angle pairs for all the rotors are contained in . This speeds up the convergence of the D-JAC algorithm, but requires larger . In the absence of feedback errors, both transmitter and receiver have identical copies of .

The receivers are responsible for computing the angle pairs for the rotor(s) involved in the update, and for feeding them back to the transmitter. How closely the weights match the dominant eigenvector of depends on the antenna array sizes , the feedback message frequency, the channel fading rate and also on how many rotors per update are encoded in . When using only one rotor per update, it can be desirable to limit the variations of the transmit weights between updates, to allow for convergence of the D-JAC update. Therefore, instead of tracking the dominant eigenvector of , the algorithm tracks the eigenvector of , where is a low-pass filtered covariance matrix defined as with an estimate of defined in (2) and a parameter controlling the filter memory.

The filtered matrix is used to compute an auxiliary channel correlation matrix , which is the input to the D-JAC algorithm and is defined as where the most current channel matrix should be used. In the system model of Section 2, this is the channel at the end of the slot, that is, . The matrix and the receive-side version of define the update of through one or more Jacobi transformations, as detailed in [6]. For each rotor to be applied in the update, an angle pair is computed, quantized, and included in . Furthermore, the computation of the rotor angles guarantees that the first column of tracks the dominant eigenvector of . This is so because the D-JAC algorithm operates similarly to the Jacobi method for Hermitian matrices [10].

The proposed decentralized MU-MIMO is summarized in Table 1 and referred hereafter to as DAO-D-JAC.

3.1. Compensation of Feedback Errors

In case of feedback errors, the angles that define the Givens rotor used in the weight update (5) are not the same at the transmit and receive side. This causes that the transmit weights used by the transmitter start to drift from the weights that the receiver is specifying. If uncorrected, this deteriorates the convergence and tracking performance of the proposed solution. To compensate for this effect, two mechanisms are considered. (1)A simple strategy where the matrices are reset to identities on both sides, after a fixed number of slots. This prevents accumulation of errors, but affects the tracking performance, since some slots will be required for readjusting the transmit weights. The periodicity of this resetting procedure needs to be adjusted according to the mobile speed and the probability of feedback errors.(2)A strategy where the received angles are forwarded by the transmitter to each user. The receiver compares the forwarded angles with its local values. Upon difference, reverts to its previous value, and a rotor based on the forwarded angles is applied. In this fashion, the matrices are synchronized at transmit and receive side, but random perturbations are introduced in the algorithm's convergence, which need to be corrected in subsequent updates. This concept is illustrated in Figure 1.

Current standards like the Third Generation Partnership Project (3GPP) Long Term Evolution (LTE) include basic feedback message forwarding as part of the downlink control information (DCI). This comes in the form of the transmit precoding matrix indicator (TPMI), which communicates the receiver which codebook entry has been employed by the transmitter [11]. These bits have channel coding protection, which lowers the probability of undetected errors in the forwarded messages. If the TPMI is to be used with recursive algorithms such as the proposed DAO-D-JAC, then the mechanism () can be employed to limit propagation of infrequent errors in the forwarded messages.

A thorough coverage of the compensation of feedback errors is beyond the scope of this paper. Simulations illustrating the performance of mechanism () are given in Section 4 under the assumption that no errors occur in the forwarded messages.

4. Simulations

We consider a case of , , and a single stream transmission for each user, with 16 QAM constellation and a symbol rate of 240 ksymb/s. Both users have the same SNR conditions () and speed relative to the transmitter. The carrier frequency is 2.1 GHz, there are symbol periods per slot and the feedback message frequency (once per slot) is 1500 Hz. Rayleigh fading channels are spatially uncorrelated and temporal correlation is generated by a fourth order Butterworth filter adjusted to the maximum Doppler frequency. The feedback overhead is the same for the considered schemes.

The DAO-D-JAC feedback messages contain bits each, encoding a one-rotor update per user. Each angle is quantized independently and uniformly with three bits, on its respective range. This bit budget per angle gives close to optimal performance for a single-rotor update, in slowly fading channels. The filter parameter has been optimized for the mobile speed of 3 km/h scenario and set to . We assume that each receiver can build a perfect estimate of its covariance matrix from (2) locally.

4.1. Benchmarking Methods

We compare the performance of the proposed decentralized solution with that of centralized RBD and centralized minimum mean square error (MMSE) solutions. The RBD is chosen, since specific feedback methods have been presented in literature. On the other hand, the MMSE solution serves as a good benchmarking reference under full CSI conditions.

Two types of feedback methods for the centralized MU-MIMO are considered: nonrecursive and recursive. All the methods under consideration provide the transmitter with a vectorized and normalized version of , namely, Note that this preserves the power relationship between the channel matrix columns, but does not preserve the power relation between channels of different users, as present in the full-CSI scenario. In the following, we summarize briefly the most important details of the benchmarking methods.

4.1.1. RBD

The RBD [1] is a decomposition-based approach, where the precoder matrices are factored as = . The terms separate the users, so that the total multiuser interference (MUI) plus noise at the receivers is minimized. This is done for each user through the singular value decomposition (SVD) of a matrix containing the channel matrices of the other users. Then the terms are computed independently for each user, to optimize the transmission through the precoded channel , for example with eigenbeamforming. Finally, the scalar factor normalizes the total transmit power and there is no explicit control of how much of the total transmit power is allocated to each user. This technique is referred to as “RBD full CSI” in the simulations results. Whenever quantized matrices are used instead of the true channel matrices, the channel feedback method is specified.

The RBD solution tends to null the MUI at high signal-to-noise ratios, implying that the IRC filter becomes a matched filter under low noise conditions. A matched filter has been used in the BER simulations. Note, however, that the feedback quantization generates increased MUI and an IRC filter could improve the performance by spatially whitening the MUI. This has not been explored here. Also, the performance of RBD refinements such as the iterative RBD (I-RBD) [1] have been left out.

4.1.2. Total MMSE

The mean square error given channel matrices and noise power can be written in terms of the precoding matrices for each user as where is the MUI plus noise term from (1), is the covariance matrix of from (2), is the receive filter matrix from (3) and is the average power per symbol per stream. The total MMSE solution for can then be found with numerical methods when (3) is used to replace and the objective function is the sum of the MSE of each user. This is similar to the “direct optimization” approach of [12], except that we write the MSE explicitly as a function of , rather than a function of the receive filters .

4.1.3. Nonrecursive Feedback Methods

The nonrecursive methods are based on static dominant eigenvector Linde-Buzo-Gray (DE-LBG) codebooks, which have been presented in the context of limited-feedback RBD [4]. The chosen vector is selected using a minimum chordal distance criterion, and the index is fed back to the transmitter, which reshapes the column vector into the channel matrix estimate. The codebooks are obtained by using the Linde-Buzo-Gray algorithm [13]. The chordal distance is used as distortion measure, and the centroids are computed as the dominant eigenvector of the sample covariance matrix of each cluster. The combination of RBD and nonrecursive DE-LBG codebooks of size is referred-to as “RBD-DE-LBG bits” in the simulations section.

A similar approach for feeding back the optimal beamformer in the MISO case (no vectorizing/reshaping is need) was already considered in [14], where a maximum received SNR criterion is used to partition the training vectors. This is equivalent to the chordal distance criterion in [4]. Another application of the LBG algorithm with eigendecomposition of the cluster covariance matrices can be found in [15], where the goal is to feed back quantized orthogonal bases associated to the channel matrices.

4.1.4. Recursive DE-LBG Codebooks

In general, a given codebook entry can be chosen in two or more consecutive slots, depending on the fading rate and the codebook size. Clearly, these consecutive repetitions of the codebook index do not increase the CSI accuracy at the transmitter, and the associated feedback bits are wasted. For fixed fading rates and vector sizes, it is possible to design hierarchical codebook structures, operating in a nested fashion: a codeword is chosen at the beginning of a period and the following feedback messages indicate codewords from a finer codebook. More details on the design and performance of these nested codebooks can be found in [15]. We implement a similar strategy here, but restrict the codebook hierarchy to two layers and the same codebook sizes on both layers. This is done to keep the feedback rate constant. We use a set of training vectors and first produce one DE-LBG codebook. Then we produce one codebook of the same size for each cluster produced in the previous step. For the particular mobile speed of 3 km/h, we have chosen periods of ten slots, where the coarse codebook index is followed by nine slots conveying indices within the associated nested codebook. This benchmarking method will be labeled “RBD-2-layer nested DE-LBG” in the simulation results.

Related works in recursive codebooks can be found as switched codebooks [16], which aim to adapt to spatial or temporal channel correlation. A different approach of rotating and scaling a codebook to increase the beamforming accuracy has been explored in the context of spatial correlation in [17].

4.1.5. Recursive Feedback Based on Eigenbeamforming

Tracking the vectorized and normalized channel defined in (8) is easily related to single-stream eigenbeamforming methods, which signal changes in the dominant eigenvector of the channel correlation matrix . By replacing this matrix with the rank-one matrix , one can convey to the transmitter, where a phase rotation to the whole vector may appear, depending on the feedback method of choice. This ambiguity, however, does not impact the performance of the RBD and is also inherently present in codebook designs based on the chordal distance.

The first of the eigenbeamforming-based channel feedback methods employs the D-JAC algorithm [6], which is also used to track the dominant eigenvector of in the proposed decentralized algorithm. Here, however, the Givens or Jacobi rotors operate on planes for the vectorized channel, as opposed to the DAO-D-JAC, where only planes are required. We refer to this method as “RBD-EBF-CHFB ( D-JAC )” in the simulation results. Only one rotor per update is employed, with .

The second method is based on a partial update (PU) eigenbeamforming scheme. The alignment-enhanced partial update beamforming (ALE-PUB) algorithm [7] computes and signals the PU of the optimal beamforming vector as to maximize the received maximum ratio combining (MRC) power, constrained to updating only one vector element on each slot. In particular, the ALE-PUB-S operates sequentially (round-robin) on the elements of the vectorized channel . For each slot, the power and phase of the updated element are uniformly quantized and fed back to the transmitter. For , three bits are used for power and phase quantization, respectively. The associated simulation curves are labeled “RBD-EBF-CHFB (ALE-PUB-S 6 bits)” and the power and phase of the PU are quantized with three bits each.

Other recursive eigenbeamforming methods can also be employed, but have not been benchmarked. For example, in [18] the optimal beamformer is parametrized as angles corresponding to a cascade of Givens rotors and unit-modulus scalings, and each parameter is tracked with a single-bit structure similar to the adaptive delta modulation concept from early speech coding techniques [19]. However, as the number of real-valued parameters in the considered scenario is , it is not possible to update all the parameters every slot.

4.2. Bit Error Rate and Ergodic Capacity

The average BER simulations are shown in Figure 2. It can be seen that the DAO-D-JAC algorithm outperforms centralized alternatives operating at the same feedback rate. We note that nonrecursive DE-LBG suffers from error floors, which is consistent with [3, 4] because the codebook size does not scale with the SNR. On the other hand, the estimation of may produce a slight degradation on the performance of DAO-D-JAC. This is an implementation-dependent feature beyond the scope of this work. Furthermore, the estimation can be improved if the transmitter forwards the used transmit weights in a common control channel. It is also shown that the limited-feedback degradation for RBD is less severe when smaller-size constellations such as QPSK are used.

Figure 3 gives the degradation of the BER performance as function of the speed. Note that increasing the speed decreases the channel coherence time and thus limits the time available for convergence and tracking, even though the processing delay of the feedback message is neglected. It is shown that the centralized solution based on closed-loop eigenbeamforming outperforms the nonrecursive DE-LBG up to mobile speeds of approximately 40 km/h, and that the proposed DAO-D-JAC degrades less than the recursive centralized solution, as the speed increases. It can be seen that the filtering parameter does not have a noticeable impact at low SNRs.

Figure 4 shows the average of the instantaneous total rate , where the per-user SINRs are defined in (4). We note that the RBD-DE-LBG 7-bit codebook curves generally agree with Figure of [4], and that a one stream per user strategy is preferable over the two-stream alternative, when the nonrecursive 7-bit codebook is used. It is also shown that the two-layer nested DE-LBG with improves the performance, compared to the nonrecursive DE-LBG with . However, additional layers would be required to further boost the throughput, as illustrated by the results for recursive channel feedback using ALE-PUB-S and D-JAC.

Finally, the performance of the angle forwarding mechanism as compensation for feedback errors is shown in Figure 5, for the slowly fading case. The simulations assume Gray encoding of the quantized rotor angles and i.i.d. feedback bit errors with a given probability. It can be seen that the compensation is effective, and allows the proposed AOD-JAC to outperform the centralized alternative up to feedback bit error probabilities of 10%. Furthermore, the per-user overhead incurred in the downlink path is small, namely . The effect of the feedback errors under 1% probability is illustrated for static channels in Figures 6 and 7. This suggests that as long as no errors are found in the forwarded angles, reasonably low amounts of feedback errors do not compromise the convergence of the DAO technique.

4.3. Convergence in Static Channels

In this section, we briefly explore the convergence properties of the DAO technique which underlies the proposed decentralized MU-MIMO solution. The results presented in the previous section suggest that the DAO solutions are good for small systems. To further characterize the properties of the DAO method, we study the performance of the transmit weights obtained upon convergence in static channels, under full-CSI conditions. Note that even though the DAO is similar to the AO technique [9], we cannot trivially derive converge properties from existing AO results, and therefore rely on simulations with large number of channel realizations. All the DAO-D-JAC solutions have been obtained with a fixed number of iterations, where the precoders of the last step are used. We noticed some oscillations in the per-user SINR learning curves when increasing the probability of feedback errors.

Figures 6 and 7 give the distributions of the per-user SINRs obtained after convergence of the DAO weights, when one stream per user is transmitted and the ratio of transmit power to noise power is dB. Figure 6 shows that the DAO solution for two users performs comparably to the RBD, but is suboptimal when compared to the MMSE solution in Section 4.1.2.

Figure 7 shows the simulation results when the number of users is increased to four. It can be seen that the DAO solution performs considerably worse than the MMSE solution, but it does not break down. This does not necessarily mean, however, that the limited-feedback DAO will perform worse than the centralized MMSE, since it is not known how the MMSE solution degrades when channel feedback mechanisms are used. A limited-feedback performance comparison for this scenario is deferred to further research. We also note that the RBD solution can be refined by means of the iterative RBD (IRBD) technique, as described in [1].

5. Conclusion

This paper presented a limited-feedback decentralized multiuser MIMO solution, tailored for the particular case of single data stream per user transmission and low mobility scenarios. The proposed technique exploits interference rejection combiner receivers to achieve a receiver-based MU-MIMO solution which avoids feeding back the channel matrix explicitly, where the transmitter adapts the transmit weights according to compact feedback messages sent by the mobile receivers. The weight recursion features low computational complexity and is based on complex-valued Givens rotors, where a minimum of one rotor per update can be used. The proposed transmission scheme is shown to obtain uniform average BER and throughput performance improvements, when compared with existing centralized MU-MIMO solutions using channel feedback methods operating at the same feedback rate. A feedback error compensation mechanism is presented, which enables the operation of the proposed algorithm under moderate probabilities of feedback bit errors.