Abstract

Approximate minimum symbol error probability transceiver design for single-user MIMO systems under the practical per-antenna power constraint is considered. An upper bound on a lower bound for the minimum distance between the symbol hypotheses is established. Necessary conditions on, and structures of, the transmit covariance matrix for reaching the upper bound are discussed. Three numerical approaches (rank zero, rank one, and permutation) for obtaining the optimum precoder are proposed. When the upper bound is reached, the resulting design is optimum. When it is not reached, a numerical fix is used. The approach is simple and can be of practical use.

1. Introduction

Since multiple-input multiple-output (MIMO) is a very promising technology for mitigating the spectrum scarcity problem, many MIMO transceiver designs have been published recently. The designs have been based on a variety of criteria, for example, maximum capacity, minimum mean square error (MMSE), and minimum bit error rate (Min BER). Considered in this paper is the minimum symbol error probability transceiver design subject to the per-antenna power constraint. This choice is motivated by several reasons. Firstly, the minimum symbol error probability criterion is directly related to the system performance. Secondly, the per-antenna power constraint is more practical than the commonly used total power constraint in MIMO systems (because each antenna has its own power amplifier and each power amplifier has a limited dynamic range). Lastly, both the criterion and the metric are difficult to tackle.

Minimum symbol error probability and the related Min BER design problems have been formulated in various ways (e.g., [1–7]). For example, [1] performs its minimum symbol error probability design by maximizing a lower bound for the minimum distance of the symbol hypotheses. As the metric is a lower bound, the design is approximate in nature. For the same reason, however, it is independent of the symbol alphabet. The lower bound in [1] is the minimum eigenvalue of a positive definite system matrix, which makes the problem a max-min eigenvalue one. For the sum power constraint, the optimum design in [1] is neatly obtained by making all eigenvalues equal.

In this paper, we follow the formulation in [1] because it involves interesting and challenging signal processing issues. As already mentioned, however, the constraint here is the per-antenna one. The upper bound of the cost function is established using the per-antenna power constraint and the special structure of a system matrix (which involves the precoder matrix, noise covariance matrix, and channel matrix). The necessary conditions and structures of the transmit covariance matrix for reaching the upper bound are discussed in detail. Three numerical approaches (rank zero, rank one, and permutation) for obtaining the optimum precoder are proposed. When the upper bound is reached, the resulting design is optimum. When the upper bound is not reached, a numerical fix is used. Extensive numerical studies have been performed to assess the performance of the proposed methodology. Although the upper bound is not reached in most cases, good performance in mutual information and signal to interference plus noise ratio (SINR) is achieved. Moreover, the approach is very simple and can be of practical use.

Notations: boldface letters denote either vectors (lower case) or matrices (upper case). $\mathbf{A}^{T}$, $\mathbf{A}^{*}$, $\mathbf{A}^{-1}$, $\operatorname{tr}(\mathbf{A})$, $E(\mathbf{A})$, $\lambda_{\max}(\mathbf{A})$, and $\lambda_{\min}(\mathbf{A})$ stand for the transpose, conjugate transpose, inverse, trace, expectation, maximum eigenvalue, and minimum eigenvalue of $\mathbf{A}$, respectively. $\lambda_i(\mathbf{A})$ stands for the $i$th eigenvalue of $\mathbf{A}$ when its eigenvalues are arranged in increasing order. $\mathbf{I}_r$ is the $r\times r$ identity matrix. $\mathbf{0}$ is the zero matrix of appropriate dimension. $\mathbf{e}_i$ denotes the $i$th column of the identity matrix (the size of which will be clear from the context). $\operatorname{diag}(\mathbf{a})$ denotes the diagonal matrix with $\mathbf{a}$ on the main diagonal. On the other hand, if $\mathbf{A}$ is a square matrix, $\operatorname{diag}(\mathbf{A})$ is the main diagonal of $\mathbf{A}$. $\mathbf{A}>\mathbf{B}$ and $\mathbf{A}\ge\mathbf{B}$ mean that $\mathbf{A}-\mathbf{B}$ is positive definite and positive semidefinite, respectively. $\mathbf{A}\circ\mathbf{B}$ denotes the elementwise product of $\mathbf{A}$ and $\mathbf{B}$. $\max(a,b)$ means the maximum of the real numbers $a$ and $b$. $CN(\mathbf{0},\mathbf{R})$ denotes a zero-mean circularly symmetric complex normal random vector with covariance matrix $\mathbf{R}$.

2. Background, Problem, and Overview of Design

The received signal of the considered MIMO system is $\mathbf{y}=\mathbf{H}\mathbf{F}\mathbf{s}+\mathbf{n}$, where $\mathbf{H}$ is the full rank $n\times n$ channel matrix, $\mathbf{F}$ is the full rank $n\times n$ precoder, $\mathbf{s}$ is the $n\times 1$ data vector, and $\mathbf{n}$ is the $n\times 1$ received noise vector. The full rank $n\times n$ decoder $\mathbf{G}$ is applied to $\mathbf{y}$ to yield $\mathbf{G}\mathbf{y}$. For convenience and without loss of generality, let the source covariance matrix $E(\mathbf{s}\mathbf{s}^{*})=\mathbf{I}$ and the noise covariance matrix $E(\mathbf{n}\mathbf{n}^{*})=\mathbf{R}_{\mathbf{nn}}>\mathbf{0}$. As the transmitter's power must be constrained, the precoder $\mathbf{F}$ is required to satisfy the per-antenna power constraint
$$\mathbf{e}_i^{*}\left(\mathbf{F}\mathbf{F}^{*}\right)\mathbf{e}_i\le d_i,\quad\forall i. \tag{1}$$
Here, $d_i>0$, $\forall i$. Note that (1) also results in the average total power being upper bounded by $P=\sum_{i=1}^{n}d_i$. Let $S$ denote the set of all feasible $\mathbf{F}$'s.

2.1. Problem Formulation

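The goal of this paper is to design $\mathbf{F}$ and $\mathbf{G}$ to approximately minimize the probability of error in an alphabet independent manner. To accomplish this, we will begin as in [1]. Define $\mathbf{A}\triangleq\mathbf{H}^{*}\mathbf{R}_{\mathbf{nn}}^{-1}\mathbf{H}$ and
$$\boldsymbol{\Psi}(\mathbf{F},\mathbf{G})\triangleq\mathbf{F}^{*}\mathbf{H}^{*}\mathbf{G}^{*}\left(\mathbf{G}\mathbf{R}_{\mathbf{nn}}\mathbf{G}^{*}\right)^{-1}\mathbf{G}\mathbf{H}\mathbf{F}. \tag{2}$$
$\lambda_{\min}(\boldsymbol{\Psi}(\mathbf{F},\mathbf{G}))$ actually lower bounds the minimum distance between the symbol hypotheses. In addition,
$$\boldsymbol{\Psi}(\mathbf{F},\mathbf{G})\le\mathbf{F}^{*}\mathbf{A}\mathbf{F} \tag{3}$$
with equality when $\mathbf{G}=\mathbf{F}^{*}\mathbf{H}^{*}(\mathbf{H}\mathbf{F}\mathbf{F}^{*}\mathbf{H}^{*}+\mathbf{R}_{\mathbf{nn}})^{-1}$, that is, when the MMSE decoder is used. Thus, we will choose $\mathbf{G}=\mathbf{F}^{*}\mathbf{H}^{*}(\mathbf{H}\mathbf{F}\mathbf{F}^{*}\mathbf{H}^{*}+\mathbf{R}_{\mathbf{nn}})^{-1}$ and design the precoder $\mathbf{F}$ according to the optimization problem
$$\max_{\mathbf{F}\in S}\mu(\mathbf{F}),\qquad\mu(\mathbf{F})\triangleq\lambda_{\min}\left(\mathbf{F}^{*}\mathbf{A}\mathbf{F}\right). \tag{4}$$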

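Note that if the equality average total power constraint $\operatorname{tr}(\mathbf{F}\mathbf{F}^{*})=P$ were used instead, the solution to (4) would be given by [1]. That is,
$$\mathbf{V}\boldsymbol{\Lambda}^{-1/2}\left(\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}\right)^{1/2}=\arg\max_{\operatorname{tr}\left(\mathbf{F}\mathbf{F}^{*}\right)=P,\ \mathbf{F}\in\mathbb{C}^{n\times n}}\mu(\mathbf{F}), \tag{5}$$
where the unitary matrix $\mathbf{V}$ and the diagonal matrix $\boldsymbol{\Lambda}$ are obtained from the eigenvalue decomposition $\mathbf{A}=\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^{*}$ (eigenvalues in descending order).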
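The closed form in (5) can be checked numerically. The following is a minimal NumPy sketch, not the authors' code: the function name `total_power_precoder`, the random channel, and the choice of an identity noise covariance are assumptions made only for illustration.

```python
import numpy as np

def total_power_precoder(A, P):
    """Precoder of (5): F = V Lambda^{-1/2} sqrt(P / tr(Lambda^{-1}))."""
    lam, V = np.linalg.eigh(A)              # eigh returns ascending eigenvalues
    lam, V = lam[::-1], V[:, ::-1]          # descending order, as in the text
    scale = np.sqrt(P / np.sum(1.0 / lam))
    return V @ np.diag(lam ** -0.5) * scale

# Illustrative data (assumed): random 4x4 channel, R_nn = I, so A = H^* H.
rng = np.random.default_rng(0)
n, P = 4, 10.0
H = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
A = H.conj().T @ H

F = total_power_precoder(A, P)
total_power = np.trace(F @ F.conj().T).real      # should equal P
eigs = np.linalg.eigvalsh(F.conj().T @ A @ F)    # all equal to P / tr(Lambda^{-1})
```

All eigenvalues of $\mathbf{F}^{*}\mathbf{A}\mathbf{F}$ coincide for this precoder, which is the equal-eigenvalue property of the total power solution mentioned in the Introduction.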

2.2. Overview of Design of F

The optimization problem (4) is very complicated. Though not mentioned in Section 2.1, we desire a low complexity algorithm to design $\mathbf{F}$. We will thus make some simplifications.

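Before detailing the simplifications and the algorithm, we first need some analysis of the cost function of (4). Since $\mathbf{F}^{*}\mathbf{A}\mathbf{F}$ and $\mathbf{A}\mathbf{F}\mathbf{F}^{*}$ have the same eigenvalues, $\mathbf{A}\mathbf{F}\mathbf{F}^{*}\mathbf{z}=\mu(\mathbf{F})\mathbf{z}$, where $\mathbf{z}$ is an eigenvector of $\mathbf{A}\mathbf{F}\mathbf{F}^{*}$ associated with the minimum eigenvalue $\mu(\mathbf{F})$. Noting that $\mathbf{A}>\mathbf{0}$,
$$\mu(\mathbf{F})=\frac{\mathbf{z}^{*}\mathbf{F}\mathbf{F}^{*}\mathbf{z}}{\mathbf{z}^{*}\mathbf{A}^{-1}\mathbf{z}}\le\frac{\mathbf{x}^{*}\mathbf{F}\mathbf{F}^{*}\mathbf{x}}{\mathbf{x}^{*}\mathbf{A}^{-1}\mathbf{x}}, \tag{6}$$
where $\mathbf{x}$ is any nonzero $n\times 1$ vector. Define $\mathbf{B}=[b_{ij}]\triangleq\mathbf{A}^{-1}$ for convenience. Plugging in $\mathbf{e}_i$ for $\mathbf{x}$ in (6) and using the constraint (1), it is then clear that, for all $\mathbf{F}\in S$,
$$\mu(\mathbf{F})\le\min_i\frac{d_i}{b_{ii}}=\min\left\{\frac{d_1}{b_{11}},\frac{d_2}{b_{22}},\ldots,\frac{d_n}{b_{nn}}\right\}\triangleq\frac{d_k}{b_{kk}}\triangleq\rho. \tag{7}$$
That is, the cost function in (4) is upper bounded by $\rho$.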
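As a quick illustration of the bound (7), the sketch below computes $\rho$ and the index $k$ and compares them with $\mu(\mathbf{F})$ for one feasible precoder. The helper names `upper_bound` and `mu`, the random data, and the power values are hypothetical; indices are 0-based in the code.

```python
import numpy as np

def upper_bound(A, d):
    """Return (rho, k) of (7): rho = min_i d_i / b_ii with B = A^{-1} (k is 0-based)."""
    b_diag = np.real(np.diag(np.linalg.inv(A)))
    ratios = np.asarray(d, dtype=float) / b_diag
    k = int(np.argmin(ratios))
    return ratios[k], k

def mu(A, F):
    """Cost function of (4): minimum eigenvalue of F^* A F."""
    return np.linalg.eigvalsh(F.conj().T @ A @ F).min()

# Illustrative data (assumed): random channel, R_nn = I, arbitrary antenna powers.
rng = np.random.default_rng(1)
n = 4
H = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = H.conj().T @ H
d = np.array([1.0, 2.0, 0.5, 1.5])
rho, k = upper_bound(A, d)

# A feasible precoder: random matrix whose i-th row is scaled to power d_i,
# so that e_i^* (F F^*) e_i = d_i and (1) holds with equality.
F = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
F *= np.sqrt(d / np.sum(np.abs(F) ** 2, axis=1))[:, None]
mu_F = mu(A, F)      # never exceeds rho, by (7)
```

A randomly drawn feasible $\mathbf{F}$ will typically fall well below $\rho$; the approaches of Section 4 search for a precoder that meets it.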

The upper bound $\rho$ may be reachable. Consequently, the algorithm to design $\mathbf{F}$ is as follows. We first try to find a precoder $\mathbf{F}_\rho\in S$ with maximum power (i.e., one that satisfies (1) with equality) which reaches the upper bound $\rho$:
$$\mu\left(\mathbf{F}_\rho\right)=\rho. \tag{8}$$
Any of the three approaches in Section 4 can be used for this search. If successful, we have found an optimal solution to (4) and are done; set $\mathbf{F}$ equal to $\mathbf{F}_\rho$. If unsuccessful, we get $\mathbf{F}$ by implementing a fix for the approach used to search for $\mathbf{F}_\rho$. This fix simply lowers the power of the $k$th antenna and is explained in Section 5. The low complexity of the three approaches in Section 4 is due to Section 3, which reveals the necessary structure and conditions for $\mathbf{U}_\rho=\mathbf{F}_\rho\mathbf{F}_\rho^{*}$.

3. Necessary Structure and Conditions

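If $\mathbf{F}_\rho$ exists, the transmit covariance matrix $\mathbf{U}_\rho$ would have a special structure and certain conditions would be true. Assume $\mathbf{F}_\rho$ exists. Then, observing from (6), $\mathbf{e}_k$ must be an eigenvector of $\mathbf{A}\mathbf{U}_\rho$ paired with the eigenvalue $\rho$, that is,
$$\mathbf{A}\mathbf{U}_\rho\mathbf{e}_k=\rho\mathbf{e}_k\Longleftrightarrow\mathbf{U}_\rho\mathbf{e}_k=\rho\mathbf{A}^{-1}\mathbf{e}_k=\rho\mathbf{B}\mathbf{e}_k. \tag{9}$$
In particular, the $k$th column of $\mathbf{U}_\rho$ must equal the $k$th column of $\mathbf{B}$ multiplied by $\rho$. Consequently, if $1<k<n$ and $[\mathbf{b}_1^{T}\ b_{kk}\ \mathbf{b}_2^{T}]^{T}$ denotes the $k$th column of $\mathbf{B}$, then, necessarily,
$$\mathbf{U}_\rho=\mathbf{F}_\rho\mathbf{F}_\rho^{*}=\begin{bmatrix}\boldsymbol{\Sigma}_1 & \rho\mathbf{b}_1 & \mathbf{L}\\ \rho\mathbf{b}_1^{*} & d_k & \rho\mathbf{b}_2^{*}\\ \mathbf{L}^{*} & \rho\mathbf{b}_2 & \boldsymbol{\Sigma}_2\end{bmatrix}>\mathbf{0}, \tag{10}$$
$$\mathbf{I}_{k-1}\circ\boldsymbol{\Sigma}_1=\operatorname{diag}\left(\begin{bmatrix}d_1 & \cdots & d_{k-1}\end{bmatrix}\right),\qquad\mathbf{I}_{n-k}\circ\boldsymbol{\Sigma}_2=\operatorname{diag}\left(\begin{bmatrix}d_{k+1} & \cdots & d_n\end{bmatrix}\right). \tag{11}$$
For the sake of clarity, the cases $k=1$ and $k=n$ are omitted in this paper.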

3.1. Necessary Conditions for $\mathbf{U}_\rho$ (1st Representation)

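As is to be expected, the remaining unspecified elements of $\mathbf{U}_\rho$ (i.e., $\mathbf{L}$ and the off-diagonal elements of $\boldsymbol{\Sigma}_1$ and $\boldsymbol{\Sigma}_2$) are not arbitrary. By reducing $\mathbf{U}_\rho$ to direct sum form using elementary block row and column operations, it can be seen that $\mathbf{U}_\rho>\mathbf{0}$ if and only if
$$\boldsymbol{\Sigma}_1>\mathbf{0},\qquad\boldsymbol{\Sigma}_2>\mathbf{0},\qquad\boldsymbol{\Sigma}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{L}>\mathbf{0}, \tag{12}$$
$$\frac{b_{kk}^{2}}{d_k}>\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1+\left(\mathbf{b}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1\right)^{*}\left(\boldsymbol{\Sigma}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{L}\right)^{-1}\left(\mathbf{b}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1\right). \tag{13}$$
Since $\boldsymbol{\Sigma}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{L}>\mathbf{0}$ (see (12)), the following necessary condition can be derived from (13):
$$\frac{b_{kk}^{2}}{d_k}>\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1. \tag{14}$$
Furthermore, since $\boldsymbol{\Sigma}_1>\mathbf{0}$, a redundant though useful necessary condition can be derived for the antenna powers $\{d_1,\ldots,d_k\}$, which depends only on the known parameters $\mathbf{b}_1$ and $b_{kk}$:
$$\frac{b_{kk}^{2}}{d_k}>\frac{\mathbf{b}_1^{*}\mathbf{b}_1}{\lambda_{\max}\left(\boldsymbol{\Sigma}_1\right)}\ge\frac{\mathbf{b}_1^{*}\mathbf{b}_1}{\operatorname{tr}\left(\boldsymbol{\Sigma}_1\right)}=\frac{\mathbf{b}_1^{*}\mathbf{b}_1}{d_1+d_2+\cdots+d_{k-1}}. \tag{15}$$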

An alternate necessary and sufficient condition for $\mathbf{U}_\rho>\mathbf{0}$ can be given by simply interchanging the subscripts 1 and 2, and $\mathbf{L}^{*}$ and $\mathbf{L}$, in (12) and (13). Thus, one can easily show that
$$\frac{b_{kk}^{2}}{d_k}>\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2,\qquad\frac{b_{kk}^{2}}{d_k}>\frac{\mathbf{b}_2^{*}\mathbf{b}_2}{d_{k+1}+d_{k+2}+\cdots+d_n} \tag{16}$$
are also necessary.

3.2. Necessary Conditions for $\mathbf{U}_\rho$ (2nd Representation)

Use an appropriate invertible, symmetric, real $n\times n$ permutation matrix $\mathbf{P}$ to permute $d_k$ to the upper left corner:
$$\mathbf{P}^{*}\mathbf{U}_\rho\mathbf{P}=\begin{bmatrix}d_k & \rho\tilde{\mathbf{b}}^{*}\\ \rho\tilde{\mathbf{b}} & \mathbf{Q}\end{bmatrix}. \tag{17}$$
(The structures of $\mathbf{P}$, $\mathbf{Q}$, and $\tilde{\mathbf{b}}$ are omitted here.) It turns out that $\mathbf{U}_\rho>\mathbf{0}$ if and only if $\mathbf{P}^{*}\mathbf{U}_\rho\mathbf{P}>\mathbf{0}$. Thus, $\mathbf{U}_\rho>\mathbf{0}$ if and only if (noting that $d_k>0$ as given in (1))
$$\mathbf{Q}-\frac{d_k}{b_{kk}^{2}}\tilde{\mathbf{b}}\tilde{\mathbf{b}}^{*}>\mathbf{0}. \tag{18}$$
Since both (12)-(13) and (18) are necessary and sufficient for $\mathbf{U}_\rho>\mathbf{0}$, they are equivalent to each other. Conveniently, the diagonal elements of $\mathbf{Q}$, $q_{ii}$, $i=1,\ldots,n-1$, are just the diagonal elements of $\boldsymbol{\Sigma}_1$ and $\boldsymbol{\Sigma}_2$ permuted. Consequently, requiring that the $q_{ii}$, $i=1,\ldots,n-1$, be equal to the correct antenna powers is equivalent to requiring conditions (11). Moreover, a redundant though useful necessary condition, which uses only the antenna powers, $\tilde{\mathbf{b}}$, and $b_{kk}$, can easily be derived from (18):
$$q_{ii}-\frac{d_k}{b_{kk}^{2}}\left|\tilde{b}_i\right|^{2}>0,\qquad(i=1,\ldots,n-1), \tag{19}$$
where $\tilde{b}_i$ is the $i$th entry of $\tilde{\mathbf{b}}$.
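Since the structure of $\mathbf{P}$ is omitted above, the sketch below assumes one simple choice that satisfies the stated properties: the symmetric permutation that swaps the first and the $k$th coordinates. The function names and the small real-valued example are hypothetical, and indices are 0-based in the code.

```python
import numpy as np

def second_representation(U, B, k):
    """Assumed P: swap coordinates 0 and k. Returns P, Q and b_tilde of (17)."""
    n = U.shape[0]
    perm = np.arange(n)
    perm[[0, k]] = perm[[k, 0]]
    P = np.eye(n)[:, perm]                 # symmetric, real permutation matrix
    W = P.T @ U @ P                        # [[d_k, rho b~^*], [rho b~, Q]]
    Q = W[1:, 1:]
    b_tilde = (P.T @ B[:, k])[1:]          # k-th column of B without b_kk, permuted
    return P, Q, b_tilde

def condition_18(Q, b_tilde, d_k, b_kk):
    """Check Q - (d_k / b_kk^2) b~ b~^* > 0."""
    S = Q - (d_k / b_kk ** 2) * np.outer(b_tilde, b_tilde.conj())
    return bool(np.linalg.eigvalsh(S).min() > 0)

# Illustrative data (assumed): 3x3 real A, unit antenna powers, k = 1 (middle antenna).
A = np.array([[2.0, 1.0, 0.0], [1.0, 2.0, 1.0], [0.0, 1.0, 2.0]])
B = np.linalg.inv(A)
d = np.ones(3)
k = 1
rho = d[k] / B[k, k]
U = np.diag(d)
U[:, k] = rho * B[:, k]                    # k-th column and row fixed by (9)
U[k, :] = rho * B[k, :]

P, Q, b_tilde = second_representation(U, B, k)
ok_18 = condition_18(Q, b_tilde, d[k], B[k, k])
ok_pd = bool(np.linalg.eigvalsh(U).min() > 0)   # agrees with ok_18
```

For this example $\mathbf{Q}$ is the $2\times 2$ identity and $\tilde{\mathbf{b}}=[-0.5,\,-0.5]^{T}$, so (18) reduces to checking a $2\times 2$ matrix instead of the full $\mathbf{U}_\rho$.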

4. Three Approaches for Obtaining $\mathbf{U}_\rho$

Each of the following three approaches seeks to find a $\mathbf{U}_\rho$ (i.e., $\mathbf{L}$, $\boldsymbol{\Sigma}_1$, and $\boldsymbol{\Sigma}_2$ in (10) or, equivalently, $\mathbf{Q}$ in (17)) which satisfies the necessary structure and conditions in Section 3.

4.1. Rank 0 Approach (R0A)

In this simple approach, the matrix $\mathbf{L}$ is chosen to have rank 0 (hence the name of the approach), that is, equal to all zeros. $\boldsymbol{\Sigma}_1$ and $\boldsymbol{\Sigma}_2$ are chosen to be diagonal matrices with the diagonal entries $(d_1,\ldots,d_{k-1})$ and $(d_{k+1},\ldots,d_n)$, respectively. Such a choice for $\mathbf{L}$, $\boldsymbol{\Sigma}_1$, and $\boldsymbol{\Sigma}_2$ automatically satisfies (11) and (12). If the last remaining necessary condition (13) is satisfied, construct $\mathbf{U}_\rho$ using (10) and check whether $\rho$ is the minimum eigenvalue of $\mathbf{A}\mathbf{U}_\rho$. If both of these conditions are satisfied, decompose $\mathbf{U}_\rho$ to get an $\mathbf{F}_\rho$ (and a corresponding $\mathbf{G}$); an optimum solution has been found. If either condition fails, use the fix in Section 5. It is interesting to note that the decomposition from $\mathbf{U}_\rho$ to $\mathbf{F}_\rho$ is not unique; indeed, $\mathbf{F}_\rho$ right multiplied by any unitary matrix is also a valid decomposition.
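The R0A steps can be sketched in a few lines of NumPy. This is an illustrative sketch under stated assumptions, not the authors' implementation: the function names are hypothetical, positive definiteness of $\mathbf{U}_\rho$ is tested through its eigenvalues (which Section 3.1 shows is equivalent to (12)-(13)), a Cholesky factor is used as one of the many valid decompositions, and indices are 0-based.

```python
import numpy as np

def rank0_candidate(A, d):
    """Build U_rho of (10) with L = 0 and diagonal Sigma_1, Sigma_2."""
    d = np.asarray(d, dtype=float)
    B = np.linalg.inv(A)
    ratios = d / np.real(np.diag(B))
    k = int(np.argmin(ratios))
    rho = ratios[k]
    U = np.diag(d).astype(complex)
    U[:, k] = rho * B[:, k]          # k-th column fixed by (9)
    U[k, :] = rho * B[k, :]          # Hermitian counterpart (B is Hermitian)
    U[k, k] = d[k]                   # rho * b_kk = d_k
    return U, rho, k

def reaches_bound(A, U, rho, tol=1e-9):
    """True if U > 0 and rho is the minimum eigenvalue of A U."""
    if np.linalg.eigvalsh(U).min() <= 0:
        return False
    mu = np.linalg.eigvals(A @ U).real.min()
    return bool(abs(mu - rho) <= tol * max(1.0, rho))

# Illustrative data (assumed): 3x3 real A with unit per-antenna powers.
A = np.array([[2.0, 1.0, 0.0], [1.0, 2.0, 1.0], [0.0, 1.0, 2.0]])
U, rho, k = rank0_candidate(A, [1.0, 1.0, 1.0])
success = reaches_bound(A, U, rho)
F = np.linalg.cholesky(U) if success else None   # one valid F with F F^* = U
```

In this small example the bound is reached with $k$ equal to the middle antenna and $\rho=1$; when `success` is false, the fix of Section 5 would be applied instead.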

4.2. Rank 1 Approach (R1A)

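Choose $\boldsymbol{\Sigma}_1$ and $\boldsymbol{\Sigma}_2$ as in R0A. If $\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1=\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2$, use another approach. If $\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1>\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2$, choose the rank one $\mathbf{L}$
$$\mathbf{L}=\frac{\mathbf{b}_1\mathbf{b}_2^{*}}{\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1}. \tag{20}$$
This $\mathbf{L}$ makes the right-hand side of (13) as small as possible and independent of $\boldsymbol{\Sigma}_2$, because it gives $\mathbf{b}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1=\mathbf{0}$. If $\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1<\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2$, choose the alternative rank one $\mathbf{L}$
$$\mathbf{L}=\frac{\mathbf{b}_1\mathbf{b}_2^{*}}{\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2}. \tag{21}$$
With these $\boldsymbol{\Sigma}_1$, $\boldsymbol{\Sigma}_2$, and $\mathbf{L}$, the power constraints in (11) and the first two conditions of (12) are automatically satisfied. Furthermore, the condition $\boldsymbol{\Sigma}_2-\mathbf{L}^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{L}>\mathbf{0}$ in (12) is also satisfied: use the fact that $\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1>\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2$ when (20) is used and that $\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1<\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2$ when (21) is used. However, (13) needs to be checked. In addition, whether $\rho$ is the minimum eigenvalue of $\mathbf{A}\mathbf{U}_\rho$ also needs to be checked. As in R0A, if both conditions are satisfied, decompose $\mathbf{U}_\rho$ to get an $\mathbf{F}_\rho$ (and a corresponding $\mathbf{G}$); an optimum solution has been found. If either condition fails, use the fix in Section 5.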
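The rank one choice of $\mathbf{L}$ in (20) and (21) amounts to dividing $\mathbf{b}_1\mathbf{b}_2^{*}$ by the larger of the two quadratic forms. A minimal sketch follows; the function name `rank1_L`, the hand-picked matrix `B`, and the antenna powers are assumptions for illustration, with 0-based indexing.

```python
import numpy as np

def rank1_L(B, d, k):
    """Rank one L of (20)/(21) for diagonal Sigma_1, Sigma_2 (k is 0-based, interior)."""
    d = np.asarray(d, dtype=float)
    b1, b2 = B[:k, k], B[k + 1:, k]
    a1 = np.real(np.vdot(b1, b1 / d[:k]))          # b1^* Sigma_1^{-1} b1
    a2 = np.real(np.vdot(b2, b2 / d[k + 1:]))      # b2^* Sigma_2^{-1} b2
    if np.isclose(a1, a2):
        raise ValueError("quadratic forms coincide; use another approach")
    return np.outer(b1, b2.conj()) / max(a1, a2)   # (20) if a1 > a2, else (21)

# Illustrative data (assumed): B plays the role of A^{-1}; k = 1 minimizes d_i / b_ii.
B = np.array([[2.0, 0.5, 0.1], [0.5, 1.0, 0.3], [0.1, 0.3, 1.5]])
d = np.array([4.0, 1.0, 4.0])
k = 1
L = rank1_L(B, d, k)

S1, S2 = np.diag(d[:k]), np.diag(d[k + 1:])
schur = S2 - L.conj().T @ np.linalg.inv(S1) @ L                 # third condition of (12)
residual = B[k + 1:, k] - L.conj().T @ np.linalg.inv(S1) @ B[:k, k]
```

Here $\mathbf{b}_1^{*}\boldsymbol{\Sigma}_1^{-1}\mathbf{b}_1=0.0625$ exceeds $\mathbf{b}_2^{*}\boldsymbol{\Sigma}_2^{-1}\mathbf{b}_2=0.0225$, so (20) applies; `residual` vanishes, which is what removes the second term on the right-hand side of (13).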

4.3. Permutation Approach (PA)

This third approach, unlike the previous two, searches for $\mathbf{U}_\rho$ using the 2nd representation of the necessary conditions (Section 3.2). It is based on two facts. The first is that $\mathbf{Q}=\mathbf{Q}^{*}$ and
$$\lambda_{\min}(\mathbf{Q})>\eta_{\min}\triangleq d_k b_{kk}^{-2}\tilde{\mathbf{b}}^{*}\tilde{\mathbf{b}} \tag{22}$$
together imply (18). The second is that a Hermitian $\mathbf{Q}$ satisfying (22) and having the correct diagonal entries (the antenna powers permuted) exists if and only if $\eta_{\min}<d_i$, for all $i$ (just apply the Schur-Horn Theorem [8]).

Granted that $\eta_{\min}<d_i$, for all $i$, the approach is as follows. First choose some $\varepsilon>0$ such that $\eta_{\min}+\varepsilon\equiv\gamma\le d_i$, for all $i$. Next, find a Hermitian $\mathbf{Q}$ whose diagonal entries are the correct antenna powers, $\{q_{ii}\}$, and whose eigenvalues are lower bounded by $\gamma$. Once a $\mathbf{Q}$ is found, construct (in light of (17))
$$\mathbf{U}_\rho=\left(\mathbf{P}^{*}\right)^{-1}\begin{bmatrix}d_k & \rho\tilde{\mathbf{b}}^{*}\\ \rho\tilde{\mathbf{b}} & \mathbf{Q}\end{bmatrix}\mathbf{P}^{-1}. \tag{23}$$
If $\rho$ is the minimum eigenvalue of $\mathbf{A}\mathbf{U}_\rho$, obtain $\mathbf{F}_\rho$ and a corresponding $\mathbf{G}$ from $\mathbf{U}_\rho$ as in R0A and R1A; an optimum solution has been found. If there is an eigenvalue smaller than $\rho$, use the fix in Section 5.

There are various ways to find a Hermitian $\mathbf{Q}$ with diagonal entries $\{q_{ii}\}$ and eigenvalues lower bounded by $\gamma$. A closed form solution is shown in Appendix A and a projection approach is shown in Appendix B.

5. A Fix

Consider an $\mathbf{L}$, $\boldsymbol{\Sigma}_1$, and $\boldsymbol{\Sigma}_2$ which satisfy (11) and (12). If (13) does not hold and/or if $\rho=d_k/b_{kk}$ is greater than the smallest eigenvalue of $\mathbf{A}\mathbf{U}_\rho$, proceed as follows. First, temporarily change the per-antenna power constraint for the $k$th antenna by replacing $d_k$ with $d_k'$. This makes $\rho$ become $d_k'/b_{kk}$, $\mathbf{U}_\rho$ become
$$\begin{bmatrix}\boldsymbol{\Sigma}_1 & \rho\mathbf{b}_1 & \mathbf{L}\\ \rho\mathbf{b}_1^{*} & d_k' & \rho\mathbf{b}_2^{*}\\ \mathbf{L}^{*} & \rho\mathbf{b}_2 & \boldsymbol{\Sigma}_2\end{bmatrix}, \tag{24}$$
and so forth. Second, lower $d_k'$ (maintaining $d_k'>0$) until (13) holds. Then, continue lowering $d_k'$ (maintaining $d_k'>0$), and thus $\rho=d_k'/b_{kk}$, until $\rho$ is the smallest eigenvalue of $\mathbf{A}\mathbf{U}_\rho$. As Appendix C shows, one can always lower $d_k'$ until this happens. Using this fix, (8) is thus obtained, albeit for a lower power constraint.

Lastly, now that $d_k'$ is low enough, decompose $\mathbf{U}_\rho$ to get $\mathbf{F}_\rho$ (and a corresponding $\mathbf{G}$). This $\mathbf{F}_\rho$ is full rank since (12)-(13) hold. In addition, it satisfies the inequality per-antenna power constraint (1) with the true $d_k$ and the $d_i$, for all $i\ne k$. Indeed,
$$\mathbf{e}_k^{*}\left(\mathbf{F}_\rho\mathbf{F}_\rho^{*}\right)\mathbf{e}_k=d_k'<d_k. \tag{25}$$
Thus, set $\mathbf{F}$ equal to $\mathbf{F}_\rho$.

Lowering $d_k'$ until (13) holds is understandable: $\mathbf{F}$ needs to be full rank. But why lower it further to satisfy (8) for a lower power constraint? Decomposing $\mathbf{U}_\rho$ to get $\mathbf{F}$ as soon as (13) holds would already yield a legitimate precoder. The reason for continuing is that it is observed numerically that lowering $d_k'$ further actually raises the minimum eigenvalue of $\mathbf{A}\mathbf{U}_\rho$. See Figure 6 and its discussion.
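The fix can be sketched as a simple loop. The version below is a hedged illustration, not the authors' code: it assumes the rank 0 structure ($\mathbf{L}=\mathbf{0}$, diagonal $\boldsymbol{\Sigma}_1$, $\boldsymbol{\Sigma}_2$), the one percent step mentioned for Figure 6, a hypothetical tolerance and iteration cap, and a hand-picked $3\times 3$ example; indices are 0-based.

```python
import numpy as np

def rank0_U(B, d, k, dk):
    """U_rho of (24) with L = 0, diagonal Sigma_1, Sigma_2, and k-th power dk."""
    rho = dk / B[k, k].real
    U = np.diag(np.asarray(d, dtype=float)).astype(complex)
    U[:, k] = rho * B[:, k]
    U[k, :] = rho * B[k, :]
    U[k, k] = dk
    return U, rho

def lower_until_bound(A, d, step=0.99, tol=1e-8, max_iter=5000):
    """Lower d_k' by 1% per step until U > 0 and rho = lambda_min(A U)."""
    d = np.asarray(d, dtype=float)
    B = np.linalg.inv(A)
    k = int(np.argmin(d / np.real(np.diag(B))))
    dk = d[k]
    for _ in range(max_iter):
        U, rho = rank0_U(B, d, k, dk)
        if np.linalg.eigvalsh(U).min() > 0:           # (12)-(13) hold
            eigs = np.linalg.eigvals(A @ U).real
            if eigs.min() >= rho - tol * eigs.max():  # rho is the smallest eigenvalue
                return U, dk, rho, k
        dk *= step
    raise RuntimeError("no suitable d_k' found within max_iter steps")

# Illustrative data (assumed): for this A and d the unmodified rank 0 candidate fails,
# because A U_rho has the eigenvalue 0.5 below rho = 0.6.
A = np.array([[1.0, 0.5, 0.5], [0.5, 1.0, 0.5], [0.5, 0.5, 1.0]])
d = [1.0, 0.9, 1.0]
U, dk_new, rho, k = lower_until_bound(A, d)
F = np.linalg.cholesky(U)                             # full rank precoder with F F^* = U
```

For this example the loop stops once $\rho$ drops just below 0.5, that is, at a $d_k'$ slightly under 0.75 instead of the original 0.9.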

6. Numerical Results

The numerical results are divided into two parts. In the first part, two examples are given to demonstrate the proposed approaches of Section 4 and the fix of Section 5. In the second part, Monte Carlo simulations are used to investigate how suboptimal, if at all, the proposed design methodology for $\mathbf{F}$ is. They also show how often the fix is needed.

6.1. Demonstration of the Proposed Design Methodology for F

Two examples are given here, each of which corresponds to one $\mathbf{H}$ and $\mathbf{R}_{\mathbf{nn}}$ (i.e., one $\mathbf{A}$). Without loss of generality, consider 10 antennas ($n=10$) with identical power constraints ($d_i=10$, $i=1,\ldots,n$). Thus, the maximum allowable total power is $P=\sum_{i=1}^{n}d_i=100$.

For the first numerical example, (8) is achieved by the R0A, R1A, and PA. The resulting eigenvalues of $\mathbf{A}\mathbf{U}$ for each of them are plotted in Figure 1. The upper bound $\rho$ and the optimum result under the total power constraint (see (5)) are also plotted as benchmarks. Several interesting observations can be made. Firstly, since $P=\sum_{i=1}^{n}d_i$, the optimum solution for the total power case is always at least as good as that of the per-antenna case. Indeed, in Figure 1, the $\lambda_1(\mathbf{A}\mathbf{U})$ (i.e., $\lambda_{\min}(\mathbf{A}\mathbf{U})$) for the total power case is greater than $\rho$, an upper bound for the $\lambda_1(\mathbf{A}\mathbf{U})$'s of the per-antenna case. Secondly, the R0A, R1A, and PA all result in optimum $\mathbf{F}$'s here; $\lambda_1(\mathbf{A}\mathbf{U})$ for each of the three approaches is numerically equal to $\rho$. Thirdly, the eigenvalues are all equal for the total power case (as is always the case; see [1]). However, the eigenvalues for any of the approaches subject to the per-antenna constraint are clearly not all equal.

In the second numerical example, the R0A's solution does not satisfy (13). Thus, the fix in Section 5 is applied to it. The eigenvalues and the $\rho$ resulting from the fix (i.e., $d_k'/b_{kk}$) are plotted in Figure 2. The original $\rho$ and the optimum result under the total power constraint $P=100$ are also plotted for reference. As it is supposed to be, $\lambda_1(\mathbf{A}\mathbf{U})$ for the fixed solution is numerically equal to the lowered $\rho$. The resulting $d_k'$ is approximately equal to 4.4138.

6.2. Investigation into the Effectiveness of the Proposed Design Methodology for F

In this subsection, we use Monte Carlo simulation to assess the effectiveness of the proposed approach and to show how far our suboptimum solution is from the optimum solution. We did not prove that the upper bound in (8) is always achievable. How then do we get the optimum solution needed for this comparison? For a given $\mathbf{R}_{\mathbf{nn}}$, $P$, and $\mathbf{H}$, we obtain it by the following methodology. We do not specify the per-antenna power constraints $\{d_i\}$ at the beginning. Instead, we calculate the closed-form precoder in (5) and set its antenna powers as the $\{d_i\}$. In other words, if $\mathbf{F}_o$ denotes the precoder from (5), we set $d_i=\mathbf{e}_i^{*}(\mathbf{F}_o\mathbf{F}_o^{*})\mathbf{e}_i$, for all $i$. For this resulting problem, Appendix D proves that the precoder from (5) is an optimum solution to (4). Moreover, it achieves the upper bound in (8). With an $\mathbf{R}_{\mathbf{nn}}$, $\mathbf{H}$, and $\{d_i\}$ in hand for which we know the upper bound $\rho$ is achievable, we can run our algorithm to get $\mathbf{F}$ and analyze its performance.

More specifically, this simulation is run as follows. The noise covariance matrix $\mathbf{R}_{\mathbf{nn}}$ is set equal to $\sigma^{2}\mathbf{I}$. The transmit signal-to-noise ratio (SNR) is defined as $10\log_{10}(P/\sigma^{2})$. Transmit SNRs of 0, 6, 12, and 18 dB are run. Both 4 and 8 antenna scenarios ($n=4,8$) are run. For each transmit SNR and $n$, 1000 $\mathbf{H}$'s are randomly generated; the elements of $\mathbf{H}$ are independent identically distributed $CN(0,1)$ random variables. After the $\{d_i\}$ are determined by the total power closed-form solution (5), the R0A and, if necessary, the fix are run.
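One trial of this setup can be generated as follows. This is only a sketch: the function name `benchmark_setup`, the value $P=1$, and the random seed are assumptions, since the text does not state the total power used in the Monte Carlo runs. The antenna powers follow from (D.2) in Appendix D, using $\operatorname{tr}(\boldsymbol{\Lambda}^{-1})=\operatorname{tr}(\mathbf{B})$.

```python
import numpy as np

def benchmark_setup(n, snr_db, P, rng):
    """Draw H with i.i.d. CN(0,1) entries and derive {d_i} from the solution (5)."""
    sigma2 = P / 10 ** (snr_db / 10)                  # transmit SNR = 10 log10(P / sigma^2)
    H = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
    A = H.conj().T @ H / sigma2                       # A = H^* R_nn^{-1} H with R_nn = sigma^2 I
    b = np.real(np.diag(np.linalg.inv(A)))            # diagonal of B = A^{-1}
    d = b * P / b.sum()                               # d_i = b_ii P / tr(Lambda^{-1})
    rho = P / b.sum()                                 # achievable upper bound, see (D.3)
    return H, sigma2, A, d, rho

# Illustrative call (assumed values): n = 4 antennas, 6 dB transmit SNR, P = 1.
H, sigma2, A, d, rho = benchmark_setup(4, 6.0, 1.0, np.random.default_rng(0))
```

By construction the powers sum to $P$ and every ratio $d_i/b_{ii}$ equals $\rho$, so the index $k$ in (7) is not unique for these benchmark constraints.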

First, consider the 8 antenna case ($n=8$). For all the transmit SNRs and all the randomly generated $\mathbf{H}$, the R0A does not achieve the upper bound in (8) and the fix in Section 5 is employed. The top panel of Figure 3 shows the histogram of the ratio between the new $d_k'$ (after the fix) and $d_k$ for the transmit SNR of 6 dB. Since $d_k'$ is smaller than $d_k$, the total power of the proposed $\mathbf{F}$ is smaller than the maximum total power allowed, $P$. The bottom panel of Figure 3 thus shows the histogram of the ratio between the total power of the proposed $\mathbf{F}$ and $P$. The histograms for the other transmit SNRs are not shown since they are very similar to Figure 3. Recall that if the fix is used, the obtained cost function value is $d_k'/b_{kk}$. Since the optimum solution obtains the cost function value $\rho=d_k/b_{kk}$, the top histogram of Figure 3 also shows how suboptimal the proposed algorithm is.

The optimal solution in (5) is better than the proposed solution with respect to the cost function in (4). However, the proposed solution has a much larger mutual information than the optimal solution (see Figure 4). According to [1], the mutual information for an $\mathbf{F}$ is
$$\log_2\left|\mathbf{I}+\mathbf{H}\mathbf{F}\mathbf{F}^{*}\mathbf{H}^{*}\mathbf{R}_{\mathbf{nn}}^{-1}\right| \tag{26}$$
when the MMSE decoder is used, as is done here. The reason for the observed larger mutual information is as follows. The optimum solution in (5) diagonalizes the equivalent channel matrix $\mathbf{G}\mathbf{H}\mathbf{F}$ and equalizes all eigen-channels so that the resulting SINRs for all data streams are the same and equal to $\rho$ (see Appendix D and [1]). But, for the proposed solution, the SINRs of the data streams are not all the same and, moreover, most of them are larger than $\rho$ (see Figure 5). The SINR for the $i$th stream when $\mathbf{F}=[\mathbf{f}_1\ \cdots\ \mathbf{f}_n]$ and $\mathbf{G}=[\mathbf{g}_1^{T}\ \cdots\ \mathbf{g}_n^{T}]^{T}$ are used is simply
$$\frac{\left|\mathbf{g}_i\mathbf{H}\mathbf{f}_i\right|^{2}}{\mathbf{g}_i\mathbf{R}_{\mathbf{nn}}\mathbf{g}_i^{*}+\sum_{j\ne i}\left|\mathbf{g}_i\mathbf{H}\mathbf{f}_j\right|^{2}}. \tag{27}$$
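The two performance measures (26) and (27) are straightforward to evaluate. The sketch below uses hypothetical function names and an assumed random channel with identity noise covariance, and applies them to the total power solution (5), for which every stream SINR should equal $\rho$.

```python
import numpy as np

def mmse_decoder(H, F, Rnn):
    """G = F^* H^* (H F F^* H^* + R_nn)^{-1}."""
    HF = H @ F
    return HF.conj().T @ np.linalg.inv(HF @ HF.conj().T + Rnn)

def mutual_information(H, F, Rnn):
    """Equation (26), in bits."""
    HF = H @ F
    M = np.eye(H.shape[0]) + HF @ HF.conj().T @ np.linalg.inv(Rnn)
    return np.linalg.slogdet(M)[1] / np.log(2)

def stream_sinr(H, F, G, Rnn):
    """Equation (27) for every stream i; row i of G is g_i, column i of F is f_i."""
    E = G @ H @ F
    signal = np.abs(np.diag(E)) ** 2
    interference = np.sum(np.abs(E) ** 2, axis=1) - signal
    noise = np.real(np.einsum("ij,jk,ik->i", G, Rnn, G.conj()))   # g_i R_nn g_i^*
    return signal / (noise + interference)

# Illustrative data (assumed): random channel, R_nn = I, total power solution (5).
rng = np.random.default_rng(0)
n, P = 4, 4.0
H = (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)
Rnn = np.eye(n)
lam, V = np.linalg.eigh(H.conj().T @ H)
rho = P / np.sum(1.0 / lam)
F = V @ np.diag(lam ** -0.5) * np.sqrt(rho)

G = mmse_decoder(H, F, Rnn)
sinrs = stream_sinr(H, F, G, Rnn)         # all equal to rho for this F
mi = mutual_information(H, F, Rnn)        # equals n * log2(1 + rho) for this F
```

For a precoder obtained from the proposed approaches, the same functions would return unequal SINRs, which is the behavior discussed around Figure 5.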

Figure 5 shows the normalized SINRs for the 8 data streams of the proposed solution for a sampling of the channel realizations. The transmit SNR is 6 dB. The figures for the other channel realizations and transmit SNRs are not shown since they are very similar to Figure 5. For a given channel realization, the normalization factor is the same for all 8 data streams. It is $\rho$, the SINR of every data stream when the optimum solution is used (see Appendix D). For the channel realizations shown in Figure 5, only one normalized SINR (out of 8) is less than 1. Thus, only one data stream for the proposed solution has a lower SINR than the SINR of the 8 data streams for the optimal solution. This is also roughly the case for the other channel realizations.

Note that the optimal solution may have a smaller symbol error rate than the suboptimum solution if an ML receiver is used (according to the logic in Section 2.1). If the ML receiver is not employed, the optimal solution may not have any advantage over the proposed solution. For practical implementations, appropriate modulation and coding schemes can be selected to maximize the throughput when the precoder derived from the low complexity proposed algorithm is employed.

Section 5 stated that numerical results showed that continuing to lower $d_k'$ actually raises the minimum eigenvalue of $\mathbf{A}\mathbf{U}_\rho$. Figure 6 shows a typical plot of the results of continuing to lower $d_k'$ for a channel realization. At each iteration, $d_k'$ is lowered by one percent. In the figure, $\rho=d_k'/b_{kk}$ and $\mathbf{U}_\rho$ is given by (23) in accordance with the notation in Section 5. Indeed, the minimum eigenvalue of $\mathbf{A}\mathbf{U}_\rho$ increases as $d_k'$, and thus $\rho=d_k'/b_{kk}$, is lowered. In fact, the increase is monotonic (the value for an iteration is greater than or equal to that of the previous iteration) in all the simulations, with one exception: in some channels (e.g., the one for Figure 6), $\lambda_1(\mathbf{A}\mathbf{U}_\rho)$ for the last iteration may be smaller than $\lambda_1(\mathbf{A}\mathbf{U}_\rho)$ for the second to last iteration. This may be due to overshooting since the step size for $d_k'$ is fixed.

Next, consider the 4 antenna case ($n=4$). Figure 7 shows the histogram of the ratio between $d_k'$ (after the fix) and $d_k$. It also shows the histogram of the ratio between the total power of the proposed $\mathbf{F}$ and $P$. Figure 8 shows the mutual information for the optimum precoder and the proposed precoder. Figure 9 shows the proposed solution's normalized SINRs for the 4 data streams for some channel realizations. For a given channel realization, the normalization factor is $\rho$, the SINR of the 4 data streams when the optimum precoder in (5) is used.

Basically, the observations made for the eight-antenna example are still applicable to the four-antenna example. Some notable differences and points are as follows. Firstly, the histogram of $d_k'/d_k$ shifts to the right when going from $n=8$ to $n=4$. Secondly, the mutual information is smaller in the $n=4$ case. The transmit SNR gap between the two approaches is also smaller here. Lastly, for the $n=4$ case, the normalized SINRs suffer some degradation as the transmit SNR increases. For the lower transmit SNRs, there are roughly 3 data streams with normalized SINRs above 1 and 1 data stream with normalized SINR below 1. For the higher transmit SNRs, there are roughly 2 data streams with normalized SINRs above 1 and 2 data streams with normalized SINRs below 1.

7. Conclusion

Considered here is the approximate minimum symbol error probability transceiver design subject to the practical per-antenna power constraint. The metric to be maximized is a lower bound for the minimum distance of the symbol hypotheses. As in [1], the bound used is the minimum eigenvalue of a positive definite system matrix involving the precoder matrix, noise covariance matrix, and channel matrix. This max min problem is both interesting and challenging because the differentiation of the minimum eigenvalue cannot be performed explicitly. Remarkably, we are able to develop approaches to solve the design problem without using differentiation of eigenvalues or the popular method of Lagrange multipliers.

First, the upper bound for the cost function is established using the special structure of the system matrix and the power constraint. Then, necessary conditions and structures of the transmit covariance matrix for reaching the upper bound are obtained. Based on these necessary conditions and structures, three numerical approaches (rank zero, rank one and permutation) for obtaining the optimum precoder are developed. Since the upper bound is not always achieved, a possibly suboptimum fix is also given to be used, when necessary, after the proposed approaches.

In the total power constraint case, the eigenvalues of the optimum solution in [1] were always equal. Interestingly, the numerical results here show that this is not always the case for the per-antenna power constraint. Extensive numerical studies have been performed to assess the performances of the proposed methodology. Although the upper bound is not reached in most cases, good performances in mutual information and SINR are achieved. Moreover, the approach is very simple and can be of practical use.

Appendices

A.

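Consider the $(n-1)\times(n-1)$ matrix $\mathbf{Y}=\gamma\mathbf{I}_{n-1}+\mathbf{X}$ where $\mathbf{X}\ge\mathbf{0}$. For any $\mathbf{X}\ge\mathbf{0}$, $\mathbf{Y}$ is Hermitian and has eigenvalues lower bounded by $\gamma$. So, as long as
$$\operatorname{diag}(\mathbf{X})=\begin{bmatrix}q_{11}-\gamma, & q_{22}-\gamma, & \ldots, & q_{n-1,n-1}-\gamma\end{bmatrix}, \tag{A.1}$$
$\mathbf{Y}$ is a valid choice of $\mathbf{Q}$.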

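Finding an $\mathbf{X}\ge\mathbf{0}$ that satisfies (A.1) is not difficult. First, choose any $(n-1)\times(n-1)$ matrix $\mathbf{Z}\ge\mathbf{0}$ with positive diagonal entries $\{z_{ii}\}$. Then, form
$$\mathbf{R}\equiv\operatorname{diag}\left(\begin{bmatrix}\sqrt{\dfrac{q_{11}-\gamma}{z_{11}}}, & \sqrt{\dfrac{q_{22}-\gamma}{z_{22}}}, & \ldots, & \sqrt{\dfrac{q_{n-1,n-1}-\gamma}{z_{n-1,n-1}}}\end{bmatrix}\right). \tag{A.2}$$
The product $\mathbf{R}\mathbf{Z}\mathbf{R}$ is a valid choice of $\mathbf{X}$. Consequently, $\mathbf{Y}=\gamma\mathbf{I}_{n-1}+\mathbf{R}\mathbf{Z}\mathbf{R}$ is a valid choice of $\mathbf{Q}$ and can be used in the PA.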
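The closed form construction takes only a few lines. In the sketch below, the function name `closed_form_Q` and the default choice of $\mathbf{Z}$ (the all-ones matrix, which is positive semidefinite with positive diagonal) are assumptions made for illustration; any other valid $\mathbf{Z}$ can be passed in.

```python
import numpy as np

def closed_form_Q(q, gamma, Z=None):
    """Q = gamma I + R Z R with R from (A.2); requires q_ii >= gamma for all i."""
    q = np.asarray(q, dtype=float)
    m = q.size
    if Z is None:
        Z = np.ones((m, m))                     # assumed default: rank one PSD matrix
    r = np.sqrt((q - gamma) / np.real(np.diag(Z)))
    X = (r[:, None] * Z) * r[None, :]           # R Z R with R = diag(r)
    return gamma * np.eye(m) + X

# Illustrative values (assumed): three remaining antenna powers and gamma = 1.
q = np.array([3.0, 4.0, 5.0])
gamma = 1.0
Q = closed_form_Q(q, gamma)
eig_min = np.linalg.eigvalsh(Q).min()           # not below gamma, up to rounding
```

Different choices of $\mathbf{Z}$ give different off-diagonal entries of $\mathbf{Q}$, and hence different candidates $\mathbf{U}_\rho$ in (23).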

B.

The projection approach finds a $\mathbf{Q}$ as follows. It begins by randomly generating an $(n-1)\times(n-1)$ unitary matrix $\mathbf{V}^{(0)}$ and $n-1$ real numbers, $\eta_1\ge\cdots\ge\eta_{n-1}\ge\gamma$. It then creates the initial guess at $\mathbf{Q}$:
$$\mathbf{Q}^{(0)}=\mathbf{V}^{(0)}\cdot\operatorname{diag}\left(\begin{bmatrix}\eta_1,\ldots,\eta_{n-1}\end{bmatrix}\right)\cdot\left(\mathbf{V}^{(0)}\right)^{*}. \tag{B.1}$$
At the $j$th iteration step ($j\ge 1$), it proceeds as follows.
(i) Force the diagonal elements of $\mathbf{Q}^{(j-1)}$ to be equal to $\{q_{ii}\}$.
(ii) Then, decompose $\mathbf{Q}^{(j-1)}$ as
$$\mathbf{Q}^{(j-1)}=\mathbf{V}^{(j-1)}\cdot\operatorname{diag}\left(\begin{bmatrix}\xi_1,\ldots,\xi_{n-1}\end{bmatrix}\right)\cdot\left(\mathbf{V}^{(j-1)}\right)^{*} \tag{B.2}$$
to get the unitary matrix $\mathbf{V}^{(j-1)}$ and the real numbers $\xi_1\ge\cdots\ge\xi_{n-1}$.
(iii) If $\xi_{n-1}\ge\gamma$, set $\mathbf{Q}$ equal to $\mathbf{Q}^{(j-1)}$ and stop iterating. If not, let $\xi_i=\max(\xi_i,\gamma)$, for all $i$, and create
$$\mathbf{Q}^{(j)}=\mathbf{V}^{(j-1)}\cdot\operatorname{diag}\left(\begin{bmatrix}\xi_1,\ldots,\xi_{n-1}\end{bmatrix}\right)\cdot\left(\mathbf{V}^{(j-1)}\right)^{*}. \tag{B.3}$$
(iv) Move on to the $(j+1)$th iteration step.

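A $\mathbf{Q}$ is always found. That is, the above iteration always converges. The reason is as follows. The iteration simply projects between two closed, convex subsets of the Hilbert space $\mathbb{C}^{n\times n}$:
$$\left\{\mathbf{T}\in\mathbb{C}^{n\times n}\mid\operatorname{diag}(\mathbf{T})=\begin{bmatrix}q_{11} & \cdots & q_{n-1,n-1}\end{bmatrix},\ \mathbf{T}=\mathbf{T}^{*}\right\}, \tag{B.4}$$
$$\left\{\mathbf{V}\boldsymbol{\Theta}\mathbf{V}^{*}\mid\mathbf{V}^{-1}=\mathbf{V}^{*}\in\mathbb{C}^{n\times n},\ \boldsymbol{\Theta}=\operatorname{diag}\left(\begin{bmatrix}\theta_1,\ldots,\theta_n\end{bmatrix}\right)\in\mathbb{R}^{n\times n},\ \theta_i\ge\gamma\right\}. \tag{B.5}$$
As the intersection of the two subsets is nonempty (by the Schur-Horn Theorem [8]), the iteration will converge [9].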
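Steps (i)-(iv) can be sketched as an alternating projection loop. This is an illustrative sketch only: the function name `projection_Q`, the random initialization details, the small stopping tolerance, and the iteration cap are assumptions added so that the loop terminates in floating point arithmetic.

```python
import numpy as np

def projection_Q(q, gamma, rng, tol=1e-10, max_iter=10000):
    """Alternate between 'diagonal equals q' and 'eigenvalues >= gamma'."""
    q = np.asarray(q, dtype=float)
    m = q.size
    # Initial guess (B.1): random unitary V (via QR) and eigenvalues eta_i >= gamma.
    G = rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m))
    V, _ = np.linalg.qr(G)
    eta = gamma + rng.random(m)
    Q = (V * eta) @ V.conj().T
    for _ in range(max_iter):
        Q[np.diag_indices(m)] = q                    # step (i)
        xi, V = np.linalg.eigh(Q)                    # step (ii), xi in ascending order
        if xi.min() >= gamma - tol:                  # step (iii): stop
            return Q
        Q = (V * np.maximum(xi, gamma)) @ V.conj().T # step (iii): clip, rebuild (B.3)
    raise RuntimeError("projection iteration did not reach the tolerance")

# Illustrative values (assumed): three remaining antenna powers and gamma = 1.
q = np.array([3.0, 4.0, 5.0])
gamma = 1.0
Q = projection_Q(q, gamma, np.random.default_rng(0))
eig_min = np.linalg.eigvalsh(Q).min()
```

Unlike the closed form of Appendix A, the matrix returned here depends on the random starting point, so repeated calls with different seeds give different valid choices of $\mathbf{Q}$.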

C.

Assume that (10)–(13) hold but not (8). It will be proved here that (8) can be satisfied as well by simply lowering $d_k$ and thus $\rho$ (maintaining $d_k,\rho>0$). The proof can be split into four parts. The first part is to realize that (10) implies that $\rho$ is an eigenvalue of $\mathbf{A}\mathbf{U}_\rho$ (use (9)). The second part is to find an $(n-1)\times(n-1)$ matrix whose eigenvalues are precisely the other $n-1$ eigenvalues of $\mathbf{A}\mathbf{U}_\rho$. To this end, introduce the partition
$$\mathbf{A}=\begin{bmatrix}\mathbf{C}_{11} & \mathbf{c}_{12} & \mathbf{C}_{13}\\ \mathbf{c}_{12}^{*} & c_{22} & \mathbf{c}_{23}\\ \mathbf{C}_{13}^{*} & \mathbf{c}_{23}^{*} & \mathbf{C}_{33}\end{bmatrix}, \tag{C.1}$$
where $c_{22}$ is the $(k,k)$th element of $\mathbf{A}$. In addition, note that $\mathbf{A}$ times the $k$th column of $\mathbf{U}_\rho$ is equal to $\rho\mathbf{e}_k$ due to $\mathbf{B}=\mathbf{A}^{-1}$. With some straightforward steps, it can thus be seen that
$$\mathbf{M}\left(d_k\right)=\begin{bmatrix}\mathbf{C}_{11} & \mathbf{C}_{13}\\ \mathbf{C}_{13}^{*} & \mathbf{C}_{33}\end{bmatrix}\begin{bmatrix}\boldsymbol{\Sigma}_1 & \mathbf{L}\\ \mathbf{L}^{*} & \boldsymbol{\Sigma}_2\end{bmatrix}+\frac{d_k}{b_{kk}}\begin{bmatrix}\mathbf{c}_{12}\mathbf{b}_1^{*} & \mathbf{c}_{12}\mathbf{b}_2^{*}\\ \mathbf{c}_{23}^{*}\mathbf{b}_1^{*} & \mathbf{c}_{23}^{*}\mathbf{b}_2^{*}\end{bmatrix} \tag{C.2}$$
is a suitable candidate matrix. The third part is to realize that the eigenvalues of
$$\begin{bmatrix}\mathbf{C}_{11} & \mathbf{C}_{13}\\ \mathbf{C}_{13}^{*} & \mathbf{C}_{33}\end{bmatrix}\begin{bmatrix}\boldsymbol{\Sigma}_1 & \mathbf{L}\\ \mathbf{L}^{*} & \boldsymbol{\Sigma}_2\end{bmatrix}, \tag{C.3}$$
the first term in $\mathbf{M}(d_k)$, are positive and independent of $d_k$. To this end, note that (C.3) is the product of two positive definite matrices. Then, note that the eigenvalues of such a product must be positive [10]. Finally, using the fact that "the eigenvalues of a square…complex matrix depend continuously upon its entries" ([11]: Appendix D) and a limiting argument, the last part is to realize that one can lower $d_k$ and thus $\rho$ (maintaining $d_k,\rho>0$) until all the eigenvalues of $\mathbf{M}(d_k)$ are $\ge\rho$.

D.

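Let $\mathbf{F}_o$ denote the precoder from (5). As it is an optimal solution to the optimization problem in (5), $\operatorname{tr}\{\mathbf{F}_o\mathbf{F}_o^{*}\}=P$. But what are its antenna powers? It turns out that the $i$th antenna has power $b_{ii}P/\operatorname{tr}(\boldsymbol{\Lambda}^{-1})$ because
$$\mathbf{F}_o\mathbf{F}_o^{*}=\mathbf{V}\boldsymbol{\Lambda}^{-1/2}\boldsymbol{\Lambda}^{-1/2}\mathbf{V}^{*}\left(\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}\right)=\mathbf{B}\left(\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}\right). \tag{D.1}$$
Here, we used the fact that $\mathbf{B}=\mathbf{A}^{-1}$ and $\mathbf{A}=\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^{*}$.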

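Suppose that, for all $i$, we set $d_i$ equal to the $i$th antenna power of $\mathbf{F}_o$:
$$d_i=\frac{b_{ii}P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}. \tag{D.2}$$
Then from (7) and (D.2), we have the upper bound
$$\rho=\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}. \tag{D.3}$$
It turns out that $\mathbf{F}_o$ reaches this upper bound. Direct computation shows this. From (5), we have
$$\mathbf{F}_o^{*}\mathbf{A}\mathbf{F}_o=\boldsymbol{\Lambda}^{-1/2}\mathbf{V}^{*}\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^{*}\mathbf{V}\boldsymbol{\Lambda}^{-1/2}\left(\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}\right)=\mathbf{I}\left(\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}\right). \tag{D.4}$$
From (D.3) and (D.4), we have
$$\lambda_{\min}\left(\mathbf{F}_o^{*}\mathbf{A}\mathbf{F}_o\right)=\frac{P}{\operatorname{tr}\left(\boldsymbol{\Lambda}^{-1}\right)}=\rho. \tag{D.5}$$
In summary, if the $d_1,\ldots,d_n$ of the per-antenna power constraint (1) are defined using the antenna powers of $\mathbf{F}_o$, then $\mathbf{F}_o$ reaches the upper bound $\rho$ and is thus an optimum solution to (4).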

Disclosure

A part of this manuscript appears in Enoch Lu's dissertation, submitted to the Faculty of the Polytechnic Institute of New York University in partial fulfillment of the requirements for the degree Doctor of Philosophy (Electrical Engineering), January 2012.

Acknowledgment

The authors would like to express their gratitude to Professor Dante Youla for his comments.