Abstract

We consider the optimal precoder design with the assumption that the transmitter only has channel covariance information, for the multi-input multi-output (MIMO) information and energy transmission system. The objective of the system design is to maximize the average system information rate, meanwhile meeting the minimum energy requirement of the energy receiver. Following this objective, we formulate the problem as a semidefinite programming (SDP) and further transform it into a dual problem. Two methods are proposed to solve this problem: the first method decomposes the transmission covariance as a product of precoders so that the constrained optimization becomes an unconstrained one, whereas the second method derives the structure of the optimal transmission covariance analytically. Both methods are proved to be convergent and their overheads and complexity are also analyzed. The achievable rate-energy (R-E) regions for the proposed methods are presented in the simulation. Under various system settings, the superiority of the proposed methods is shown by comparing with a few existing transmission schemes.

1. Introduction

The rapidly development of wireless sensor networks has created many applications such as environmental monitoring, telemedicine system, and intelligent house system [1, 2]. However, the main bottleneck of these applications is the energy limitation of mobile equipment [35]. Energy harvesting is a promising technique to deal with this problem, prolonging the lifetime of the network, and hence has attracted a great deal of attention [69].

Nowadays, the techniques for wireless power transfer (WPT) can be divided into three categories [7]: near-field power transfer; far-field directive power beaming; far-field power transfer based on radio frequency (RF). Note that low-power RF signals are ambient and can be harvested by receivers from remote transmitters such as base stations and free WiFi hotspots. Further, since RF signals carrying energy can also be used for information transmission, a novel research direction, simultaneous wireless information and power transfer (WIPT), appears. It is clear that, compared with the former two techniques, RF-based WPT is more suitable for simultaneous wireless information and power transfer (WIPT). In fact, most researches on WIPT are based on the third kind. Therefore, this article also applies RF signals in energy scavenging.

In general, the researches on WIPT were gone from simpleness to complication and from single-input single-output (SISO) systems to multiple-antenna systems [1034]. For the SISO WIPT system, the problems on the trade-off between the rate and power, the circuit design, the transmission power allocation, and so on, were thoroughly investigated [1023].

The technical papers [10, 11] are two pioneer researches on how to transmit energy and information jointly. In [10], the concept of the achievable rate-energy (R-E) region was first proposed and a few R-E expressions were derived under some discrete channels. In [11], the authors considered the WIPT over a frequency-selective channel with Gaussian noise, providing the solutions of power allocation to the discrete and continuous versions, respectively. The authors in [12] studied the designs of both the information and energy receiver over time-varying fading channels. They presented several practical circuits with different multiplexing methods. In [15], the authors studied an amplify-and-forward (AF) relaying system, in which the energy of the received RF signal is harvested by the relay and then used for the information exchange between the source and destination. Two relay protocols were proposed to facilitate the WIPT of the relay. There were also a few researches that considered the multiuser access problem, wireless resource management, or other issues [1820].

For the WIPT system with multiple antennas, the issues on the design of transmission covariance, the achievable R-E region, the system performance, and others, were addressed in [2134]. For instance, in [21], a robust beamforming scheme was designed for the multiple-input single-output (MISO) WIPT system. This article makes an assumption that channel state information (CSI) obtained at the transmitter is not perfect and the design is based on the criterion in which the worst-case harvested energy for the energy receiver is maximized while guaranteeing that the minimum information rate for the information receiver. Utilizing a similar system model, in [24], aiming at maximizing the harvesting energy, the authors proposed an adaptive energy beamforming method according to the instantaneous CSI. In addition, the relationship between transmit power, transfer duration, and limited feedback amount was also presented. In [26], a unified study on MIMO WIPT systems was conducted, in which two practical designs were proposed in the case of the colocated receiver and their respective achievable R-E regions were also characterized. In [27], the joint two sources and relay beamforming design problem was studied for orthogonal space-time block code (OSTBC) based AF relay systems. In [29], a low-complexity technique of antenna switching for WIPT MIMO relay systems was proposed to minimize the system outage probability.

Usually, in the above researches, channel state information (CSI) at the transmitter is indispensable for the design of transmission scheme, power control, and others issues. Some of them assumed that the transmitter is able to know complete instantaneous CSI. With such an assumption, the authors in [11, 26] presented the achievable R-E regions for SISO systems and MIMO systems, respectively. Whereas there were also a few studies that considered that, in practice, it is quite difficult to obtain perfect CSI, due to Doppler shift, channel noise, or other factors, and hence assumed that only incomplete instantaneous CSI can be accessed by the transmitter. For instance, the authors in [21] designed a robust beamformer for the MIMO WIPT broadcasting system, with the assumption that the transmitter only has the incomplete CSI. In the multiantenna WIPT system, the authors in [24] analyzed the performance of energy beamforming to maximize the energy efficiency with the limited CSI feedback. Besides, a robust beamformer was designed by the use of the hybrid CSI, including the statistical CSI and instantaneous CSI [31].

It is clear that most existing researches use the instantaneous CSI at the transmitter to improve the system performance, including the cases of perfect CSI and imperfect CSI. It is suitable for the feedback of instantaneous CSI in slow fading channels. However, in the case of fast fading channel, frequent CSI exchange will inevitably bring much burden on the communication system. Under such a scenario, the scheme of feeding back the instantaneous CSI is no longer appropriate. On the other front, in general, the channel statistical information does not change or changes very slowly, feeding back which can reduce the amount of CSI exchange significantly. It is no doubt that studying the precoder design with channel statistical information feedback makes sense. In fact, the optimal transmission covariance matrices with only statistical CSI feedback were thoroughly investigated in point-to-point MIMO systems or interference systems [3237]. However, these cannot be applied to MIMO WIPT systems directly. To the best of our knowledge, the precoder with only statistical CSI feedback in WIPT systems has not been studied yet.

Therefore, this article focuses on the scenario in which the transmitter can only have the statistical CSI. Instead of using the instantaneous information rate as the optimization objective, the objective function becomes the expected system information rate. Consequently, the optimal precoder is designed under two constraints which include maximizing the expected information rate and meeting the minimum energy requirement. The contributions of this article are briefly summarized as follows:(i)Unlike most existing works, we design the optimal precoder by only use of statistical CSI, channel covariance feedback. Two precoding methods are proposed: one is a numerical method by employing the classical gradient-decent algorithm; the other one is an analytical method by use of the analytical expression of the optimal precoder. In the analytical method, we present the close-form expression of the optimal transmission covariance matrix via a proposition, as a function of the number of transmitter antennas, channel covariance matrices, and so on.(ii)We analyze the complexity and overheads of the proposed methods. A few classical existing works are also analyzed and compared, including time-switching method [26], isotropic transmission, and the precoder with hybrid CSI [31]. To the best of our knowledge, such work has not been done in existing literature. Furthermore, under various system settings, we compare the system performance of the proposed methods with that of these existing methods.(iii)Besides, the boundary of R-E region is characterized, analytically resenting two important points of the boundary.

The rest of this paper is organized as follows. Section 2 describes the WIPT MIMO system model. Section 3 formulates the problem and proposes two precoding methods. Section 4 analyzes the complexity and overheads of the proposed methods. Simulation results are presented in Section 5, with concluding remarks in Section 6.

Notations. Vectors are denoted by boldface lowercase letters and matrices are denoted by boldface uppercase letters. stands for the statistical expectation; stands for the sets of complex matrices. For a vector , means that follows a complex Gaussian distribution with mean and covariance matrix ; for a matrix , the notations , , , and denote its square root, trace, Hermitian transpose, and conjugate, respectively; we write to mean that is positive semidefinite; besides, is an identity matrix.

2. System Model

With reference to Figure 1, this paper considers a three-node MIMO WIPT system, which includes a transmitter , an energy receiver , and an information receiver . The numbers of antennas for , , and are , , and , respectively. There are two kinds of links: one is the broadcast link for data or power transmission; the other is the feedback link for statistical CSI feedback [3841]. The signal , sent from the transmitter, has independent data streams with its autocorrelation matrix . Before transmission, we multiply the signal by a precoder matrix , and after passing through flat-fading channels, the received signals of the information receiver and energy receiver are given by where and are flatting MIMO channels from the transmitter to the information receiver and energy receiver, respectively; and are i.i.d Gaussian noise vectors following and , respectively. In the followings, for the sake of convenience, we refer to and as the information channel and energy channel, respectively.

The channel matrices and can be further modeled asin which and are channel covariance matrices for the information channel at the transmitter end and receiver end, respectively; and are channel covariance matrices for the energy channel at the transmitter end and receiver end, respectively. and are random matrices having i.i.d. zero-mean and unit variance complex Gaussian random variables, representing the uncorrelated scattering.

After measuring channels for a period of time, the transmitter acquires channel statistical information, including , , , and . Using the information, the optimal precoder is designed to maximize the system information transmission rate, meanwhile satisfying the minimum energy requirement. Clearly, the scheme with statistical CSI feedback can reduce the overheads, compared to the scheme with instantaneous CSI feedback. However, it may bring the loss of information rate or energy.

Finally, for the sake of performance evaluation, we define the system signal-to-noise-ratio (SNR). Since the autocorrelation of the user signal is an identity matrix, the transmission power is expressed aswhere is the maximum transmission power. Obviously, is the transmission covariance matrix after precoding. With (5), the system SNR is defined aswhere is the noise power of the information channel . Without loss of generality, we assume that .

3. The Proposed Precoding Methods with Covariance Feedback

In this section, we first formulate the precoding problem as a SDP and then transform it into a dual problem by introducing some auxiliary variables and constructing a dual function. Based on these, two methods are proposed to solve the dual problem. Their differences mainly lie on how to solve the dual function: one does it by the classical numerical searching, whereas the other does it by exploiting the structure of the covariance matrix.

3.1. Problem Formulation

First, with (1), the information transmission rate is given bywhere the matrix , defined by , is the covariance matrix of the transmitted signal.

Then, with (2), the received energy per unit time can be written asWith (4), (8) can be further expressed as [31]where the notation is the energy transfer efficiency. Without loss of generality, we set it to 1.

Finally, we try to find the optimal that maximizes the expected information rate, meanwhile meeting the minimum energy requirement of the energy receiver, . The problem, termed as P1, can be summarized as follows:Notice that unreasonable may make the problem P1 not feasible, and hence we present the following lemma about the range of (see Appendix A).

Lemma 1. When , the problem P1 is not feasible, where is the maximum eigenvalue of ; when , the energy constraint of P1 is inactive, where is the solution of the following problem :Note that is a transmission optimization problem for the transmitter only has channel covariance information. Similar problems were well investigated and solved in the past years [3337]. For instance, in [33], the optimal transmission covariance matrices for two cases were presented, including channel covariance and mean feedback of rank one. In [34], the precoding methods for MIMO interference channels with covariance feedback were presented. In fact, can be solved by the proposed method in [33]. In addition, similar to [26], we also use the rate-energy (R-E) region to describe all achievable rate and energy pairs, which is defined as follows:Clearly, given , the rate solution of is the vertical ordinate of some boundary point of the - region.

Remark. Recently, there are practical nonlinear models being proposed [42]. As pointed in [42], for low power, the harvested RF power increases with increasing input power; however, there are limitations on the maximum possible harvested energy. In fact, from Figure 2 of [42], we find that the nonlinearity mainly occurs in high input power region and the use of the linear energy harvesting (EH) model in low-power region is still suitable. If there is no light-of-sight (LOS) path or the distance is long enough between the transmitter and the receiver, the assumption that the received power is in low region is likely to be reasonable. Hence, assuming the received power is low, the conventional linear EH model is adopted in this article.

3.2. The Proposed First Precoding Method

In this section, a numerical algorithm is proposed to solve P1 and the optimal can be obtained subsequently. Obviously, once acquiring the optimal , the optimal precoder can be readily solved by taking the square root of .

First, the Lagrangian of P1 is given bywhere and are two introduced auxiliary variables. Observe that the Lagrangian combines the objective function with the transmission power and energy constraints of P1. By doing so, the number of the constraints on the covariance matrix is reduced. The aim of introducing the Lagrangian is to transform the original problem P1 into an easier problem. With (13), a dual function is defined asConsequently, the variable does not exist in this function.

With (14), the dual problem of P1 can be formulated as (P2)Clearly, P2 only involves two introduced variables and and it is much simpler than P1. It is easy to verify the Slater’s condition is satisfied (see Appendix B). Therefore, there is no dual gap between P1 and P2. In other words, the two problems are equivalent.

Next, we consider how to solve P2. The problem-solving trait is composed of two steps: given , obtain the optimal Q and ; search over the two-dimension space of and find the minimum value of . To do those, the objective function needs further arrangement. With (13) and (14), we have [20]where , . Note that the matrix must be semidefinite, otherwise will go to infinity. This implies that , where is the largest eigenvalue of .

Since is semidefinite and hence can be decomposed as , where is the precoder. Replacing with , the semidefinite constraint of will disappear. Hence, (16) can be further written asDifferentiating with respect to , we haveObserve that solving is an unconstrained optimization problem. Given , we propose the following subalgorithm to solve and obtain and afterwards.

Given , the proposed subalgorithm (Algorithm 1) will output the optimal and corresponding . Based on Subalgorithm 1 (see Algorithm 1), a decent algorithm to solve the dual problem P2 is proposed. We term it as Main Algorithm 1 and summarize it in Algorithm 2.

(1) Initialization: , the iteration index , and the searching step .
(2) Repeat
  At the -th iteration, compute according to (18);
  update according to ; .
(3) Until , where is a pre- scribed threshold.
(1) Initialization: the iteration index , , ,
, the searching step .
 Compute and according to Sub-algorithm 1 (see Algorithm 1).
(2) Repeat
  Update according to
, (a)
, (b)
  where and are the partial derivatives of
  with respect to and at and , respectively.
  Compute and via Sub-algorithm 1 (see Algorithm 1); .
(3) Until converges to the predetermined accuracy.

It is worth noting that, in Main Algorithm 1 (see Algorithm 2), and are derived as

The flow chart of Main Algorithm 1 (see Algorithm 2) is depicted in Figure 2. With the proposed Main Algorithm 1 (see Algorithm 2), we can obtain the optimal , denoted as , and the minimum value of . According to the duality principle, the optimal precoder of the original problem P1 is , and the maximum transmission rate of P1 is ’s minimum value.

Remark 2. In Subalgorithm 1 (see Algorithm 1), the initial can be set to an arbitrary matrix except a zero one. Since is not a convex function of , we had better run this algorithm several times and try different initial points to find a best value.

Remark 3. In the simulation, we compute the expected information rate in (7) by the following two steps. With (3), generate some information channel samples, denoted as , where is the index of the samples. Compute the following formula to approximate the expected rate in (7): where is the number of samples.

3.3. The Proposed Second Precoding Method

This subsection presents an indirect method for solving P2. To find the optimal covariance matrix, we show its structure by a proposition and then propose an iterative algorithm to solve an unknown diagonal matrix, which is similar to the so-called power-allocation matrix.

First, with (16), we have the following proposition.

Proposition 4. The optimal covariance matrix has the following form:where , the columns of are taken from the right-singular vectors of , and with .

Proof. See Appendix C.

From the above proposition, given , the unknown parameter to be determined is the diagonal matrix .

Then, we propose an iterative algorithm, like the so-called power-allocation algorithm, to solve . With a few manipulations (Appendix C), in (14) can be expressed aswhere , the definitions of and are shown in (C.3), and .

With (21) and noticing , can be further expressed asTo solve , introducing auxiliary variables , , we construct the following Lagrangian function:in which the vector is the -th column of . Consequently, the KKT (Karush-Kuhn-Tucker) optimality condition is given byAfter a few manipulations, the optimality condition above can be expressed asNotice that (27) is similar, but not identical to [32, Eq. ], because the problem of solving and that in [32] are somewhat different. Furthermore, as in Appendix D, (27) can be written in a simpler form,where . With (28), it is derived that (also see Appendix D)Similar to the processing technique introduced in [32], we present the iterative algorithm (see Algorithm 3) to solve , resulting from (29).

(1) Initialization: the iteration index , , where .
(2) Repeat
  Update according to , ; .
(3) Until all ’s converge to the predetermined accuracy.
(4) If there exists some such that but does
   not converge to 0, find , set
   and =0, update , and go to Step ; otherwise end this procedure.

Consequently, given , the algorithm of solving based on Proposition 4 is proposed and summarized in Algorithm 4.

(1) Obtain the matrix ,
  according to Sub-algorithm 2.1 (see Algorithm 3).
(2) Calculate the matrix Q according to (21).
(3) Calculate by substituting Q into (16).

Finally, with Subalgorithms 2.2 (see Algorithm 4), the second algorithm of solving P2 is proposed as Algorithm 5.

(1) Initialization: the iteration index , ,
  , and . Compute
  and according to Sub-algorithm 2.2 (see Algorithm 4).
(2) Repeat
  Update according to (a) and (b) in Algorithm 2,
  respectively; compute and , according
  to Sub-algorithm 2.2 (see Algorithm 4); increase .
(3) Until converges to the prescribed accuracy.

Note that, for (a) and (b) (see Algorithm 2) mentioned in Main Algorithm 2 (see Algorithm 5), the partial derivatives of should be modified asSimilar to the conclusion of the previous subsection, the proposed Main Algorithm 2 (see Algorithm 5) will output the optimal covariance and the minimum . According to the duality principle, the maximum transmission rate for P1 is the minimum and the optimal precoder can be readily acquired by taking the square root of .

Remark 1. In Subalgorithm 2.1 (see Algorithm 3), the initial power stream should not be set to zero; otherwise it will be unchanged afterwards. To prove the convergence of this algorithm analytically is quite difficult; however, we show that it does converge rapidly by extensive simulations and similar conclusions have also been drawn in [32].

Remark 2. Since the dual function is convex [43] and the two constraints and are linear, the problem P2 is convex and its solution is unique. Searching from an arbitrary feasible initial point is able to reach its optimal value after a few iterations. On the other hand, it is easy to find that given and , , that is, has a lower bound. Furthermore, the objective function in either Main Algorithm 1 or 2 (see Algorithm 2 or Algorithm 5) is nonincreasing because of the gradient-decent property of the proposed algorithm. Therefore, the proposed two algorithms are convergent.

3.4. Discussion

This subsection discusses two issues, including the difference between the proposed first method and second method and the difference between the proposed methods and the method in [32].

First, the two proposed methods solve the same optimization problem, i.e., solving in two different ways. The first one is a numerical method, which transforms a constrained optimization problem into an unconstrained one, and hence can adopt the classical gradient-decent method to solve the problem, whereas the second one is an analytical method, which presents the structure of the optimal transmission covariance matrix in Proposition 4. Therefore, the second algorithm exhibits a deeper understanding of the original problem, by revealing the structure of the optimal precoder.

Second, we study the precoder design or covariance matrix in three-node WIPT systems, as compared with that in point-to-point systems [32]. Clearly, the structure of the optimal covariance matrix (21) is quite different from that in [32, Theorem 1]. The power-allocation algorithm is only a part of the structure of the covariance matrix. We draw on the trait of solving the power-allocation problem in [32] and the two problems do look similar to some degree. However, there are also a few differences between them. In addition, solving the power-allocation problem also yields new contributions.(i)The two problems are different in form: one is given by (23) and the other is shown in [32, Eq. ]. The derivation methods of solving the two problems are somewhat different. The authors in [32] derives the optimal matrix from the first principle [44, Section 15.2], a derivative version of the KKT conditions. However, the principle cannot be applied to the problem in this paper. Hence, we perform the derivations via the classical KKT conditions.(ii)When solving the power-allocation problem, we believe that the derivation from [32, Eq. ] to [32, Eqs. ] is not such straightforward. The two formulas (27) and (29) in this paper are the counterparts of [32, Eq. ] and [32, Eqs. ], respectively. We give the details on how (29) is derived from (27) in Appendix D. Using the techniques in Appendix D, one can derive [32, Eqs. ] from [32, Eq. ].

4. Overhead and Complexity Issues

This section analyzes the overheads and computational complexity of the proposed precoding methods and a few other existing methods.

4.1. The Overhead

Table 1 lists the overheads of the proposed methods with other methods. We mainly consider the overhead that is required to compute the precoder.

Then, we make a brief review of the other three existing methods listed in Table 1. The isotropic transmission refers to the case in which the transmitter sends data streams with equal power, i.e., the precoder matrix . The time-switching scheme refers to the case in which each transmission period is divided into two time slots, one for data and the other one for energy. Let with be the percentage of transmitting time for the data time slot. The achievable R-E region of time-switching scheme is given byin which and are the transmission covariance matrices for data and energy transmission phases, respectively. For the precoder with hybrid CSI feedback [31], only statistical CSI is available between the transmitter and energy receiver, while instant CSI is available between the transmitter and data receiver. What is slightly different from the precoder [31], herein we do not consider the channel estimation error brought by limited-length pilots and noises, since it is not the focus of this article.

From Table 1, observe that the isotropic transmission does not need the overhead; however, it has the worst performance, which will be shown in the simulations. Compared with the precoder with hybrid CSI feedback, the proposed methods only require the statistical channel information and do not need the instant CSI of the information channel, which reduces the overall overhead significantly in the long run. In addition, the proposed methods have the same overhead with the time-switching method.

4.2. The Computational Complexity

The number of floating-point flops is often used to measure of the complexity of an algorithm. Here, a flop is defined as one multiplication or division of two floating-point numbers, and one flop has the computational complexity . The addition and subtraction operations are neglected, since they are much quicker. Note that the matrix product requires flops, where , ; for an matrix, both its inverse and eigendecomposition operations require flops [45]; for an matrix, its singular value decomposition (SVD) requires flops [45].

For the precoder with hybrid CSI feedback, the problem of solving it has the similar form with P3 [26] and hence can be solved by the proposed method in [26]. Note that the complexity for searching and in this method is [46]. For each iteration (searching and ), the main computations lie in [26, Eq. of Theorem 3.1] and have the complexity of . Therefore, if the precoder with hybrid CSI feedback is solved by the method [26], the complexity is .

For the time-switching method, it needs to compute two precoders. The first precoder is for energy transmission and the second is for information transmission. For the first one, it mainly involves the calculation of the eigenvectors of and hence has the complexity of . For the second one, it can be solved by the proposed method in [27] and this method has the complexity of . For the above, it is concluded that the time-switching method has the complexity of .

For the proposed first method, suppose that, for Main Algorithm 1 and Subalgorithm 1 (see Algorithms 2 and 1), the numbers of iterations required to reach the given accuracy are and , respectively. Observe that the main computation of this method lies in (18) and has the complexity of . Therefore, the complexity of the proposed first method is .

For the proposed second method, suppose that the number of iterations required to reach the given accuracy for Main Algorithm 2 (see Algorithm 5) is . Observe that there are two loops in Subalgorithm 2.1 (see Algorithm 3), termed as the outer loop and inner loop. Suppose that the number of iterations for the outer loop is and clearly . For the inner loop, assume that the average number of iterations required to calculate the power allocated to each eigenvector is . Observe that the main computations lie in Step of Subalgorithm 2.1 (see Algorithm 3) and (21). With the above, it is easy to find that the overall complexity of this method is . Note that, we find and usually in the simulation. The complexity of the proposed methods and a few existing methods is listed in Table 2.

5. Simulation Results

Computer simulation is employed to evaluate the performance of the proposed precoders of solving the optimal precoder and compare a few other existing methods. A three-node MIMO WIPT system is considered, where all nodes have the same number of antennas. We adopt the exponential correlation model as the channel correlation matrix, with its ()-th entry being , in which the constant is the correlation coefficient. Unless specified otherwise, for , , , and , their correlation coefficients are set to 0.1, 0.7, 0.2, and 0.3, respectively.

5.1. The Convergence of the Proposed Methods

Figure 3 shows the convergence of the proposed Subalgorithm 2.1 (see Algorithm 3). Two cases with and are considered. For both cases, the system SNR is 5 dB, the energy threshold is set to 10, and the initial power matrix . As in Figure 3(a), observe that, after 10 iterations, the power matrix converges to about . Similar behavior can be found in Figure 3(b), with the corresponding power matrix converging to . These results have well demonstrated the convergence of Subalgorithm 2.2 (see Algorithm 4).

Figure 4 plots the convergence of the proposed two precoding methods which correspond to Main Algorithms 1 and 2 (see Algorithms 2 and 5). In this figure, , , and the initial pairs are (0.1, 0.8) and (0, 0.5) for the first method and the second method, respectively. Two cases with SNR = 5 dB and SNR = 6 dB are considered. Observe that at SNR = 5 dB, the dual function with the proposed first method decreases from 4.02 bit to 3.66 bit after 4 iterations. Further increasing iterations only marginally decrease . Furthermore, the values of the dual function using the proposed two methods converge to the same minimum, i.e., about 3.66 bit. Similar behavior can be found at SNR = 6 dB.

To sum up, the above results indicate that the proposed two methods are convergent; with the same system configuration, the solutions to problem P1 using two precoding methods are identical, which also demonstrates the validity of the proposed methods.

5.2. The R-E Trade-Offs with Different Transmission Schemes

Figure 5 is a plot of the information transmission rate versus SNR with different energy thresholds. Three cases with , , and no energy constraint are considered. For all cases, . The information rate is solved by the proposed first method. In fact, both methods yield the same result, as will be illustrated later. Obviously, it is seen that the case with no energy constraint is the upper bound of the other two cases. When SNR is less than a threshold, the problem P1 has no solution. This is because that even allocating the total power to ’s eigenvector having maximum eigenvalue cannot meet the energy requirement. For instance, the SNR threshold for is about 2.47 dB. This value can also be calculated by the first argument of Lemma 1. On the other front, at relatively low feasible SNR, there is a certain gap between the case with energy constraint and that with no energy constraint. This is due to the fact that the energy constraint decreases the achievable information rate. However, with increasing SNR, the gap becomes smaller and smaller until zero. This is due to the fact that the energy constraint becomes inactive gradually.

Figure 6 presents the R-E trade-offs with the proposed two precoding methods. Two cases with SNR = 5 dB and SNR = 6 dB are considered. For both cases, we set and . Observe that, for each case, the trade-off curve with either the first or second method is horizontal at small energy threshold, and at a flexion point (A1 or A2), further increase of will result in the decrease of transmission rate. This is because that the information rate is nearly not affected by the energy constraint when is small and increasing will result in that the energy constraint becomes more and more active. There is also a cut-off point (B1 or B2), which corresponds to the maximum power, and exceeding the maximum power will result in no solution to . This is because the harvested energy is limited by the maximum transmission power. The flexion point and cut-off point can also be calculated by Lemma 1. In fact, the coordinates of two points obtained by Lemma 1 are accordance with those obtained by numerical method. Moreover, with the same system configuration, the trade-off curve with the first method nearly overlaps that with the second method and hence, the proposed two methods are cross-validated.

Figure 7 compares several transmission schemes, including the proposed precoder, time-switching scheme, the isotropic transmission, and the precoder with hybrid CSI feedback [31]. Since the proposed two methods yield the same precoder, we may as well use one of them, say, the second one. Two scenarios including a MIMO WIPT system and a MIMO WIPT system are investigated, and the corresponding results are plotted in Figures 7(a) and 7(b), respectively. Note that, the correlation coefficients for , , , and are 0.3, 0.5, 0.2, and 0.3, respectively. In Figure 7(a), observe that the achievable R-E region with the proposed precoder contains that with the isotropic transmission or time-switching scheme. The R-E region of isotropic transmission is a rectangular, whose upper boundary is very close to that of . This is because that the coefficient for is small and hence the isotropic transmission is nearly optimal for data transmission. The boundary of is simply a line by connecting the two points and . This can be derived by ranging from 0 to 1. Also observe that the R-E region of the scheme with hybrid CSI feedback is much broader than the proposed scheme. This is because that the former utilizes the instant CSI of the information channel, whereas the latter only has the statistical CSI of the information channel. That is, the increase of average rate is an exchange of additional overhead. Similar phenomena can be found in Figure 7(b). In addition, the optimal precoders with a few cases are listed in Table 3.

To sum up, the results in Figure 5 imply that the energy constraint decreases the achievable information rate at low feasible SNR evidently and it becomes gradually inactive with increasing SNR. The results in Figure 6 indicate that the proposed two methods can be cross-validated and the precalculation of the flexion point and cut-off point by Lemma 1 will help to find a rough R-E boundary curve. The results in Figure 7 demonstrate that the proposed precoder is superior to both the isotropic transmission and time-switching scheme in terms of the R-E region; however, it is inferior to the precoder with hybrid CSI feedback, since the latter utilizes more CSI.

5.3. The Effects of the Number of Transmitter Antennas on the R-E Trade-Off

In Figures 8 and 9, and SNR = 5 dB. The figures investigate the behavior of the proposed algorithms when the number of transmit antennas is large. Observe that, in Figure 8, we find that increasing results in much broader R-E regions. The right boundary of the R-E region, whose horizontal coordinate is the maximum harvested energy , has a limit corresponding to the limit in Figure 9(a). In Figure 9(a), increasing results in an limit around 35.7 joules/s. On the other hand, the maximum achievable rate of the R-E region is characterized by Figure 9(b). In Figure 9(b), for small , increasing makes the maximum rate increase rapidly. For large , e.g., 64, further increase of can only marginal increase the maximum rate. However, we cannot conclude that the maximum rate has a limit because the transmitter utilizes the CSI of information channel.

Recall that the capacity or the maximum rate for an open-loop point-to-point MIMO system has a limit when the number of transmitter antennas is large [47]. However, it does not hold for a closed-loop MIMO system because the transmitter has the CSI.

6. Conclusions

For a typical three-node wireless MIMO information and energy transmission system, the precoder design with only covariance feedback is studied. By using the convex optimization technique, and after a few transformations, we transform the precoder design problem into a dual one. The dual problem is proved to have no duality gap by Slater’s condition. Then, two algorithms have been proposed to solve this problem. The first one decomposes the transmission covariance and emphasizes numerically searching the optimal precoder, while the second one emphasizes exploiting the structure of the covariance, which is presented by Proposition 4. Their convergence, complexity, etc. have also been addressed by theoretical analysis or computer simulation.

On the other hand, the R-E region is often used to measure the WIPT system performance. We have exploited the key features of the R-E boundary curve by Lemma 1. In fact, the precalculation of the flexion point and cut-off point by this lemma will help to find a rough R-E boundary curve. Furthermore, we compare the proposed methods with a few existing transmission schemes. Simulation results show that the proposed precoder is superior to the isotropic transmission and time-switching scheme and it has the broadest R-E region compared with the other two regions. However, the proposed precoder is inferior to the precoder with hybrid CSI feedback, since the latter one utilizes more CSI.

The proposed precoding methods and above results can be a reference for practical system implementation in the future. We will further improve the system performance by incorporating the other wireless transmission techniques [4852].

Appendix

A. Proof of Lemma 1

Sinceit is clear that the first conclusion of Lemma 1 holds. Then, when , where is the solution of , it is deduced that is also the solution of P1 because of the following: the objective function of P1 is concave and all constraints are linear; therefore, P1 is convex and has a global optimal solution; maximizes the objective function of P1 and satisfies all constraints. Obviously, the energy constraint of P1 is not active because the equality sign does not hold.

B. Proof of Slater’s Condition

For one thing, if , the structure of the optimal covariance is given bywhere corresponds with the eigenvectors of . Without loss of generality, we assume that the eigenvalues of are sorted in descending order, so that the total power is allocated to the eigenvector with the maximum eigenvalue. In this scenario, there is no need transforming P1 into a dual problem.

For another, if , let , where . We have and . Therefore, there exists a strict feasible point for P1, and the Slaters condition is satisfied for .

C. Proof of Proposition 4

First, we reexpress by transforming . With (16), letting , one haswhere . Since and , it is deduced that the condition is equivalent to . Hence, (C.1) can be written as Notice that although is positive semidefinite and is positive definite, the product of them, , may not be positive semidefinite.

Then, we further arrange the expectation term of . Since may not be positive semidefinite, the eigendecomposition is no longer suitable. Let and its SVD is expressed asMeanwhile, the eigendecomposition of is expressed asSubstituting (C.3) and (C.4) into (C.2), one obtainsLetting and noticing that pre- or postmultiplying a unitary matrix does not change the distribution of , it is easily found that is a random matrix whose entries are independent, with zero-mean and arbitrary variance. Then, (C.5) is further written asLetting and substituting it into (C.6), one has

Finally, we transform into an equivalent problem and present the eigenvectors of the optimal . Assume that the total transmission power, corresponding to the trace of the optimal , is . Now, consider what are the eigenvectors of . It is easy to find that this task is equivalent to the following problem: Obviously, it is also equivalent tobecause the objective function achieves its maximum value at the maximum transmission power. Since satisfies the definition in [32, Definition 2], according to [32, Theorem 1], the eigenvectors of are given by the columns of . Since , we have the optimal covariance and hence, this proposition is proved.

D. Derivation of (29)

To start with, we have the following lemma.

Lemma D.1. For the matrix , where and are two column vectors, the inverse of is .

With the above lemma, we will reexpress the optimality condition (27).where . Noticing , (D.1) is further arranged asTherefore, the optimality condition (27) is reexpressed asWith (D.3), we can prove that (29) holds. For one thing, consider the case of . If , one has , because the denominator is larger than 1. However, this contradicts the optimality condition (D.3). Hence, for . For another, consider another case of . If , one obtains , which also contradicts (D.3). Therefore, in this case we have and , or equivalently, . In a word, (29) has been proved.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Disclosure

This paper has been presented in part at the IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Hong Kong, China, 2016.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the National Science Foundation of China (no. 61601275); Guangdong Province Natural Science Fund (no. 2016A030313068); Guangdong Natural Science Funds for Distinguished Young Scholar (no. 2014A030306027); the Innovation Team Project of Guangdong Province University (no. 2016KCXTD017); the Science and Technology Program of Guangzhou (no. 201807010103); highly educated talent research start-up fund of Nanjing Forestry University (no. GXL2016030); the Scientific Research Projects of Guangzhou Municipal Universities (no. 1201430722); the Scientific Research Project of Education Department of Guangdong, China (no. 2017GKTSCX045); Science and Technology Program of Guangzhou, China (no. 201707010389); the Scientific Research Project of Guangzhou Municipal University (no. 1201620439); and Qingshanhu Young Scholar Program in GZPYP (no. 2016Q001).