Research Article  Open Access
Distributed Structured Compressive SensingBased TimeFrequency Joint Channel Estimation for Massive MIMOOFDM Systems
Abstract
In massive multiinput multioutput orthogonal frequency division multiplexing (MIMOOFDM) systems, accurate channel state information (CSI) is essential to realize system performance gains such as high spectrum and energy efficiency. However, highdimensional CSI acquisition requires prohibitively high pilot overhead, which leads to a significant reduction in spectrum efficiency and energy efficiency. In this paper, we propose a more efficient timefrequency joint channel estimation scheme for massive MIMOOFDM systems to resolve those problems. First, partial channel common support (PCCS) is obtained by using timedomain training. Second, utilizing the spatiotemporal common sparse property of the MIMO channels and the obtained PCCS information, we propose the prioriinformation aided distributed structured sparsity adaptive matching pursuit (PADSSAMP) algorithm to achieve accurate channel estimation in frequency domain. Third, through performance analysis of the proposed algorithm, two signal power reference thresholds are given, which can ensure that the signal can be recovered accurately under powerlimited noise and accurately recovered according to probability under Gaussian noise. Finally, pilot design, computational complexity, spectrum efficiency, and energy efficiency are discussed as well. Simulation results show that the proposed method achieves higher channel estimation accuracy while requiring lower pilot sequence overhead compared with other methods.
1. Introduction
Due to the scarcity of radio spectrum in the microwave band, how to improve spectrum efficiency (SE) has become one of the significant problems for future wireless communication systems. By leveraging the outstanding advantages of multiplexing and diversity, massive multiinput multioutput (MIMO) or largescale antenna technology can achieve high SE and energy efficiency (EE) and has become one of the key technologies for fifthgeneration (5G) wireless communications [1, 2]. In addition, orthogonal frequency division multiplexing (OFDM) has been widely adopted in modern wireless communication systems due to its excellent antimultipath capability [3]. Consequently, massive MIMOOFDM will be a new standard for 5G.
In massive MIMOOFDM systems, channel state information (CSI) is indispensable for precoding/combine, channel equalization and coherent detection, etc., which makes the accurate channel estimation crucial to improving system performance [3, 4]. In conventional MIMOOFDM systems, orthogonal pilot training scheme is adopted for accurate downlink CSI acquisition. In massive MIMOOFDM systems, however, since hundreds of transmit antennas are installed at BS, the acquisition of highdimensional CSI results in the prohibitively high pilot overhead, much less the problem of computational complexity caused by highdimensional matrix operation [5]. Meanwhile, quantization accuracy of the CSI fed back from users to BS, which is affected by the number of BS antennas, is a question worth considering as well [6, 7].
To solve those problems, many lowrank channel estimation approaches were proposed such as lowrank channel covariance matrices (CCMs) based method [8, 9], array signal processingbased method [10, 11], and compressive sensing (CS) based method [12â€“23]. Although the lowrank property of CCMs can make the dimensions of MIMO channels greatly reduced, highdimensional CCMs are difficult to be obtained. Moreover, highdimensional matrix operations involved in the singular value decomposition (SVD) or eigenvalue decomposition (EVD) lead to high computational complexity. Array signal processingbased methods have many advantages, but it is particularly applicable to millimeterwave massive MIMO systems which have high angular resolution [4]. Fortunately, the CSbased channel estimation method can be adopted for massive MIMOOFDM systems due to the sparse characteristics of MIMO wireless channels [24].
Basically, there are three categories of CSbased channel estimation methods for MIMOOFDM systems: timedomain training channel estimation (TTCE) method [12, 13], frequencydomain pilot training channel estimation (FTCE) method [17â€“19], and timefrequency joint training channel estimation (TFTCE) method [21â€“23]. TTCE exploits the interblock interferencefree (IBIfree) region in the redundant portion of the received timedomain training sequences (TSs) for channel estimation. Although this scheme has high spectral efficiency, the design of the timedomain TSs is rather difficult when the number of antennas is large enough, because it not only needs to ensure the orthogonality between different TSs but also makes sure that the sensing matrices satisfy the restricted isometry property (RIP) condition [14, 15]. Moreover, accuracy of the estimation is seriously affected by channel length [16]. Unlike the TTCE method, the FTCE method uses orthogonal or nonorthogonal frequencydomain pilots for channel estimation. Although the difficulty of designing training sequences is greatly reduced, the spectral efficiency is decreased because the useful information provided by the preamble is not effectively utilized. Taking the advantages of the above two schemes, Dai et al. [5] proposed TFTCE scheme, while the CS method is not adopted for higher accuracy estimation. On this basis, Ding et al. [21] proposed a TFTCE method based on compressed sensing. However, due to the interference of nonorthogonal timedomain TSs, it cannot be applied to massive MIMO systems. Ding et al. and Fan et al. [22, 23] exploited identical TSs in time domain and frequency domain over a transmission frame, while changing TSs can bring better performance [25]. More importantly, these methods did not make effective use of the sparse characteristics of MIMO channels.
In this paper, we propose a distributed structured compressive sensingbased timefrequency joint channel estimation method for massive MIMOOFDM systems. Our contributions in this paper are summarized as follows:(i)By using spatiotemporal common sparse characteristics of wireless MIMO channel, we propose a priori information aided distributed structured sparsity adaptive matching pursuit algorithm (PADSSAMP). The proposed algorithm regards the partial channel common support as a priori aided information and combines SAMP algorithm with structured multimeasurement vector (MMV) ideas [26]. The proposed algorithm improves the spectral efficiency and the accuracy of the channel estimation, while without knowing the sparsity of MIMO channels.(ii)The performance of the proposed algorithm is analyzed. First, the signal power thresholds which can ensure accurate recovery of signals under powerlimited noise and according to probability accurate recovery under additive white Gaussian noise (AWGN) are obtained though strict derivation. Next, we give the pilot design scheme that is well suited for massive MIMO systems. Finally, computational complexity, spectrum efficiency, and energy efficiency are given, which show that not only the computational complexity is less than other algorithms but also the spectrum efficiency and energy efficiency are greatly improved.
The remainder of this paper is organized as follows. The distributed massive MIMOOFDM system model with blocksparse channel is described in Section 2. In Section 3, a distributed structured compressive sensingbased channel estimation algorithm is proposed. Next, Section 4 presents the performance analysis of the proposed algorithm. Simulation results are presented in Section 5 to demonstrate the performance of the proposed algorithm. Finally, the paper is concluded in Section 6.
Notation. Throughout our discussion, lowercase boldface letters indicate vector and uppercase boldface letters indicate matrix; and denote the zeros matrix of arbitrary size and identity matrix; , , , , and stand for the transport, conjugate transport, matrix inversion, Mooreâ€“Penrose matrix inversion, and cardinality, respectively. and represent norm and Frobenius norm; operator represents circular convolution; expresses the block of matrix or vector and represents mixed norm of matrix or vector .
2. Distributed System Model with BlockSparse Channel
Consider a massive MIMOOFDM system with transmit antennas configured at base station (BS) to serve signalantenna mobile terminals, where . In this paper, we focus on the case of downlink transmission in which each frame consisting of several timefrequency training ODFM (TFTOFDM) symbols is transmitted from BS to mobile terminals. As described in Figure 1, the ith () TFTOFDM symbol transmitted from the () antenna consists of a length timedomain TS and a following length OFDM data block . Unlike cyclic prefix OFDM (CPOFDM) and zeropadding OFDM (ZPOFDM), known pseudonoise (PN) instead of unknown CP or ZP is adopted as guard interval in TFTOFDM to alleviate interblock interference (IBI) caused by multipath.
At receiver, the TFTOFDM symbols transmitted from each BS antenna are mixed together at each mobile terminal so that the received TFTOFDM symbol vector in time domain can be written as [5]where and are Toeplitz lower and upper triangular matrices with the first column vector and the first row vector , respectively. At the same time, denotes the channel impulse response (CIR) vector from the antenna to receiver with the length and is the complex additive white Gaussian noise (AWGN) vector with zero mean and the variance of .
After cyclicity reconstruction operated by the overlapping and adding (OLA) algorithm [5], the received pilot sequence of the OFDM symbol can be expressed through discrete Fourier transform (DFT) process aswhere is the DFT matrix and is the matrix consisted of the first column of , denotes the index set of subcarriers allocated to pilots whose elements are uniquely selected repeatedly from the set of , and are the submatrix or subvector by selecting the rows of or according to , , , , is the number of pilot subcarriers, and is the complex AWGN.
Wireless channels have sparse characteristics [24, 27]. The presence of scatterers and reflectors in wireless communication environment results in a multipath channel with several significantly resolvable propagation paths. Because the significant scatterers between transmitter and receiver are usually scarce, the number of the paths containing the majority of channel energy is so small that they are sparsely distributed in the resolvable paths. In other words, the gains of most paths approach to zeros except for a small number of significant paths so that the channel exhibits sparsity in delay domain [12]. The sparsity support of the sparse channel is definedwhere is the noise floor which can be determined according to [28]. The sparsity level of the channel is denoted as .
Because antennas usually share the same scattering environment, all of the channels between one user and different antennas deployed on base station share common sparsity [17]. Therefore, the nonzero entries of the CIR vector can occur in clusters so that the CIR vector can be transformed to a blocksparse CIR vectorwhere . The blocking process of CIR vector is shown in Figure 2. At the same time, the observation matrix should have corresponding changes as follows:where . Thus, equation (2) can be reexpressed as
Besides, the path delays of practical wireless channels vary much slower than the path gains as shown in many studies [12]. This slow variation results that the CIRs over consecutive TFTOFDM symbols share the same sparsity pattern [17]. Consequently, this spatiotemporal common sparse characteristic of MIMO channel over TFTOFDM symbols can be expressed as
Defining the spatiotemporal Common Sparse Support of aswhere and is determined as (3). For simplicity, we assume the same threshold is used over each frame. Thus, the spatiotemporal Common Sparse Level of the channel vector set is .
Because the observation matrices are different, but CIRs have the same sparse characteristics, this system model can be called a distributed system model. The structure of the distributed system model with common blocksparse channel is described in Figure 3.
3. Distributed Structured Compressive SensingBased Channel Estimation
In this section, we will introduce a timefrequency joint channel estimation method to obtain more accurate estimation accuracy. Firstly, timedomain training is used to obtain partial common path delay and then accurate channel estimation is achieved using the proposed PADSSAMP algorithm.
3.1. Acquisition of Partial Channel Common Support
Although timedomain training cannot realize accurate channel estimation, it can be used to acquire some prior information of channels such as partial channel delay information. Using the overlapadd method, the superimposed signal with the sum of the main part and tail of received timedomain TS can be expressed aswhere and are the received signals associated with PN part and the tail part expressed as (10) and is the number of symbols in which the channel delay remains unchanged.where the measurement matrices and are determined by and which are given in [12]. Thus, we can get
The matrix is generated by cyclically shifting the basic TS and the data block interference (DBI) matrix can be denoted as
Taking advantage of the circular convolution, the coarse CIR associated with the antenna can be estimated as
In equation (13), the second part and the third part are the interference caused by nonorthogonal PN sequences and data, respectively. The data interference is mainly due to the superposition of the previous TFTOFDM data block and the current TFTOFDM symbol data block. Fortunately, there is no correlation between the random OFDM data block and the fixed TS sequence so that the data interference from others antennas has little effect on the delay estimation of the channel, especially in largescale antenna systems [29]. However, the PN sequences interference from different antennas will increase linearly with the number of antennas when the cyclic matrices of the PN sequences associated with different antennas are not orthogonal with each other. To the best of our knowledge, however, there is no a set of sequences which have good autocorrelation and the cyclic matrices of which are orthogonal to each other. It is worth noting that this problem can be resolved once the same PN sequence is used for each antenna. Consequently, we have average CIR
As a perfect sequence, Zadoffâ€“Chu sequence which has good autocorrelation and crosscorrelation is widely used in the OFDM system for synchronization and channel estimation [30]. A length Zadoffâ€“Chu sequence is defined as [30]where is the sequence length, is an integer, and is an integer prime to . By taking the inverse discrete Fourier transform (IDFT) of a diagonal matrix which the diagonal elements comprise the Zadoffâ€“Chu sequence , matrix is constructed with a lot of excellent correlation properties in each row vector as well. For each row sequence of the constructed matrix , the following correlation properties are held [30]:where represents a periodic cycle of length .
It has been proved that exploiting constant TS within the transmission frame is not an optimal scheme for channel tracking and changing TS can bring better channel estimation performance [5, 25]. Thus, we adopt changing timedomain training scheme which different PN sequences generated by each row sequence of for the acquisition of accurate CSI over one frame of the transmission signals. The partial common support of MIMO channels over TFTOFDM symbols can be calculated as follows:where is the same as that in (8) an can ensure the reliability of the channel common support.
3.2. Acquisition of Accurate Path Gain Estimation
In this section, we use the frequencydomain method in combination with the partial channel common support as a priori information for accurate channel estimation.
Since the support set of CIR remains unchanged over several consecutive TFTOFDM symbols, it can be estimated simultaneously by using multiple measurement vector (MMV) CS methods [31]. In general, MMV methods are used to solve sparse signal recovery problems in which measurement vectors are associated with the same sensing matrix, while generalized MMV (GMMV) CS methods are required for the distributed systems with different sensing matrices. The problem of sparse channel state acquisition can be formulated as the following optimization problem:
To solve this problem, the PADSSAMP algorithm listed in Algorithm 1, which is developed from the DSAMP algorithm [18], is proposed.

The initialization of the PADSSAMP algorithm is implemented in Steps 1âˆ¼3. In Step 5, the correlations of the residuals with the column vector of sensing matrices are calculated over each TFTOFDM symbol and the indexes of the largest nonzero entries are selected in Step 6, where represents the indexes of maximum values in the set and and presents basic sparsity and increased sparsity, respectively. Steps 7âˆ¼10 are the support pruning for more precise sparsity support. Note that Step 9 does not guarantee that the basic support is fully contained so that Step 10 is needed to refine the effective support and ensure entries in the final support. Step 11 estimates associated with the effective support using LS, and Step 12 calculate the residuals over each symbol. The index of the minimum power in the block of is found in Step 13, and the iteration is terminated if the condition that the average energy of signal is less than a certain threshold holds, which is expressed in Step 14. Steps 15âˆ¼21 are the step adaptive process, and the advantages of this approach will be explained below, where denotes upper bound integer operation. The whole iteration will be terminated if the residual average energy is less than a certain threshold given in Step 22. Finally, the estimation of structured channel impulse response is obtained in Step 23.
Compared with DSAMP algorithm [18] and the classic SAMP algorithm [32], the proposed PADSSAMP algorithm has four distinctive features:(i)Since the signal to be estimated has block structure, the structured sparsity is exploited in the proposed PADSSAMP algorithm for block signal recovery. However, this useful signal feature is not used in either DSAMP algorithm or SAMP algorithm.(ii)In the PADSSAMP algorithm, the sparsity support consists of basic support and additional support. Only additional support is updated while the basic support does not change in each iteration operation. The reason is that the basic support achieved by timedomain training has high reliability.(iii)The third difference is that the step size can be adaptively adjusted in PADSSAMP algorithm, which has no discussion in DSAMP algorithm. Due to the tradeoff between recovery speed and recovery efficiency [33], an adaptive step size selection scheme is utilized which is based on the experiences that large step size is preferable for signal with flat magnitude and small step size is suitable for signal with fast magnitude attenuation.(iv)Finally, we give the termination threshold of the algorithm through strict mathematical derivation, which plays an important role in the performance of the algorithm.
Remark 1. After removing the large gain taps of CIR by using a priori information of channel delay, the remaining tap amplitudes have not much difference so that large step size is suitable. However, with large magnitude taps being reconstructed after a few stages of the algorithm, the number of taps to be reconstructed is less and less. In other words, the energy of the reconstructed CIR approaches to remain stable and the estimated sparsity level is close to the true sparsity level. Therefore, in order to better determine the more accurate sparseness, the step should be gradually reduced. According to this behavior, a large step size is set at the several stages of beginning in PADSSAMP algorithm to expedite the convergence. Then, the step size decreases adaptively to provide fine tuning in following stages when the change of the reconstructed signal is less than a certain threshold.
4. Performance Analysis
In this part, we discuss the performance of the proposed algorithm. Firstly, we discuss the convergence of the algorithm, getting the convergence condition and termination threshold of the algorithm. Secondly, pilot design for sensing matrices is discussed. Then, the computational complexity of proposed algorithm is compared with OMP, SOMP, DSAMP algorithm, DSSAMP algorithm, and PASOMP algorithm. Finally, pilot overhead for CSI acquisition is obtained. Before performance analysis, we first introduce two important definitions and two useful lemmas, which will play a key role in the subsequent analysis and discussion.
Definition 1 [34]. BlockCoherence of is defined aswhere and is the size of the block.
Definition 2 [34]. Subcoherence of is defined asIf , then .
Lemma 1. Let be a matrix and be the block of , thenLemma 1 is obtained directly from Lemma 1 [34].
Lemma 2. [35]. Suppose that the noise vector follows Gaussian distribution ; if bounded set is satisfied, then we have
4.1. Convergence Analysis of Proposed PADSSAMP Algorithm
The PADSSAMP algorithm begins with the residual initialization as , and the significant part is to choose at least one correct column block of the sensing matrix in each iteration. At the stage (), the block best matched to is chosen according to . Suppose denotes the support set of the signal and denotes the complementary set of . Theorem 1 reveals the convergence conditions of the proposed algorithm in the absence of noise. We first discuss the noisefree case, then discuss powerlimited noise , and finally extend it to Gaussian white noise case .
Theorem 1. For blocksparse signals with the blocks length of which satisfy , a sufficient condition for the PADSSAMP algorithm to recover is that
Proof. See Appendix A.
The sufficient condition (26) depends on which is determined by the nonzero position of the signal , but the nonzero position of the is not known in advance. Therefore, Theorem 1 is not practical. Following theorem under certain conditions on Block Coherence and Subcoherence associated with can make sure the (A.1) holds.
Theorem 2. Sufficient condition (26) is satisfied if for every , there is
The proof of the theorem can refer to the literature [34]. Based on Theorem 1 and Theorem 2, we will discuss the condition in which the proposed algorithm can select the correct atom for each step in the case of powerlimited noise.
Theorem 3. Suppose that observed noise follows additive white Gaussian noise ; if the remaining energy of the signal satisfy , the proposed algorithm with the stopping rule selects the correct atom with probability at least , where .
Proof. See Appendix B.
Remark 2. Theorem 3 gives the double threshold condition for the signal to be fully recovered according to a certain probability. Although noise power and empirical threshold involved in signal power are usually used as terminated conditions, these thresholds are somewhat conservative. When the signaltonoise ratio is low, it leads to the incorrect supports. Theorem 3 can provide a degree of quantification of correct recovery so as to guide the choice of thresholds.
4.2. Pilot Design Discussion for Sensing Matrices
According to (2), the design of sensing matrices is related to pilot sequences and pilot placement . In CS theory, RIP is treated as a sufficient condition that the sparse signal can be reliably and stably recovered. However, RIP of a sensing matrix is so difficult to calculate that mutual incoherence property (MIP) which is stronger than RIP is widely used as a framework for sparse signal recovery. Besides, from Theorem 2, we can see that the MIP of sensing matrix is directly related to the termination threshold of the proposed algorithm. The smaller the and the , the lower the threshold of the recoverable signal and the more accurate the signal recovery. Therefore, our purpose is to design sensing matrices with good crosscorrelation.
Using the Central Limit Theorem, Gao [17] proposed a constant envelope complex exponential random phase pilot , where has the independent and identically distributed (i.i.d.) uniform distribution , and proved that the pilot vectors between different antennas and different OFDM symbols have asymptotic orthogonality when pilots are placed at equal intervals.where , , and is the index of the set . The equation takes advantage of independent condition between and . When , we have ; otherwise, .
Therefore, when the pilot number is large enough, sensing matrices will have good crosscorrelation. As discussed below, the number of pilots is proportional to the number of antennas. So, for massive MIMO systems, the column vectors of the sensing matrices satisfy approximately orthogonality. Besides, according to Theorem 2, we have inequality , which means that smaller and bring a larger number of antenna.
4.3. Computational Complexity of the DSSAMP Algorithm
The computational complexity of the algorithm is analyzed in this section. Firstly, the computational complexity of each iteration in each step is given in Table 1.

Obviously, among the steps of the PADSSAMP algorithm, matrix inversion operated for least squares (LS) estimation contributes the dominant computational complexity. Gao [18] compared the number of complex multiplications of OMP, SAMP, and DSAMP algorithms in each iteration. Compared with these four algorithms, it is clear that the DSSAMP algorithm almost has the same computational complexity due to almost the same calculation steps they have. However, with a priori common support, the number of iterations of the proposed algorithm will be greatly reduced so that the total amount of calculation will be significantly smaller.
4.4. Spectrum Efficiency and Energy Efficiency
Blockstructured processing of signals can reduce pilot overhead, and the minimum pilot overhead will be derived below.
Theorem 4. [31]. For , , whose elements obey an i.i.d. continuous distribution, the minimization problem will have the unique solution ifwhere , is a full rank matrix satisfying and is the common support of . After coarse CIR estimation, system equation becomeswhere denotes the difference set between and . Due to the temporal common sparsity, the sparsity of the signal that needs to be recovered becomes and the equation (29) can be transformed into . Besides, we know that and ; thus, we have
In addition, in order to recovery the signal with known support, pilots are needed. Therefore, the total number of pilots needs to meet the following condition:
Subsequently, we can get the smallest required pilot overhead which is less than the minimum pilot overhead required for unstructured CS methods.
Due to the overhead caused by the timedomain guard interval and the frequencydomain pilots, the spectral efficient of the proposed method normalized by the ideal case without any overhead [5, 13] can be expressed in the percentage notation as
Table 2 compares the spectral efficiency of several commonly used algorithms [5], where the typical wireless digital television system with the total subcarriers number is adopted. Besides, the channel model with six resolvable paths is defined by ITU, which means that the sparsity . Without loss of generality, the guard interval length is and the number TFTOFDM symbols in a frame is . Suppose the initial sparsity is ; for largescale MIMO system, the spectral efficient can be calculated as . As can be seen from Table 2, the proposed method has the highest spectral efficiency compared with other methods.
Besides, the PN sequence power and pilot power are boosted to achieve more reliable channel estimation in the TFTOFDM scheme. Therefore, the energy efficiency can be expressed in the percentage notation as [12]where and denote the amplitude factor imposed on the timedomain NP and frequencydomain pilots, respectively. Generally, and have been specified such as DVBT2 standard and DTMB standard. Therefore, energy efficiency can be calculated as . Table 3 summarizes the energy efficiency comparison for different OFDM schemes. It is clear that the proposed method has the highest energy efficiency. The reason is that less pilot overhead leads to higher energy efficiency.
5. Simulation Results
In this section, in order to verify the effectiveness of the proposed algorithm, we perform performance analysis by comparing the proposed algorithm with other seven schemes: OMP, simultaneous OMP (SOMP), distributed SAMP (DSAMP) [18], and priori information aided SOMP (PASOMP) [22] through computer simulation. Simulation system is configured according to the most commonly used wireless broadcasting systems with antennas and the parameters are set as centric carrier frequency , signal sampling frequency bandwidth , Doppler shift , DFT size , and guard interval length . The percentage of pilot overhead can be calculated as . The typical multiple channels with 6taps named ITU B is used to evaluate system performance and the specific parameters can be referred as literature [12].
Figure 4 shows the mean square error (MSE) performance comparison of five channel estimation methods for the massive MIMOOFDM system where per frame of the transmitted signal contains 10 TFTOFDM symbols. The Cramerâ€“Rao Lower Bound (CRLB) is used as a benchmark for comparison. The number of pilots is 600, that is, the percentage of pilot overhead accounts for 14.6 percent. It is clear that the proposed algorithm has 1.5âˆ¼2â€‰dB signaltonoise ratio advantage for other methods when the MSE is 10^{âˆ’1} and OMP algorithm has the worst performance. However, when the MSE is less than 10^{âˆ’2}, the performance of several algorithms tends to be the same. The reason is that those algorithms can choose the correct support when the signaltonoise ratio is high. Therefore, we can conclude that the proposed algorithm has great advantages in the case of low and medium signaltonoise ratio (SNR).
Figure 5 shows the bit error rate (BER) of each algorithm for the massive MIMOOFDM system. Zeroforcing (ZF) equalizer is adopted at mobile terminals for signal detection by using the estimated channel state information. Modulation scheme adopts quadrature amplitude modulation (QAM), and Gray code is used for source code. The BER with ideal CSI is plotted as the benchmark. Although the bit error rate curve of several algorithms is relatively close to each other, it can be seen from the performance curves in the subgraph that the proposed algorithm has the best bit error rate compared to other algorithms. This result is consistent with the analysis results in Figure 4.
To further evaluate the performance of the proposed method, Figures 6 and 7 show the MSE performance and BER performance with the variety of the number of measurements under the fixed SNR of 20â€‰dB, respectively. As the number of pilots increases, it can be observed that the MSE curve of the proposed algorithm is closer to the MSE curve of CRLB. This means that the proposed algorithm can get the right support due to signal structured processing. The proposed algorithm can achieve an MSE accuracy of with the pilot overhead, while SOMP and PASOMP algorithms require pilot overhead of at least and DSAMP needs more. As expected, OMP algorithm has the worst estimation performance consistent with the previous analysis. From the performance curves of the figures and the analysis above, we conclude that the proposed algorithm requires fewer pilots overhead at the same estimation accuracy.
In Figure 8, the correct signal recovery probabilities which are functions of the pilot overhead are drawn to evaluate the performance of the five methods. Due to the presence of additive white noise in the measurement signal, the correct recovery of the signal is probabilistic. For more convenient discussion, the recovery probability of signal is regarded as 1 when the MSE of the signal estimation is lower than . It is clear that the proposed method can accurately recover the signal when the pilot overhead is , while the DSAMP, SOMP, and PASOMP algorithms need up to , , and pilot overhead, respectively. This result is consistent with the performance analysis in Section 4. Therefore, compared to other algorithms, the proposed method can reduce about to pilot overhead so that spectral efficiency is improved.
6. Conclusions
In this paper, we proposed a timefrequency joint channel estimation scheme for massive MIMOOFDM systems. First, partial channel common support was achieved by using changing timedomain training sequence. In order to obtain accurate CSI, we proposed a priori information aided distributed structured sparsity adaptive matching pursuit (PADSSAMP) channel estimation algorithm by using obtained common path delay information. Then, we analyzed the performance of the proposed algorithm, including the convergence analysis, pilot design, computational complexity, and spectrum and energy efficiencies. Through convergence analysis, we gave two signal power thresholds which can ensure signal is fully recovered and recovered fully based on probability under the powerlimited noise and the Gaussian white noise cases, respectively. In addition, pilot design results showed that the pilot sequences of different antennas tend to be orthogonal with the increase in number of antennas. Furthermore, the computational cost of the proposed algorithm is significantly reduced, and the spectrum and energy efficiencies are improved. Experimental simulation showed that compared with other algorithms, the proposed algorithm not only increases the estimation accuracy but also greatly reduces the pilot overhead.
Appendix
A. Proof of Theorem 1
Assume that the next chosen indexes will at least contain a block in if is in . This is equivalent to the following inequality:
Due to , where , , and , inequality (A.1) can be converted to the following form:and it also can be expressed as the following form:
As a generalization of the matrix norm, the mixed matrix norm is defined as . According to the properties of the pseudoinverse, is the orthogonal projector onto . Thus, it holds that . Since is the Hermitian matrix, the following equation holds . So, we have
The first inequality is obtained by using the definition of mixed matrix norm , and the second inequality is obtained by Lemma 1. Since and , it results in a block diagonal matrix