Abstract

In massive multi-input multi-output orthogonal frequency division multiplexing (MIMO-OFDM) systems, accurate channel state information (CSI) is essential to realize system performance gains such as high spectrum and energy efficiency. However, high-dimensional CSI acquisition requires prohibitively high pilot overhead, which leads to a significant reduction in spectrum efficiency and energy efficiency. In this paper, we propose a more efficient time-frequency joint channel estimation scheme for massive MIMO-OFDM systems to resolve those problems. First, partial channel common support (PCCS) is obtained by using time-domain training. Second, utilizing the spatiotemporal common sparse property of the MIMO channels and the obtained PCCS information, we propose the priori-information aided distributed structured sparsity adaptive matching pursuit (PA-DS-SAMP) algorithm to achieve accurate channel estimation in frequency domain. Third, through performance analysis of the proposed algorithm, two signal power reference thresholds are given, which can ensure that the signal can be recovered accurately under power-limited noise and accurately recovered according to probability under Gaussian noise. Finally, pilot design, computational complexity, spectrum efficiency, and energy efficiency are discussed as well. Simulation results show that the proposed method achieves higher channel estimation accuracy while requiring lower pilot sequence overhead compared with other methods.

1. Introduction

Due to the scarcity of radio spectrum in the microwave band, how to improve spectrum efficiency (SE) has become one of the significant problems for future wireless communication systems. By leveraging the outstanding advantages of multiplexing and diversity, massive multi-input multi-output (MIMO) or large-scale antenna technology can achieve high SE and energy efficiency (EE) and has become one of the key technologies for fifth-generation (5G) wireless communications [1, 2]. In addition, orthogonal frequency division multiplexing (OFDM) has been widely adopted in modern wireless communication systems due to its excellent antimultipath capability [3]. Consequently, massive MIMO-OFDM will be a new standard for 5G.

In massive MIMO-OFDM systems, channel state information (CSI) is indispensable for precoding/combine, channel equalization and coherent detection, etc., which makes the accurate channel estimation crucial to improving system performance [3, 4]. In conventional MIMO-OFDM systems, orthogonal pilot training scheme is adopted for accurate downlink CSI acquisition. In massive MIMO-OFDM systems, however, since hundreds of transmit antennas are installed at BS, the acquisition of high-dimensional CSI results in the prohibitively high pilot overhead, much less the problem of computational complexity caused by high-dimensional matrix operation [5]. Meanwhile, quantization accuracy of the CSI fed back from users to BS, which is affected by the number of BS antennas, is a question worth considering as well [6, 7].

To solve those problems, many low-rank channel estimation approaches were proposed such as low-rank channel covariance matrices- (CCMs-) based method [8, 9], array signal processing-based method [10, 11], and compressive sensing- (CS-) based method [1223]. Although the low-rank property of CCMs can make the dimensions of MIMO channels greatly reduced, high-dimensional CCMs are difficult to be obtained. Moreover, high-dimensional matrix operations involved in the singular value decomposition (SVD) or eigenvalue decomposition (EVD) lead to high computational complexity. Array signal processing-based methods have many advantages, but it is particularly applicable to millimeter-wave massive MIMO systems which have high angular resolution [4]. Fortunately, the CS-based channel estimation method can be adopted for massive MIMO-OFDM systems due to the sparse characteristics of MIMO wireless channels [24].

Basically, there are three categories of CS-based channel estimation methods for MIMO-OFDM systems: time-domain training channel estimation (TTCE) method [12, 13], frequency-domain pilot training channel estimation (FTCE) method [1719], and time-frequency joint training channel estimation (TFTCE) method [2123]. TTCE exploits the interblock interference-free (IBI-free) region in the redundant portion of the received time-domain training sequences (TSs) for channel estimation. Although this scheme has high spectral efficiency, the design of the time-domain TSs is rather difficult when the number of antennas is large enough, because it not only needs to ensure the orthogonality between different TSs but also makes sure that the sensing matrices satisfy the restricted isometry property (RIP) condition [14, 15]. Moreover, accuracy of the estimation is seriously affected by channel length [16]. Unlike the TTCE method, the FTCE method uses orthogonal or nonorthogonal frequency-domain pilots for channel estimation. Although the difficulty of designing training sequences is greatly reduced, the spectral efficiency is decreased because the useful information provided by the preamble is not effectively utilized. Taking the advantages of the above two schemes, Dai et al. [5] proposed TFTCE scheme, while the CS method is not adopted for higher accuracy estimation. On this basis, Ding et al. [21] proposed a TFTCE method based on compressed sensing. However, due to the interference of nonorthogonal time-domain TSs, it cannot be applied to massive MIMO systems. Ding et al. and Fan et al. [22, 23] exploited identical TSs in time domain and frequency domain over a transmission frame, while changing TSs can bring better performance [25]. More importantly, these methods did not make effective use of the sparse characteristics of MIMO channels.

In this paper, we propose a distributed structured compressive sensing-based time-frequency joint channel estimation method for massive MIMO-OFDM systems. Our contributions in this paper are summarized as follows:(i)By using spatio-temporal common sparse characteristics of wireless MIMO channel, we propose a priori information aided distributed structured sparsity adaptive matching pursuit algorithm (PA-DS-SAMP). The proposed algorithm regards the partial channel common support as a priori aided information and combines SAMP algorithm with structured multimeasurement vector (MMV) ideas [26]. The proposed algorithm improves the spectral efficiency and the accuracy of the channel estimation, while without knowing the sparsity of MIMO channels.(ii)The performance of the proposed algorithm is analyzed. First, the signal power thresholds which can ensure accurate recovery of signals under power-limited noise and according to probability accurate recovery under additive white Gaussian noise (AWGN) are obtained though strict derivation. Next, we give the pilot design scheme that is well suited for massive MIMO systems. Finally, computational complexity, spectrum efficiency, and energy efficiency are given, which show that not only the computational complexity is less than other algorithms but also the spectrum efficiency and energy efficiency are greatly improved.

The remainder of this paper is organized as follows. The distributed massive MIMO-OFDM system model with block-sparse channel is described in Section 2. In Section 3, a distributed structured compressive sensing-based channel estimation algorithm is proposed. Next, Section 4 presents the performance analysis of the proposed algorithm. Simulation results are presented in Section 5 to demonstrate the performance of the proposed algorithm. Finally, the paper is concluded in Section 6.

Notation. Throughout our discussion, lowercase boldface letters indicate vector and uppercase boldface letters indicate matrix; and denote the zeros matrix of arbitrary size and identity matrix; , , , , and stand for the transport, conjugate transport, matrix inversion, Moore–Penrose matrix inversion, and cardinality, respectively. and represent norm and Frobenius norm; operator represents circular convolution; expresses the block of matrix or vector and represents mixed norm of matrix or vector .

2. Distributed System Model with Block-Sparse Channel

Consider a massive MIMO-OFDM system with transmit antennas configured at base station (BS) to serve signal-antenna mobile terminals, where . In this paper, we focus on the case of downlink transmission in which each frame consisting of several time-frequency training ODFM (TFT-OFDM) symbols is transmitted from BS to mobile terminals. As described in Figure 1, the ith () TFT-OFDM symbol transmitted from the () antenna consists of a length- time-domain TS and a following length- OFDM data block . Unlike cyclic prefix OFDM (CP-OFDM) and zero-padding OFDM (ZP-OFDM), known pseudonoise (PN) instead of unknown CP or ZP is adopted as guard interval in TFT-OFDM to alleviate interblock interference (IBI) caused by multipath.

At receiver, the TFT-OFDM symbols transmitted from each BS antenna are mixed together at each mobile terminal so that the received TFT-OFDM symbol vector in time domain can be written as [5]where and are Toeplitz lower and upper triangular matrices with the first column vector and the first row vector , respectively. At the same time, denotes the channel impulse response (CIR) vector from the antenna to receiver with the length and is the complex additive white Gaussian noise (AWGN) vector with zero mean and the variance of .

After cyclicity reconstruction operated by the overlapping and adding (OLA) algorithm [5], the received pilot sequence of the OFDM symbol can be expressed through discrete Fourier transform (DFT) process aswhere is the DFT matrix and is the matrix consisted of the first column of , denotes the index set of subcarriers allocated to pilots whose elements are uniquely selected repeatedly from the set of , and are the submatrix or subvector by selecting the rows of or according to , , , , is the number of pilot subcarriers, and is the complex AWGN.

Wireless channels have sparse characteristics [24, 27]. The presence of scatterers and reflectors in wireless communication environment results in a multipath channel with several significantly resolvable propagation paths. Because the significant scatterers between transmitter and receiver are usually scarce, the number of the paths containing the majority of channel energy is so small that they are sparsely distributed in the resolvable paths. In other words, the gains of most paths approach to zeros except for a small number of significant paths so that the channel exhibits sparsity in delay domain [12]. The sparsity support of the sparse channel is definedwhere is the noise floor which can be determined according to [28]. The sparsity level of the channel is denoted as .

Because antennas usually share the same scattering environment, all of the channels between one user and different antennas deployed on base station share common sparsity [17]. Therefore, the nonzero entries of the CIR vector can occur in clusters so that the CIR vector can be transformed to a block-sparse CIR vectorwhere . The blocking process of CIR vector is shown in Figure 2. At the same time, the observation matrix should have corresponding changes as follows:where . Thus, equation (2) can be re-expressed as

Besides, the path delays of practical wireless channels vary much slower than the path gains as shown in many studies [12]. This slow variation results that the CIRs over consecutive TFT-OFDM symbols share the same sparsity pattern [17]. Consequently, this spatio-temporal common sparse characteristic of MIMO channel over TFT-OFDM symbols can be expressed as

Defining the spatio-temporal Common Sparse Support of aswhere and is determined as (3). For simplicity, we assume the same threshold is used over each frame. Thus, the spatio-temporal Common Sparse Level of the channel vector set is .

Because the observation matrices are different, but CIRs have the same sparse characteristics, this system model can be called a distributed system model. The structure of the distributed system model with common block-sparse channel is described in Figure 3.

3. Distributed Structured Compressive Sensing-Based Channel Estimation

In this section, we will introduce a time-frequency joint channel estimation method to obtain more accurate estimation accuracy. Firstly, time-domain training is used to obtain partial common path delay and then accurate channel estimation is achieved using the proposed PA-DS-SAMP algorithm.

3.1. Acquisition of Partial Channel Common Support

Although time-domain training cannot realize accurate channel estimation, it can be used to acquire some prior information of channels such as partial channel delay information. Using the overlap-add method, the superimposed signal with the sum of the main part and tail of received time-domain TS can be expressed aswhere and are the received signals associated with PN part and the tail part expressed as (10) and is the number of symbols in which the channel delay remains unchanged.where the measurement matrices and are determined by and which are given in [12]. Thus, we can get

The matrix is generated by cyclically shifting the basic TS and the data block interference (DBI) matrix can be denoted as

Taking advantage of the circular convolution, the coarse CIR associated with the antenna can be estimated as

In equation (13), the second part and the third part are the interference caused by nonorthogonal PN sequences and data, respectively. The data interference is mainly due to the superposition of the previous TFT-OFDM data block and the current TFT-OFDM symbol data block. Fortunately, there is no correlation between the random OFDM data block and the fixed TS sequence so that the data interference from others antennas has little effect on the delay estimation of the channel, especially in large-scale antenna systems [29]. However, the PN sequences interference from different antennas will increase linearly with the number of antennas when the cyclic matrices of the PN sequences associated with different antennas are not orthogonal with each other. To the best of our knowledge, however, there is no a set of sequences which have good autocorrelation and the cyclic matrices of which are orthogonal to each other. It is worth noting that this problem can be resolved once the same PN sequence is used for each antenna. Consequently, we have average CIR

As a perfect sequence, Zadoff–Chu sequence which has good autocorrelation and cross-correlation is widely used in the OFDM system for synchronization and channel estimation [30]. A length- Zadoff–Chu sequence is defined as [30]where is the sequence length, is an integer, and is an integer prime to . By taking the inverse discrete Fourier transform (IDFT) of a diagonal matrix which the diagonal elements comprise the Zadoff–Chu sequence , matrix is constructed with a lot of excellent correlation properties in each row vector as well. For each row sequence of the constructed matrix , the following correlation properties are held [30]:where represents a periodic cycle of length .

It has been proved that exploiting constant TS within the transmission frame is not an optimal scheme for channel tracking and changing TS can bring better channel estimation performance [5, 25]. Thus, we adopt changing time-domain training scheme which different PN sequences generated by each row sequence of for the acquisition of accurate CSI over one frame of the transmission signals. The partial common support of MIMO channels over TFT-OFDM symbols can be calculated as follows:where is the same as that in (8) an can ensure the reliability of the channel common support.

3.2. Acquisition of Accurate Path Gain Estimation

In this section, we use the frequency-domain method in combination with the partial channel common support as a priori information for accurate channel estimation.

Since the support set of CIR remains unchanged over several consecutive TFT-OFDM symbols, it can be estimated simultaneously by using multiple measurement vector (MMV) CS methods [31]. In general, MMV methods are used to solve sparse signal recovery problems in which measurement vectors are associated with the same sensing matrix, while generalized MMV (GMMV) CS methods are required for the distributed systems with different sensing matrices. The problem of sparse channel state acquisition can be formulated as the following optimization problem:

To solve this problem, the PA-DS-SAMP algorithm listed in Algorithm 1, which is developed from the DSAMP algorithm [18], is proposed.

Input:
(1)Initial support , initial channel sparsity ,
(2)Noisy measurements and sensing matrices , ,
(3)Step size selection threshold , initial step size ,
(4)Termination threshold , noise power , iteration index .
Initialization:
(1),
(2)Residual ,
(3)Increased sparsity ,
(4), .
Iterations:
(5) /correlation/
(6) /rough support obtain/
(7) /support union/
(8) /coarse estimate/
(9) /full support/
(10) /support pruning/
(11) /block estimate/
(12) /residue update/
(13) /minimum power index/
(14)If then quit
(15)Else if then
(16)    if then
(17)      /step adaptation/
(18)    Else , /sparsity level update/
(19)    End if
(20)Else /iteration index update/
(21)end
(22)Until
(23)
Output: Estimation of structured channel impulse response , .

The initialization of the PA-DS-SAMP algorithm is implemented in Steps 1∼3. In Step 5, the correlations of the residuals with the column vector of sensing matrices are calculated over each TFT-OFDM symbol and the indexes of the largest nonzero entries are selected in Step 6, where represents the indexes of maximum values in the set and and presents basic sparsity and increased sparsity, respectively. Steps 7∼10 are the support pruning for more precise sparsity support. Note that Step 9 does not guarantee that the basic support is fully contained so that Step 10 is needed to refine the effective support and ensure entries in the final support. Step 11 estimates associated with the effective support using LS, and Step 12 calculate the residuals over each symbol. The index of the minimum power in the block of is found in Step 13, and the iteration is terminated if the condition that the average energy of signal is less than a certain threshold holds, which is expressed in Step 14. Steps 15∼21 are the step adaptive process, and the advantages of this approach will be explained below, where denotes upper bound integer operation. The whole iteration will be terminated if the residual average energy is less than a certain threshold given in Step 22. Finally, the estimation of structured channel impulse response is obtained in Step 23.

Compared with DSAMP algorithm [18] and the classic SAMP algorithm [32], the proposed PA-DS-SAMP algorithm has four distinctive features:(i)Since the signal to be estimated has block structure, the structured sparsity is exploited in the proposed PA-DS-SAMP algorithm for block signal recovery. However, this useful signal feature is not used in either DSAMP algorithm or SAMP algorithm.(ii)In the PA-DS-SAMP algorithm, the sparsity support consists of basic support and additional support. Only additional support is updated while the basic support does not change in each iteration operation. The reason is that the basic support achieved by time-domain training has high reliability.(iii)The third difference is that the step size can be adaptively adjusted in PA-DS-SAMP algorithm, which has no discussion in DSAMP algorithm. Due to the tradeoff between recovery speed and recovery efficiency [33], an adaptive step size selection scheme is utilized which is based on the experiences that large step size is preferable for signal with flat magnitude and small step size is suitable for signal with fast magnitude attenuation.(iv)Finally, we give the termination threshold of the algorithm through strict mathematical derivation, which plays an important role in the performance of the algorithm.

Remark 1. After removing the large gain taps of CIR by using a priori information of channel delay, the remaining tap amplitudes have not much difference so that large step size is suitable. However, with large magnitude taps being reconstructed after a few stages of the algorithm, the number of taps to be reconstructed is less and less. In other words, the energy of the reconstructed CIR approaches to remain stable and the estimated sparsity level is close to the true sparsity level. Therefore, in order to better determine the more accurate sparseness, the step should be gradually reduced. According to this behavior, a large step size is set at the several stages of beginning in PA-DS-SAMP algorithm to expedite the convergence. Then, the step size decreases adaptively to provide fine tuning in following stages when the change of the reconstructed signal is less than a certain threshold.

4. Performance Analysis

In this part, we discuss the performance of the proposed algorithm. Firstly, we discuss the convergence of the algorithm, getting the convergence condition and termination threshold of the algorithm. Secondly, pilot design for sensing matrices is discussed. Then, the computational complexity of proposed algorithm is compared with OMP, SOMP, DSAMP algorithm, DS-SAMP algorithm, and PA-SOMP algorithm. Finally, pilot overhead for CSI acquisition is obtained. Before performance analysis, we first introduce two important definitions and two useful lemmas, which will play a key role in the subsequent analysis and discussion.

Definition 1 [34]. Block-Coherence of is defined aswhere and is the size of the block.

Definition 2 [34]. Subcoherence of is defined asIf , then .

Lemma 1. Let be a matrix and be the block of , thenLemma 1 is obtained directly from Lemma 1 [34].

Lemma 2. [35]. Suppose that the noise vector follows Gaussian distribution ; if bounded set is satisfied, then we have

4.1. Convergence Analysis of Proposed PA-DS-SAMP Algorithm

The PA-DS-SAMP algorithm begins with the residual initialization as , and the significant part is to choose at least one correct column block of the sensing matrix in each iteration. At the stage (), the block best matched to is chosen according to . Suppose denotes the support set of the signal and denotes the complementary set of . Theorem 1 reveals the convergence conditions of the proposed algorithm in the absence of noise. We first discuss the noise-free case, then discuss power-limited noise , and finally extend it to Gaussian white noise case .

Theorem 1. For block-sparse signals with the blocks length of which satisfy , a sufficient condition for the PA-DS-SAMP algorithm to recover is that

Proof. See Appendix A.

The sufficient condition (26) depends on which is determined by the nonzero position of the signal , but the nonzero position of the is not known in advance. Therefore, Theorem 1 is not practical. Following theorem under certain conditions on Block Coherence and Subcoherence associated with can make sure the (A.1) holds.

Theorem 2. Sufficient condition (26) is satisfied if for every , there is

The proof of the theorem can refer to the literature [34]. Based on Theorem 1 and Theorem 2, we will discuss the condition in which the proposed algorithm can select the correct atom for each step in the case of power-limited noise.

Theorem 3. Suppose that observed noise follows additive white Gaussian noise ; if the remaining energy of the signal satisfy , the proposed algorithm with the stopping rule selects the correct atom with probability at least , where .

Proof. See Appendix B.

Remark 2. Theorem 3 gives the double threshold condition for the signal to be fully recovered according to a certain probability. Although noise power and empirical threshold involved in signal power are usually used as terminated conditions, these thresholds are somewhat conservative. When the signal-to-noise ratio is low, it leads to the incorrect supports. Theorem 3 can provide a degree of quantification of correct recovery so as to guide the choice of thresholds.

4.2. Pilot Design Discussion for Sensing Matrices

According to (2), the design of sensing matrices is related to pilot sequences and pilot placement . In CS theory, RIP is treated as a sufficient condition that the sparse signal can be reliably and stably recovered. However, RIP of a sensing matrix is so difficult to calculate that mutual incoherence property (MIP) which is stronger than RIP is widely used as a framework for sparse signal recovery. Besides, from Theorem 2, we can see that the MIP of sensing matrix is directly related to the termination threshold of the proposed algorithm. The smaller the and the , the lower the threshold of the recoverable signal and the more accurate the signal recovery. Therefore, our purpose is to design sensing matrices with good cross-correlation.

Using the Central Limit Theorem, Gao [17] proposed a constant envelope complex exponential random phase pilot , where has the independent and identically distributed (i.i.d.) uniform distribution , and proved that the pilot vectors between different antennas and different OFDM symbols have asymptotic orthogonality when pilots are placed at equal intervals.where , , and is the index of the set . The equation takes advantage of independent condition between and . When , we have ; otherwise, .

Therefore, when the pilot number is large enough, sensing matrices will have good cross-correlation. As discussed below, the number of pilots is proportional to the number of antennas. So, for massive MIMO systems, the column vectors of the sensing matrices satisfy approximately orthogonality. Besides, according to Theorem 2, we have inequality , which means that smaller and bring a larger number of antenna.

4.3. Computational Complexity of the DS-SAMP Algorithm

The computational complexity of the algorithm is analyzed in this section. Firstly, the computational complexity of each iteration in each step is given in Table 1.

Obviously, among the steps of the PA-DS-SAMP algorithm, matrix inversion operated for least squares (LS) estimation contributes the dominant computational complexity. Gao [18] compared the number of complex multiplications of OMP, SAMP, and DSAMP algorithms in each iteration. Compared with these four algorithms, it is clear that the DS-SAMP algorithm almost has the same computational complexity due to almost the same calculation steps they have. However, with a priori common support, the number of iterations of the proposed algorithm will be greatly reduced so that the total amount of calculation will be significantly smaller.

4.4. Spectrum Efficiency and Energy Efficiency

Block-structured processing of signals can reduce pilot overhead, and the minimum pilot overhead will be derived below.

Theorem 4. [31]. For , , whose elements obey an i.i.d. continuous distribution, the minimization problem will have the unique solution ifwhere , is a full rank matrix satisfying and is the common support of . After coarse CIR estimation, system equation becomeswhere denotes the difference set between and . Due to the temporal common sparsity, the sparsity of the signal that needs to be recovered becomes and the equation (29) can be transformed into . Besides, we know that and ; thus, we have

In addition, in order to recovery the signal with known support, pilots are needed. Therefore, the total number of pilots needs to meet the following condition:

Subsequently, we can get the smallest required pilot overhead which is less than the minimum pilot overhead required for unstructured CS methods.

Due to the overhead caused by the time-domain guard interval and the frequency-domain pilots, the spectral efficient of the proposed method normalized by the ideal case without any overhead [5, 13] can be expressed in the percentage notation as

Table 2 compares the spectral efficiency of several commonly used algorithms [5], where the typical wireless digital television system with the total subcarriers number is adopted. Besides, the channel model with six resolvable paths is defined by ITU, which means that the sparsity . Without loss of generality, the guard interval length is and the number TFT-OFDM symbols in a frame is . Suppose the initial sparsity is ; for large-scale MIMO system, the spectral efficient can be calculated as . As can be seen from Table 2, the proposed method has the highest spectral efficiency compared with other methods.

Besides, the PN sequence power and pilot power are boosted to achieve more reliable channel estimation in the TFT-OFDM scheme. Therefore, the energy efficiency can be expressed in the percentage notation as [12]where and denote the amplitude factor imposed on the time-domain NP and frequency-domain pilots, respectively. Generally, and have been specified such as DVB-T2 standard and DTMB standard. Therefore, energy efficiency can be calculated as . Table 3 summarizes the energy efficiency comparison for different OFDM schemes. It is clear that the proposed method has the highest energy efficiency. The reason is that less pilot overhead leads to higher energy efficiency.

5. Simulation Results

In this section, in order to verify the effectiveness of the proposed algorithm, we perform performance analysis by comparing the proposed algorithm with other seven schemes: OMP, simultaneous OMP (SOMP), distributed SAMP (DSAMP) [18], and priori information aided SOMP (PA-SOMP) [22] through computer simulation. Simulation system is configured according to the most commonly used wireless broadcasting systems with antennas and the parameters are set as centric carrier frequency , signal sampling frequency bandwidth , Doppler shift , DFT size , and guard interval length . The percentage of pilot overhead can be calculated as . The typical multiple channels with 6-taps named ITU B is used to evaluate system performance and the specific parameters can be referred as literature [12].

Figure 4 shows the mean square error (MSE) performance comparison of five channel estimation methods for the massive MIMO-OFDM system where per frame of the transmitted signal contains 10 TFT-OFDM symbols. The Cramer–Rao Lower Bound (CRLB) is used as a benchmark for comparison. The number of pilots is 600, that is, the percentage of pilot overhead accounts for 14.6 percent. It is clear that the proposed algorithm has 1.5∼2 dB signal-to-noise ratio advantage for other methods when the MSE is 10−1 and OMP algorithm has the worst performance. However, when the MSE is less than 10−2, the performance of several algorithms tends to be the same. The reason is that those algorithms can choose the correct support when the signal-to-noise ratio is high. Therefore, we can conclude that the proposed algorithm has great advantages in the case of low and medium signal-to-noise ratio (SNR).

Figure 5 shows the bit error rate (BER) of each algorithm for the massive MIMO-OFDM system. Zero-forcing (ZF) equalizer is adopted at mobile terminals for signal detection by using the estimated channel state information. Modulation scheme adopts quadrature amplitude modulation (QAM), and Gray code is used for source code. The BER with ideal CSI is plotted as the benchmark. Although the bit error rate curve of several algorithms is relatively close to each other, it can be seen from the performance curves in the subgraph that the proposed algorithm has the best bit error rate compared to other algorithms. This result is consistent with the analysis results in Figure 4.

To further evaluate the performance of the proposed method, Figures 6 and 7 show the MSE performance and BER performance with the variety of the number of measurements under the fixed SNR of 20 dB, respectively. As the number of pilots increases, it can be observed that the MSE curve of the proposed algorithm is closer to the MSE curve of CRLB. This means that the proposed algorithm can get the right support due to signal structured processing. The proposed algorithm can achieve an MSE accuracy of with the pilot overhead, while SOMP and PA-SOMP algorithms require pilot overhead of at least and DSAMP needs more. As expected, OMP algorithm has the worst estimation performance consistent with the previous analysis. From the performance curves of the figures and the analysis above, we conclude that the proposed algorithm requires fewer pilots overhead at the same estimation accuracy.

In Figure 8, the correct signal recovery probabilities which are functions of the pilot overhead are drawn to evaluate the performance of the five methods. Due to the presence of additive white noise in the measurement signal, the correct recovery of the signal is probabilistic. For more convenient discussion, the recovery probability of signal is regarded as 1 when the MSE of the signal estimation is lower than . It is clear that the proposed method can accurately recover the signal when the pilot overhead is , while the DSAMP, SOMP, and PA-SOMP algorithms need up to , , and pilot overhead, respectively. This result is consistent with the performance analysis in Section 4. Therefore, compared to other algorithms, the proposed method can reduce about to pilot overhead so that spectral efficiency is improved.

6. Conclusions

In this paper, we proposed a time-frequency joint channel estimation scheme for massive MIMO-OFDM systems. First, partial channel common support was achieved by using changing time-domain training sequence. In order to obtain accurate CSI, we proposed a priori information aided distributed structured sparsity adaptive matching pursuit (PA-DS-SAMP) channel estimation algorithm by using obtained common path delay information. Then, we analyzed the performance of the proposed algorithm, including the convergence analysis, pilot design, computational complexity, and spectrum and energy efficiencies. Through convergence analysis, we gave two signal power thresholds which can ensure signal is fully recovered and recovered fully based on probability under the power-limited noise and the Gaussian white noise cases, respectively. In addition, pilot design results showed that the pilot sequences of different antennas tend to be orthogonal with the increase in number of antennas. Furthermore, the computational cost of the proposed algorithm is significantly reduced, and the spectrum and energy efficiencies are improved. Experimental simulation showed that compared with other algorithms, the proposed algorithm not only increases the estimation accuracy but also greatly reduces the pilot overhead.

Appendix

A. Proof of Theorem 1

Assume that the next chosen indexes will at least contain a block in if is in . This is equivalent to the following inequality:

Due to , where , , and , inequality (A.1) can be converted to the following form:and it also can be expressed as the following form:

As a generalization of the matrix norm, the mixed matrix norm is defined as . According to the properties of the pseudoinverse, is the orthogonal projector onto . Thus, it holds that . Since is the Hermitian matrix, the following equation holds . So, we have

The first inequality is obtained by using the definition of mixed matrix norm , and the second inequality is obtained by Lemma 1. Since and , it results in a block diagonal matrix so that formula (A.4) can be transformed into

Thus, if holds, PA-DS-SAMP algorithm can pick up a correct new block in each step. If the correct block can be selected for each iteration, the recursive refining process of the estimate of support set, which includes adding the correct block and removing the wrong block, will lead to estimated subspaces with strictly decreasing distance from the measurement vector so that the residuals will be decreasing. After a limited iteration, the algorithm reaches convergence.

B. Proof of Theorem 2

By using orthogonal projection matrix , we have

Let , , and , then , where and .

The left side and right side of equation (A.2) are changed into

Thus, in Step 6, the sufficient condition that at least one correct atom should be selected is . Besides,

The last equation is obtained by the projection theorem. Thus, the sufficient condition for to be hold is and we have

According to [34], there is inequality (B.5) established:

Thus, we have

Substituting inequality (B.7) into inequality (B.5), we have a sufficient condition that (A.2) is hold, which means that at least one correct atom should be selected in the current cycle

Meanwhile, we have

The second inequality is obtained by using Lemma 5 in [35]. According to Gershgorin circle theorem, we have . Besides, ; thus, we have

Combining formulas (B.8) and (B.9), the sufficient condition for selecting at least one correct atom in the current step is

Thus, we have

On the other hand, there areand . Therefore, we have

This means that if the remaining signal energy is large enough, the algorithm will choose a correct atom at this step. For conservation, we get the next result that ifholds, then the correct atom will be selected in the next step. When all atoms are properly selected, the lowest energy of the selected atoms is .

According to lemma 2, we have . This means that, if (B.14) is hold, the proposed algorithm with the stopping rule selects the correct atom with probability at least .

Abbreviations

MIMO:Multiple-input multiple-output
OFDM:Orthogonal frequency division multiplexing
CSI:Channel state information
PCCS:Partial channel common support
SE:Spectrum efficiency
EE:Energy efficiency
5G:Fifth generation
CCMs:Channel covariance matrices
CS:Compressive sensing
BS:Base station
SVD:Singular value decomposition
EVD:Eigenvalue decomposition
TTCE:Time-domain training channel estimation
FTCE:Frequency-domain channel estimation
TFTCE:Time-frequency joint training channel estimation
TS:Training sequence
RIP:Restricted isometry property
MIP:Mutual incoherence property
MMV:Multimeasurement vector
AWGN:Additive white Gaussian noise
IBI:Interblock interference
DBI:Data block interference
PN:Pseudo noise
CP-OFDM:Cyclic prefix OFDM
ZP-OFDM:Zero padding OFDM
TFT-OFDM:Time-frequency training ODFM
CIR:Channel impulse response
LS:Least squares
OLA:Overlapping and adding
DFT:Discrete Fourier transform
IDFT:Inverse discrete Fourier transform
MMV:Multiple measurement vectors
GMMV:Generalized MMV
OMP:Orthogonal matching pursuit
PA-SOMP:Priori information aided SOMP
SOMP:Simultaneous OMP
SAMP:Sparsity adaptive matching pursuit
DSAMP:Distributed SAMP
DS-SAMP:Distributed structured SAMP
PA-DS-SAMP:Priori information aided distributed structured SAMP
SNR:Signal-to-noise ratio
ZF:Zero-forcing
QAM:Quadrature amplitude modulation
BER:Bit error rate
MSE:Mean square error
CRLB:Cramer–Rao lower bound.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant no. 61571364), Innovation Foundation for Doctoral Dissertation of Northwestern Polytechnical University (Grant no. CX201833), and Shaanxi Provincial Natural Science Foundation (Grant no. 2017JM6073).