Abstract

This paper presents a novel iterative detection and channel estimation scheme that combines the effort of superimposed training (ST) and pilot-aided training (PAT) for multiple-input multiple-output (MIMO) flat fading channels. The proposed method, hereafter known as joint mean removal ST and PAT (MRST-PAT), implements an iterative detection and channel estimation that achieves the performance of data-dependent ST (DDST) algorithm, with the difference that the data arithmetic cyclic mean is estimated and removed from data at the receiver’s end. It is demonstrated that this iterative and cooperative detection and channel estimator algorithm surpasses the effects of data detection identifiability condition that DDST has shown when higher orders of modulation are used. Theoretical performance of the MRST-PAT scheme is provided and corroborated by numerical simulations. In addition, the performance comparison between the proposed method and different MIMO channel estimation techniques is analyzed. The joint effort between ST and PAT shows that MRST-PAT is a solid candidate in communications systems for multiamplitude constellations in Rayleigh fading channels, while achieving high-throughput data rates with manageable complexity and bit-error rate (BER) as a figure of merit.

1. Introduction

Estimation theory deals with the basic problem of inferring a set of required statistical parameters of a random experiment based on the observation of its outcome. It is assumed that it is possible to produce an effect in the experiment by means of a controlled excitation signal. This approach is normally adopted in practical communications systems where channel estimation is an essential part of standard receiver designs [1] and carried out by transmitting training symbols commonly known as pilot symbols [2]. In this case, the random experiment can be seen as an unknown system that is identified by observing how the system reacts to the applied excitation or training signal.

Traditionally, the most widely used channel estimation technique is pilot-assisted training where the pilots are multiplexed in time or frequency. This is widely known in the literature as pilot symbol assisted modulation (PSAM) [2] and is denominated as pilot-assisted transmission (PAT) [3]. This scheme employs a nonrandom training pilot sequence known a priori by the transmitter and the receiver. The training pilots are periodically inserted into certain positions in the time (frequency) with the information-bearing symbols, before modulation and transmission. Using the knowledge of the training symbols and the corresponding received signal, the channel estimation block at the receiver is able to make an estimate of the channel impulse response (CIR). Conventional PAT-based channel estimation methods use pilot symbols in time-division multiplexing (TDM) schemes, thus decreasing the effective data transmission rate.

Recently, an alternative channel estimation strategy that circumvents the unwanted effect of data rate reduction has emerged, called implicit training (IT) [4]. Two outstanding IT approaches, known as superimposed training (ST) [5, 6] and data-dependent ST (DDST) [7, 8], achieve higher effective data rates with manageable complexity [9]. These techniques are based on a training sequence added (superimposed) to the information-bearing symbols. Both schemes provide a simple (unsophisticated) channel estimation process [10, 11] and differ only in that the arithmetic mean of the transmitted data of the DDST scheme is superimposed onto the transmitted sequence. Both techniques have been successfully applied to single-input single-output (SISO), as well as multiple-input multiple-output (MIMO) systems [1117], combined with orthogonal frequency division multiplexing (OFDM) modulations [1820], and time-varying channels [2123]. Likewise, there are interesting alternatives to OFDM in [24, 25], where the single-carrier modulation approach is employed, using a joint frequency domain equalization and channel estimation.

Although DDST outperforms ST [12] in terms of channel estimation error, it is worth mentioning that the data decoding under DDST is of an iterative nature, as it needs to remove the data-dependent distortion. Furthermore, DDST [8] performs similarly to TDM-based channel estimation, while saving the overhead in TDM data rate due to pilot transmission. There are, however, some drawbacks that must be taken into consideration when DDST is used. First, this technique introduces a delay in the transmitted data when it calculates the data-dependent signal; second, it assigns less transmission power to the data signal; hence, the symbol demapping operation is not suitable for higher orders of modulation due to identifiability problems, as was highlighted in [26]. This is an important issue, because recent communication standards consider this type of modulation; for example, WiMAX (IEEE 802.16e-2005 standard) uses 64-QAM digital modulation that DDST cannot support. In addition, DDST presents two more drawbacks: its performance is highly sensitive to the data block length used as well as the increased peak-power and peak-to-average ratio (PAPR) in the transmitted signal, as shown in [27].

This paper proposes an iterative detection and channel estimation structure for MIMO systems, which is capable of dealing with the data identifiability problem previously described. The novel algorithm is based on the use of joint (fusion of both techniques) implicit and explicit training for the channel estimation and symbol detection processes. Previous results of the authors related to implicit training for MIMO systems were presented in [28], which introduced the mean removal ST (MRST) technique devoted to estimate the arithmetic cycling mean of the data block signal at the receiver side (instead of at the transmitter side like DDST does). However, later on it was recognized that MRST is not able to deal with the identifiability problem, just like DDST, when the MIMO system employs high-order digital modulation schemes.

The proposal of an iterative detection and channel estimation scheme is hereafter referred to as joint mean removal ST and PAT (MRST-PAT). The method is based on a reliable preliminary channel estimate using PAT with a small number of dedicated pilot symbols. Subsequently, it uses the time-average estimator with the ST signal added to each transmitted data block, achieving an improvement in system throughput. Thus, MRST-PAT merges the best qualities of the two techniques, achieving a better BER performance than if employed separately. In addition, the proposed method eliminates the data identifiability difficulties that DDST exhibits when higher orders of modulation are used [26].

This paper is organized as follows: Section 2 presents the space-time MIMO signal model. Section 3 summarizes the fundamental theory and performance of PAT (TDM), ST, and DDST channel estimation techniques. Section 4 details the structure of the iterative MIMO detection and channel estimation joint MRST-PAT and its theoretical performance analysis. Section 5 depicts the application of the iterative joint MRST-PAT using a 2-OSTBC coder. Section 6 considers the numerical simulation results and performance analysis for the training-based frequency-flat block-fading MIMO channel estimation using TDM, DDST, and MRST-PAT. Finally, some concluding remarks in Section 7 close this paper.

Notation. Bold letters in lower (upper) case are used to denote vector (matrices); stands for the complex number field; and represent transpose and conjugate transpose, respectively; denotes identity matrix; and represent Euclidean norm and Frobenius norm, respectively; denotes trace of a matrix; stands for Kronecker product; and denotes a multidimensional complex Gaussian distribution with mean and covariance matrix .

2. Space-Time Signal Model

This section describes the space-time signal model applicable to most existing space-time coding designs such as Vertical Bell Labs Layered Space-Time (VBLAST) [29] and the generalized schemes referred to as space-time block codes from orthogonal designs [30].

With reference to Figure 1, this paper considers a MIMO single-carrier system operating over frequency-flat quasistatic block-fading channel model, with transmit and receive antennas described bywhere is the transmitted vector comprising elements denoted as and is the received vector.

We use a random matrix that represents the flat fading channel in a baseband equivalent model, where each element is generated as an independent and identically distributed (i.i.d.) circularly symmetric complex Gaussian random variable . In addition, is an additive white circularly symmetric complex Gaussian noise vector, with zero mean and variance per real and imaginary dimension.

Let us assume the block transmission scheme, where a “block” is defined as a single transmission burst. The channel matrix is assumed to be constant for channel uses and then changes to an independent realization for the next block [31]. Here, denotes the block length. For any transmitted signal matrix , the expression in (1) can be written aswhereare the matrices of the received signals, transmitted signals, and noise, respectively.

The transmit symbol vector, prior to space-time coding, of size is denoted bywhere and denotes a normalized (average symbol energy of is assumed to be one) signal constellation to which information symbols belong.

The vector of information symbols is fed into a space-time encoder, which maps (one-to-one mapping) this vector to code matrixassociated with the space-time scheme in use. Therefore, the space-time code data rate, in symbols per channel use, is given by . Additionally, the transmitted signal is formed by using concatenated code matrices, where the block length is a positive integer multiple of .

Particularly, complex matrix-valued function is called an OSTBC if it satisfies the following conditions [30]:(a)All entries of are linear functions of complex random variables and their complex conjugates.(b)For any arbitrary in ,

3. Channel Estimation Techniques

Consider the PAT diagram in Figure 1. During the transmission of one block of information, the data and pilot symbols, depicted by white and dark boxes, respectively, can be transmitted separately or combined in different ways. For the purpose of channel estimation, the key to unifying various schemes is to view the problem of training design as one of power allocation [3].

Conventionally, a PAT scheme where some known symbols (pilots) are time-multiplexed (TDM) with the information-bearing data is known as a conventional pilot- (CP-) based MIMO system [32]. This transmission scheme is shown in Figure 2 and analyzed in the following subsection. In this case, pilot symbols are known at the receiver and can be used for channel estimation. Since pilot symbols carry no data information, the time and the power spent on pilot symbols degrade the throughput of the system. If more frequent pilot symbols are transmitted, then a better channel estimation is achieved. However, the PAT transceiver shown in Figure 1 suggests another scheme for solving the problem, where a training sequence is superimposed onto the data. This technique, known as superimposed pilots (SIP) in [32], was introduced in [5] and has been developed extensively under different modalities such as ST in [33], implicit training in [4, 6], and DDST in [7].

It should be emphasized that, in the following calculations, is assumed to be random. The estimation process, however, will obtain estimates of the parameters of a particular realization of that corresponds to the current data block. We consider the channel estimation procedure applied to TDM, ST, DDST, and MRST-PAT schemes, starting from the least squares (LS) analysis proposed in [34].

3.1. PAT (TDM) Case

With reference to Figure 2, let us consider a PAT transceiver under TDM modality with . In this scenario, is the number of pilot symbols transmitted in a given transmission interval (block). matrix represents the training signal transmitted at the beginning of each block of information (preamble), and matrix represents the information-bearing data of length per transmit antenna. In this scheme, the data and training symbols have the same average power defined as and ; that is, . It is clear that during the training stage, the transmission power is allocated to the pilot symbols; that is, , where denotes the average power of the transmitted symbol.

It is important to note that the TDM-based channel estimation algorithm employs only the pilot signals. It follows that the noise contribution and the received signal span during channel uses. In this case, and have dimensions .

Assuming that has full row-rank, knowing the received signal , and using the LS approach as is developed in [34, 35], then the realization of channel matrix can be estimated aswhere is the pseudoinverse of . Constraining the transmitted power during the training interval, the following is obtained:

Let us find that minimizes the channel estimation error and satisfies condition (8). This is equivalent to the following optimization problem:Using (2) and (7), we have and, therefore, the objective function can be written asUsing with representing the receiver noise power, then (10) can be rewritten as

From (11), the optimization problem shown by (9) can be equivalently reformulated asThe choice of must be such that is nonsingular, which implies that . Furthermore, any training matrix is optimal in the sense of (12) if it satisfiesIn other words, any matrix with orthogonal rows of the same norm is optimal [35].

For optimal training that satisfies (13), the LS channel estimate (7) yieldsthat is, the estimation error is .

From (11) and (13), it follows that the channel estimation error under the MSE criterion and optimal training is given byFinally, using the constraint indicated in (8), for the TDM-based estimate, the channel estimation error can be expressed as

3.2. Superimposed Training Case

Let us consider Figure 1 with an ST-based channel estimation algorithm. In comparison with TDM-based channel estimation, the ST scheme has throughout the transmission, and the training matrix is matrix; that is, . For this reason this scheme is called overlay pilot in [32]. It follows that (2) can be described as

Accordingly, the least square (LS) channel estimate can be obtained by treating as an extra additive noise term for channel estimation purposes. Even though there are an infinite number of matrices that satisfy the optimum condition of (12), the goal is to select a training matrix that makes the ST technique computationally attractive.

Under frequency-flat fading conditions, the optimal length of the training interval is [36]. Hence the training matrix may be a block column matrix that can be constructed by concatenated replicas of matrix, where , and the block length is chosen to be an integer multiple of the number of transmit antennas Then, the training matrix can be written aswhere is the unit vector and the block training pattern is matrix, where their rows are orthogonal to each other, and it satisfies the constraint .

The training sequence at th transmit antenna is given bywith and . Another way to build the training sequence is by shaping matrix as a circulant matrix whose first row entries are given bywhere , with when is even, and if is odd [6]. The Walsh-Hadamard matrices are other attractive options.

The benefit of using any of these training patterns is that a time-domain estimator based on the synchronized averaging of the received signal can be implemented, as in the SISO case described in [12].

Let be matrix, which defines the time-average of the received signal . We assume that the channel remains constant over the entire block length of time . Specifically, th entry of is given bywhere and .

It follows that can be written aswhere is matrix constructed by concatenated replicas of identity matrix and is given by

From (17) and (22), can be reformulated aswhere is matrix and is matrix that represent the time-average of the data and noise , respectively, for every block transmitted. Moreover, . Since the training matrix is known at the receiver, the channel matrix realization can be estimated asFrom (24) and (25), the channel estimation error is given by

Since is an orthogonal matrix that satisfies then . Therefore, using this result in (26), the channel estimate (for the ST case) can be expressed as

The MSE of the ST case can be found in [11]. It may be inferred from (27) that represents an extra term for the channel estimate. This time-average term is exploited by the following method.

3.3. Data-Dependent ST Case

In DDST, the signal is used to define the perturbation matrixwhich will be arithmetically added to the data signal , in every transmitted block. Hence the transmitted signal is determined by , and the corresponding received signal is given by

It is important to stress that the term will be nonexistent at the receiver end, because it was removed at the transmitter side when the signal was added. Hence, using (25) and (27), the LS time-domain channel estimator based on the synchronized averaging of the received signal is given bySubsequently, the mean-square channel estimation error under optimal training with is given byUsing , the DDST mean-square estimation error may be expressed asby replacing the inverse of , and , thenwhich is finally reduced to

In order to have the same estimation error performance between the DDST method and the TDM-based estimate, we compare (16) and (34), and the resulting relation is given bySimilar expressions for (34) and (35) are obtained using another procedure in [11].

In DDST, the data-dependent term has the purpose of removing the dependency on data; therefore, certain frequency components are removed at the transmitter and data symbols may be distorted. This data distortion becomes a problem at the receiver when its value is commensurable with the Euclidean distance between the symbols of the constellation used. It is more significant as the order of modulation increases, at fixed transmission power. In this case, the receiver would not be able to identify the signals from the constellation. This problem was highlighted and designated the data identifiability problem in [26]. In preliminary DDST works, such as [7, 8], the data distortion is removed in the receiver using an iterative method that performs effectively for low orders of modulations. Recently in [26], another iterative removal scheme is proposed for high orders of modulations. In what follows, we will show a hybrid training method that overcomes this impairment by combining ST and PAT.

4. Iterative Joint MRST-PAT

Even though the DDST channel estimate is better (according to the LS criterion) than the ST estimate, the DDST technique presents several disadvantages compared to ST, as mentioned in the Introduction. Here, we want to stress only its main drawbacks: it cannot work with high-order constellations, and it is highly sensitive to block length. For these main reasons, it is desirable to find ST improvements that do not present these disadvantages.

The MRST scheme in [28, 37] was developed from the following two points: (i) the fact that the difference between ST (27) and DDST (30) channel estimation techniques is the factor and (ii) the hypothesis that if we could obtain an estimate of the signal at the receiver end, we would achieve the performance of DDST in terms of channel estimate MSE. However, as more power is allocated to data signals, BER performance should increase.

Assuming this hypothesis, the signal can be approached as with , where represents the data estimate computed at the receiver. Then, using (24) and (25), the MRST-PAT channel estimate is given byit follows that if then and, subsequently, .

As its name indicates, the MRST algorithm has the goal of removing the mean of the data-bearing signal at the receiver in an iterative way. The structure of this algorithm offers another benefit.

The initial channel estimation may be performed in one of several ways: employing the ST scheme, designating a few pilots according to [36] at the beginning of each block transmitted in the TDM scheme, or using the expectation maximization (EM) algorithm with iterative decoding strategies [38].

In order to make a reliable estimate () of , we consider the joint effort of PAT (TDM) and ST techniques. This is illustrated in Figure 3, where represents matrix of pilot symbols. The pilots are considered to have a preliminary channel estimate and consequently an initial estimate, , of ; then the superimposed training signal is used in the refinement phase to improve upon the initial estimate. The following steps describe the iterative detection and channel estimation procedure of the joint MRST-PAT technique:(1)Set and compute . In this case (2)Set the number of iterations denoted by for the ST phase.(3)Use (14) to obtain a preliminary channel estimate using the pilots in the TDM case.(4)Use the channel estimated to obtain , which represents an estimate of the transmitted symbols.(5)If , compute the signal with , else end.(6)Compute the time-average of the received signal and make .(7)Remove from the received signal to obtain new according to(8)Use (36) with to obtain the channel estimate .(9)Decrease the value of .(10)Go to step .

5. MRST-PAT for Alamouti Space-Time Coding

In this section, we will show the use of the proposed MRST-PAT channel estimation method with 2-OSTBC system (Alamouti scheme), using and . This implementation assumes and to be even, so transmitted matrix is given by . In extended formThe matrix’s rows and columns indicate the transmit antenna and the time slot (channel use), respectively.

pilot matrix is given byThe entries of the pilot matrix are given by the Walsh-Hadamard bipolar spreading sequences [39]. The Walsh-Hadamard sequences of length , with and , are often defined using Hadamard matrices [2] with The resulting matrices are orthogonal matrices; that is, for every we havewhere is the -dimensional identity matrix. Additionally, the structure of assures that the rows (or columns) of a Hadamard matrix are mutually orthogonal.

data-bearing matrix is given byIn this case, the matrix has a dimension of and therefore is a matrix.

Assuming a frequency-flat fading scenario [36], matrix can be composed of two symbols , and two other symbols generated through the Alamouti scheme, exploiting the property of orthogonality. In this wayLikewise, ST matrix is given byand therefore the two row vectors of matrix are orthogonal.

The initial channel estimate (step of the iterative algorithm) is performed using the symbol pilot matrix with the TDM scheme. An improved estimate of the channel matrix will be made using the ST time-domain estimator pursuant to the iterative algorithm described in Section 4.

Let us estimate the channel matrix . For the sake of simplicity, we restricted the analysis to a noise-free scenario. From (24), the time-average of the received signal is given bywhere is the column-wise vectorization operator. Subsequently, the expression in (45) can be reformulated aswith

An estimate of can be used for the purpose of avoiding the computational complexity of in step ; therefore,

It is important to note that is an orthogonal matrix. Consequently, the channel estimate matrix is given byand, finally,where and are and identity matrices, respectively. Matrix represents a new MIMO channel estimate, which is used in the iterative procedure to obtain a new estimate of the arithmetic mean of the data-bearing.

The key idea of this procedure is the removal of the mean at the receiver. Instead of subtracting the mean from the time-average received matrix , this MRST-PAT implementation incorporates the estimated mean in the matrix to obtain an estimate of the channel.

6. Simulation Results

The following section presents the performance of the proposed channel estimation scheme in a frequency-flat fading scenario. A 2-OSTBC system with antennas was used. This simulation scenario was also employed in [28]. The block length was fixed to symbols, unless another value is indicated, and all simulations were run until 1000 block errors were found. The BER is represented as a function of the average SNR, where and is the average symbol energy. QPSK and -ary QAM () constellations were used with Gray-coded symbols, and the transmitted power level for each antenna was normalized to one; for example, (Watt).

To test the channel estimation techniques, we use as a figure of merit the MSE of channel estimate. Figure 4 plots this metric for DDST and MRST-PAT using QPSK. Both of these use of the transmitted power . This power level was chosen using the values of previous works such as [6, 15, 28].

The initial channel estimation for the MRST-PAT scheme was implemented using pilots per block transmitted (remember that an Alamouti space-time coding scheme is considered); hence, the overhead associated with the pilots is , which is . For the ST stage, symbols are used.

Through an analysis of this performance, it can be noted that the proposed scheme approaches DDST for SNR values higher than 10 dB, as was anticipated. However, it is worth highlighting that although the MSE channel estimate in MRST-PAT is not close to DDST at low SNR levels, it is good enough to provide a close BER performance, which is the ultimate metric for system comparison.

Before considering the BER performance, let us discuss Figure 5, which shows the MSE versus power fraction for training signal, with the SNR fixed at 15 dB and QPSK signals. Three different channel estimation schemes (TDM, DDST, and MRST-PAT) were compared. If , with and , then the three schemes have the same MSE performance. It can be observed that the proposed MRST-PAT scheme follows DDST for all power levels of . The selected SNR value is based on [11], because it allows a suitable BER.

The BER performance of this MIMO system working with QPSK signals is shown in Figure 6. It can be seen that the TDM scheme achieves the best BER performance at the cost of 17% of data rate overhead due to pilot transmission, while MRST-PAT, with 1.56% of overhead, for example, , achieves very close to DDST BER performance.

In this scenario, DDST matches the relation described in (35) for the purpose of having the same MSE performance as TDM; the parameters for MRST-PAT were chosen to have the same training power as DDST and 4 pilots, although under frequency-flat fading conditions the minimum length is [36].

According to [26], when -ary QAM constellations with are used, there is evidence of DDST performance degradation due to the data identifiability problem, which is discussed in the following paragraph.

In some block realizations, the data-dependent signal distortion can have more power than the difference between two symbols; consequently, the data-bearing signal will be modified to such a degree that it is not possible to correctly recover it.

Figure 7 shows the block error rate (BLER) performance using 16 QAM signals. In this case, BLER performance is used because it is a relevant measurement used for 3 GPP performance testing requirements, which determine the quality of the radio link. It corresponds to the ratio of the number of blocks received with at least one symbol error to the total number of blocks transmitted.

It can be observed that at fixed BLER of an advantage close to 1 dB was accomplished by MRST-PAT over DDST, both using one iteration and of the transmitted power.

Figure 8 plots the BER performance for 64 QAM signals. Here, the advantage of the proposed MRST-PAT algorithm is more evident. An interesting analysis is that MRST-PAT ( and one iteration) shows the same performance as TDM with . MRST-PAT ( and two iterations) approaches TDM with in 0.5 dB, for SNR levels above 20 dB.

Figure 9 plots the performance of 64 QAM 2-OSTBC versus the block length for the SNR fixed at 30 dB and for DDST and MRST-PAT. It can be seen from the figure that DDST for short block length yields poor results and for large block length ( symbols) approaches TDM with eight pilots, while MRST-PAT for short block length yields good results and the performance increases proportionally as increases.

It is important to note that DDST’s BER performance is highly sensitive to block length. In this scenario, we can appreciate the benefits and flexibility of the MRST-PAT structure, which exploits the presence of the known training symbols and the fact that the ST algorithm achieves a better channel estimate when the block length of the data transmitted is increased. Hence, the system user can change the parameters according to the requirements of block length or less rate loss, for example, fewer dedicated pilots or BER performance.

Finally, it is important to analyze the BER performance versus the power fraction for the training signal, with the SNR fixed at 34 dB and 256 QAM signals. This SNR value is commonly required for this order of modulation. Figure 10 shows that DDST exhibits a poor BER performance for every value of power fraction allocated to the training signal. However, the BER performance exhibited by the MRST-PAT confirms that the proposed method overcomes the identifiability problem and makes the superimposed technique an attractive choice for multiamplitude QAM constellations. The same behavior is found at other SNR levels.

For future research purposes, the proposed approach will be used with more sophisticated equalizers, such as that developed in [24, 25, 28], as well as with channel coders. The results obtained in this future research will allow determining the scenarios under which this new technique represents the best practical solution to be employed.

7. Conclusion

The iterative joint MRST-PAT MIMO detection and channel estimation technique introduced in this contribution deals effectively with DDST’s identifiability problem when higher-order modulation schemes are used. This leads to communication systems that achieve near-DDST performance for low-order symbols, such as BPSK and QPSK, and attain PAT (TDM) performance for higher-order symbols, such as 16 QAM or beyond, with minimal reduction in throughput and using a computationally efficient implementation. Since MRST-PAT is an iterative joint scheme, it can take advantage of both efforts, that is, using pilots by means of explicit training (PAT) and taking the benefit of the implicit training using the ST training signal in the transmitted block. Consequently, more flexible communication systems may be built according to different requirements and constraints. Similar results are expected for MIMO frequency-selective channels.

Competing Interests

The authors declare that they have no competing interests.