Abstract

This contribution describes a novel iterative radio channel estimation algorithm based on superimposed training (ST) estimation technique. The proposed algorithm draws an analogy with the data dependent ST (DDST) algorithm, that is, extracts the cycling mean of the data, but in this case at the receiver_s end. We first demonstrate that this mean removal ST (MRST) applied to estimate a single-input single-output (SISO) wideband channel results in similar bit error rate (BER) performance in comparison with other iterative techniques, but with less complexity. Subsequently, we jointly use the MRST and Alamouti coding to obtain an estimate of the multiple-input multiple-output (MIMO) narrowband radio channel. The impact of imperfect channel on the BER performance is evidenced by a comparison between the MRST method and the best iterative techniques found in the literature. The proposed algorithm shows a good tradeoff performance between complexity, channel estimation error, and noise immunity.

I. Introduction

One of the most widely used approaches to channel estimation is to employ pilot assisted transmission (PAT), where a known training sequence, also referred as pilot, is inserted at each block of transmitted data [1]. Using the knowledge of the training symbols and the corresponding received signal, the channel estimator block at the receiver is able to make an estimate of the channel impulse response (CIR). However, these training pilots, using time division multiplexing (TDM) scheme, consume valuable bandwidth resulting in the reduction of the data rate.

There are two well-known channel estimation techniques to avoid the bandwidth waste of PAT schemes: superimposed training (ST) [2, 3] and data dependent ST (DDST) [4]. These techniques are based on the arithmetic addition (superimposed) of a training sequence to the information data. Both schemes provide a simple (unsophisticated) channel estimation processes; they differ only in that the cyclic mean of the transmitted data of the DDST scheme is superimposed into the transmitted sequence in similar way that the training signal.

Although DDST outperforms ST [4] in terms of channel estimation error, it is worth mentioning that the decoding of data under DDST is of iterative nature because it needs to extract the data-dependent distortion. Considering this, DDST with DDD removal (henceforth we will refer to this scheme as DDST-DDD removal) gives a similar performance as TDM-based channel estimation, but with fewer bandwidth losses. There are however some drawbacks in trade; DDST technique introduces a delay in the transmitted data in order to calculate the cyclic mean. It also assigns less power to the data signal, and hence the use of high-order symbol constellations has repercussions on the data decoding process [5].

The last constraints lead to research on iterative implementations of ST as in [610], starting from the SISO radio channel. These works are based on the use of the decoded data to eliminate the distortion introduced by the received data in the channel estimation process. The first approach uses ST in combination with a traditional least squares channel estimate (LSST) and was developed in [6]. In [6, 7] it is clearly shown that in terms of the channel estimation mean square error (MSE), LSST converges to a fully trained system for high SNR in two iterations, thereby outperforming both conventional ST and DDST. The great disadvantage of LSST scheme is the computational burden of for one iteration, where is the block length and is the order of the CIR. In [11], it is demonstrated that DDST-DDD achieves similar performance than LSST but with considerably less complexity.

Alternative iterative procedures of both ST and DDST methods were introduced in [7]. The first one, IST, uses the equalized symbols, obtained via ST, to improve the channel estimate in an iterative way but with less complexity than LSST [6]. The second one, LSDDST employs the LSST iterative approach but based on DDST instead of ST. LSDDST scheme, has the same computational burden as LSST, that is, , but it converges faster to the fully trained system. In terms of bit error rate (BER), LSDDST scheme yields almost the same BER than LSST.

From the previous works, we are now able to establish this contribution in the context of low-complexity iterative algorithms using ST that shows good BER performance. We introduce a new iterative mean removal ST (MRST) proposal and compare its performance with the previous and most relevant works.

This MRST yields similar performance to DDST-DDD removal and IST but with less complexity when they are compared with LSDDST and LSST. Because the iterative channel estimation methods depend on and work jointly with the equalization stage, we present the results using two equalizers widely used in communication systems: the minimum mean square error (MMSE) equalizer and the maximum likelihood sequence estimation (MLSE) equalizer. The inclusion of both techniques is helpful to accentuate some particularities of the channel estimation methods used.

Additionally, we extend the results of SISO to MIMO case, and study the performance of training-based flat block-fading MIMO channel estimation. Three training-based channel estimators are considered (TDM, DDST, and MRST), which offer different tradeoffs in terms of performance. We analyze the error performance of the MRST method based on the traditional least squares (LSs) method and obtain the corresponding MSE. The proposed MRST estimator for the MIMO case is illustrated using an orthogonal space-time block coder (OSTBC) with two transmit and two receive antennas, that is, Alamouti space-time coding.

This paper is organized as follows. Section 2 deals with the new iterative ST approach in the framework for ST/DDST schemes for SISO systems. The performance analysis of the MRST for MIMO systems is obtained in Section 3. In Sections 4 and 5, the simulation results and performance comparison are given for SISO and MIMO, respectively. Finally, the conclusions are set down in Section 6.

2. Mean Removal ST for SISO Systems

2.1. SISO System Model

Assuming a frequency-selective channel, Figure 1 depicts the discrete-time baseband block diagram of a digital communication system, where the channel input signal represents a succession of information blocks of length , and is given by taking into account that the block of interest is k indexed and . The sequence represents the data with zero mean and variance is a deterministic periodic training sequence with period and power , and refers to a data-dependent sequence, with period , obtained by periodically repeating times the signal where .

For the ST case, and the sequences are arithmetically added in a superimposed way according to [4], before transmission. Then, the received signal, assuming that exact synchronization and DC-offset are provided, can be observed at the receiver as where is the impulse response of order (i.e., and ), and is the complex Gaussian random noise with zero-mean, white, uncorrelated to and independent real and imaginary parts with variance per dimension.

Assuming that the channel is quasistatic, that is, the channel is time-invariant during the information block received, and the exact channel order is known in advance, then the strong constraint () showed in [11], with exact synchronization and DC-offset provided, can be relaxed to . We used this assumption for the mathematical analysis of the system. Despite this relaxation, all simulations were carried out considering the strong constraint.

2.2. Performance Analysis

It is clear that DDST method must compute the sequence from the data block that will be transmitted; consequently this data processing has an impact on the total delay of every transmitted block. On the other hand, the sequence has a wider dynamic range in comparison with the sequence obtained with ST method. This results in a higher peak-to-average power ratio (PAPR) of the preamplifier communication building block. Another implication using DDST is the fact that the sequence will have less power in comparison again with ST and then less noise immunity. For these reasons, iterative ST schemes are very attractive.

We developed the MRST scheme starting from the hypothesis that if we could obtain an estimate of the signal (i.e., a cyclic mean of sequence ) at the receiver side, then we would achieve the performance of DDST in terms of channel error estimate MSE but with more power assigned to the data sequence , having an impact on a better performance in terms of BER.

Estimating the cycling mean of period as in [3], we can write with .

Combining (3) and (4) yields where From (5), it follows that in matrix form where and are circulant matrices with first columns and respectively, and . The column vectors and have similar expression to . Assuming a cyclic prefix of length (as was done in [4, 7]), is circulant matrix with first column .

For the ST case (i.e., when in (7)) we have , and using the channel estimate from [3] then

For the DDST case in (7), so we have the channel estimate [4] as Clearly, the difference between these two channel estimation schemes is the factor .

Now using the hypothesis early mentioned (i.e., with ST scheme), we proceed to make the channel estimation for the MRST scheme, then (7) becomes and multiplying (10) by we obtain and finally

From (12) it follows that if then . In order to make a good estimate of , the following steps are made.

(1)Use (8) to have an initial channel estimate as plain ST and make .(2)Use the channel estimated to obtain the equalized symbols and employ a hard decision detector.(3)Use the hard-decision symbols detected to calculate .(4)Remove from the received signal to obtain a new according to where is a column vector .(5)Use (11) with and update the channel estimate .(6)Go to step 2 and repeat as need it.

Defining the , then from [3, 4], and having a better estimation of , it follows that , then

From (9) and (11), DDST and MRST, respectively, we note the fact that is calculated only once beforehand. The IST scheme, however, uses the estimate of to calculate (see [7, (12)]) for every iteration and information block received. This action implies an additional complexity to the computational burden.

On the other hand, as we will see in Section 4, the estimate used to estimate the channel in IST and MRST methods depends on what type of equalizer is used. Conversely, the scheme DDST-DDD removal cannot take advantage of MLSE equalizer, because this equalizer delivers hard-decision symbols, and DDST-DDD removal uses the equalized symbols previous to hard-decision procedure.

Table 1 summarizes the computational burden, the computation of inverse matrix , and the MSE performance approach. The complexity of an algorithm is a quite important metric when the algorithm is to be implemented in HW, as this is the case we are very concern of this value. As for computational burden metric, we choose the number of iterations as function of the transmitted block length N and the equalizer length , to be precise, and the number of coefficients in the equalizer.

Although DDST has the least computational complexity at the receiver side, it needs to calculate the data-dependent sequence , that implies additional complexity and time delay. Furthermore, it assigns less transmission power to the data signal.

3. Mean Removal ST for MIMO Systems

3.1. MIMO System Model

We consider a wireless MIMO communication link with transmit and receive antennas, operating in a Rayleigh flat-fading environment. The fading coefficient is the complex path gain from transmit antenna to receive antenna . We assume that the coefficients are independently complex circular symmetric Gaussian with unit variance, and then The expression for the received symbols can be expressed as where is the received signal vector, is the transmitted signal vector, and is an vector of additive noise terms, assuming that noise is spatially and temporally white Gaussian with zero mean and independent real and imaginary parts with variance per dimension.

Let us assume the block transmission scheme with the block length at times and we also assume that the channel matrix remains constant within a block of symbols, that is, the block length is much small than the channel coherence time. Under these assumptions, the channel, within one block, can be written as where are the matrices of the received signal, transmitted signals, and noise, respectively [12].

Let us denote the set of complex information symbols prior to space-time encoding as , where each denotes a set of signal constellation points. The symbols are zero-mean mutually uncorrelated random variables. Let us introduce the vector where denotes transpose. Note that , where is the set of all possible symbol vectors and is the cardinality of this set. The complex matrix-valued function is called OSTBC [13] if it satisfies that

(1)all the entries of are linear functions of the complex variables and their complex conjugates;(2)for any arbitrary in , where is the identity matrix, is the Euclidean norm, and denotes Hermitian transpose.

3.2. Performance Analysis

In order to estimate the channel matrix , it should be emphasized that in any statistical expectation below, the matrix is treated as random; at the same time, any estimator of is supposed to obtain an estimate of a particular realization of this random matrix that corresponds to the current block of the received data.

In the conventional ST estimation technique for MIMO systems [9], a known training matrix , is added arithmetically to the data matrix during every block transmitted. In this way, the transmitted signal matrix can be expressed as . It is clear that the total transmitted power is distributed between the data and the training signals, that is, .

The ST system is depicted in Figure 2, where a signal matrix is transmitted over the radio MIMO channel with the channel matrix , and distorted with the noise matrix .

Based on the received signal R, the MRST delivers an estimate of the MIMO channel matrix, denoted as ; subsequently the decoder obtains an estimate of , denoted as , and an estimate of the mean of the data, , which can be used by the channel estimator block, in an iterative way, to provide the decoder with a better estimate of .

Now, the task of this channel estimation algorithm is to recover the channel matrix based on the knowledge of and . Assuming flat-frequency channel conditions then all the row vectors of the training matrix can be equal. Hence, a time-domain estimator based on the synchronized averaging of the received signal can be implemented. The time-average of the signal is given by where is a column vector, is the unit vector, is the training vector repeated times and represent the time-average of the data matrix and noise matrix , respectively, for every block transmitted.

Using the LS approach [14, 15], an estimate of can be obtained as where is the pseudoinverse of .

We will use the following transmitted training power constraint where is the identity matrix; now the channel estimate is It is clear that the average over the data signal represents an extra term for the channel estimate. This average is exploited by the DDST method at the transmitter or by the MRST method at the receiver.

In order to obtain the MSE achieved by MRST method, we use the performance analysis of the DDST method, explicitly we use and define the perturbation matrix that will be arithmetically added to the data signal every transmitted block. Hence, the transmitted signal is and the corresponding received signal is , then from (21) we obtain

Considering (22) under optimal training and , the MSE for DDST [9] is given by

Instead to take away the contribution of at the transmitter, the MRST uses the plain ST and removes in an iterative manner at the receiver. From (21), the channel estimation error can be expressed as It follows that if then , therefore the as it will be corroborated by the simulation results in Section 5. With respect to the iterative procedure to obtain a better estimate of , we follow the steps explained for the SISO case.

3.3. MRST with Alamouti Space-Time Coding

Let us use MRST estimation method with the -OSTBC system (Alamouti coding scheme). Let us assume to be even, and are the matrices of the transmitted signal, data signal, and training signal, respectively, with denoting complex conjugation. The matrix’s rows and columns indicate the transmission time and transmit antenna, respectively. Assuming flat-fading scenario, no cyclic prefix is required, and the signal training selection can be done choosing two symbols and exploiting the property of orthogonally achieved with the OSTBC system, that is, the two column vectors of the matrix are orthogonal.

The estimate of the cycling mean in flat-fading scenario can be done obtaining the mean of the receive matrix denoted as . Each element of is given by where the indexes correspond to Alamouti’s emission time () and transmit antenna () respectively for this particular block coding transmission.

Let us estimate the channel matrix using (28) and assuming a noisy free scenario (without loss of generality), hence, we have and is the vectorization operator stacking all rows of a matrix on top of each other. Therefore, the channel estimate is given by Because matrix is unitary, then (30) can be rewritten as and finally, where and are the and identity matrix, respectively, and denotes the Kronecker product. The matrix represents the new MIMO channel estimate used in the iterative procedure to get a new estimate of the mean of the data-bearing ().

The key idea of this implementation is the way to remove the mean at the receiver. Instead of subtracts the mean of the data, from the received matrix we incorporate the mean estimate in the matrix and use the Alamouti decoding procedure to estimate the channel matrix.

4. Simulation and Results for SISO Systems

Equalization is a well-known technique used to combat intersymbol interference (ISI) whereby the receiver attempts to compensate for the effects of the channel of the transmitted symbols. An equalizer attempts to determine the transmitted data from the received distorted symbols using an estimate of the channel that caused the distortions. In this contribution, we consider two types of equalizers widely used in communications systems: the MMSE and MLSE equalizers. The MMSE has lesser complexity and performance in terms of BER than the MLSE equalizer that is optimal for ISI.

4.1. SISO System Using MMSE Equalizer

We considered a time-invariant random three-tap frequency-selective Rayleigh fading channel with and . The channel coefficients were complex Gaussian, i.i.d. with unit variance, rescaled to achieve unitary mean energy. The sequence is an R.V with uniform p.d.f. and variance . The parameters and are chosen for ST and DDST independently such that . The training to information power ratio was set to with . The block length is fixed to and a cyclic prefix of length added at the beginning of each block in both ST and DDST methods. The channel estimated is used to design an MMSE equalizer of length and equalization delay . All simulations that were run until 1000 blocks with errors were found.

Figure 3 shows the channel estimate MSE obtained from the three iterative procedures considered (a) DDST with DDD removal exposed in [11], (b) IST scheme introduced in [7], and (c) MRST method presented here. It can be observed that IST and MRST have a significant approach to DDST (from 10 dB to 20 dB) just after two iterations.

Figure 4 shows the BER performance comparison where for 2 iterations, all schemes show similar behavior for low and medium SNR levels; DDST exhibits a slight advantage for high SNRs because it achieves a better channel estimate at these levels as is shown in Figure 3.

4.2. SISO System Using MLSE Equalizer

In order to get a better estimate of for both IST and MRST schemes, we use an MLSE equalizer with traceback length of 11. To be more specific, this equalizer comprises a Viterbi algorithm, which finds the most likely data sequence transmitted. Although, to perform close to ideal MLSE, the equalizer requires traceback lengths of the order of 5-6 times the ISI span, we chose the traceback length of 11, that is, the same number of taps of the MMSE equalizer used in Section 4.1.

Due to the fact that DDST-DDD removal [11] works with the equalizer output before proceeding with a new hard-decision process, that is, equalization and detection stages are carried out separately, it cannot exploit the benefits of the MLSE equalizer. In order to have similar conditions for the DDST method in the decoding procedure, firstly we used the MMSE equalizer, secondly we removed the data distortion, and finally we used the MLSE equalizer to obtain the data symbols. Figure 5 shows the BER performance of these three methods and the conventional ST method.

Clearly, we observe the benefits of use IST or MRST coupling with an MLSE equalizer, where both schemes outperform DDST-DDD removal when it only uses the MLSE equalizer. However, there is no noticeable performance difference when DDST uses the concatenation of the MMSE and MLSE equalizers.

Figure 6 shows the block error rate (BLER) performance. BLER is the statistical measurement of the ratio of the number of blocks with error received to the total number of blocks transmitted and it is part of the performance requirements of 3GPP test. We observe that there is no noticeable performance difference between the MRST and IST with one iteration, and DDST-DDD removal using the equalizer concatenation and removing the data distortion.

5. Simulation and Results for MIMO Systems

In what follows, we illustrate the performance of the proposed scheme on MIMO systems working in a flat-fading scenario. We use the -OSTBC system, transmit and receive antennas, respectively, with ideally uncorrelated elements. The block length is fixed to symbols, and all simulations were run until 1000 blocks were in error. The BER is represented as a function of the average SNR, where and is the average energy per symbol.

A QPSK symbol constellation is used, and the power transmitted for each antenna is normalized to one, that is, [Watt].

Figure 7 depicts the channel estimate MSE comparisons for DDST and the MRST techniques with one and two iterations; both of them use of the power transmitted , that correspond to an approximate upper bound of the region with the best BER shown in Figure 10. Making an analysis of this performance, we can realize that the proposed scheme starts to approach the DDST performance from 10 [dB]. The MRST plot for the second iteration reaches the best approach. Note that this benefit comes at the price of decoding complexity.

Figure 8 shows the BER performance comparison between TDM with 17% bandwidth loss, DDST with DDD removal and the MRST algorithm. We observe that DDST and MRST plots with one or two iterations practically achieved the same performance. TDM-based channel estimation attained the best BER performance at the expenses of 17% bandwidth loss.

Figure 9 depicts the MSE of MRST versus when . For example, with we have that . A better MSE performance is directly proportional to the training power used.

Figure 10 plots the BER of MRST versus from to 15 dB. Note that the minimum is achieved in the range where approximately. Because the BER achievement is the key and critical factor in digital communication systems, we worked in the region with the best BER in the selection of the training power assigned. This selection is only a guide or suggestion for this type of radio propagation conditions at these particular SNR levels. Similar training power range was chosen for DDST in [9], but using a spatial multiplexing system. A more detailed analysis of the training power allocation problem is shown in [16]. Particularly, the simulations showed in Figures 9 and 10 were run until 10 000 blocks were in error.

6. Conclusion

low-complexity iterative superimposed training schemes that work jointly with equalization stage can offer similar or better performance than the iterative symbol-by-symbol detection DDST with DDD removal. The MRST introduced in this work effectively compensates the data dependent distortion that ST is unable to deal with, but at the receiver side. This leads to communication systems that perform similar to DDST but without suffering its drawbacks. Furthermore, MRST can be successfully applied to both SISO and MIMO systems, and in cooperation with the most widely used equalizers. Particularly, for the SISO case, MRST shows similar performance like the previous proposed method IST, but it avoids the inverse matrix computation that every iteration IST does. Although the results of both iterative show insignificant gaps in the performance of BER and MSE, MRST is preferred by its less complexity hardware implementation.

For the MIMO case, the performance results are closely similar, that is, the performance of DDST is attained with the proposed method. Additionally, iterative ST methods can still be applied to time-varying channels, while DDST based system does not. This is true because the concept of cyclic mean in DDST is meaningless due to the fact that every symbol of the block transmitted is distorted in different way by the channel.

Acknowledgments

The authors thank the reviewers for their careful reading of the manuscript and their constructive comments. This work was supported by INTEL research grants DCIT2006 and CERMIMO2008, and CONACYT research grants 47909-Y and 84559-Y.