EURASIP Journal on Advances in Signal Processing
Volume 2008 (2008), Article ID 390102, 13 pages
doi:10.1155/2008/390102
Research Article

A Two-Stage Approach for Improving the Convergence of Least-Mean-Square Adaptive Decision-Feedback Equalizers in the Presence of Severe Narrowband Interference

1Department of Electrical and Computer Engineering, University of California at San Diego, La Jolla, CA 92093-0407, USA
2Wireless@VT and the DSP Research Laboratory, Bradley Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg, VA 24061-0111, USA

Received 3 January 2007; Revised 16 April 2007; Accepted 8 August 2007

Academic Editor: Peter Handel

Copyright © 2008 Arun Batra et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

It has previously been shown that a least-mean-square (LMS) decision-feedback filter can mitigate the effect of narrowband interference (L.-M. Li and L. Milstein, 1983). An adaptive implementation of the filter was shown to converge relatively quickly for mild interference. It is shown here, however, that in the case of severe narrowband interference, the LMS decision-feedback equalizer (DFE) requires a very large number of training symbols for convergence, making it unsuitable for some types of communication systems. This paper investigates the introduction of an LMS prediction-error filter (PEF) as a prefilter to the equalizer and demonstrates that it reduces the convergence time of the two-stage system by as much as two orders of magnitude. It is also shown that the steady-state bit-error rate (BER) performance of the proposed system is still approximately equal to that attained in steady-state by the LMS DFE-only. Finally, it is shown that the two-stage system can be implemented without the use of training symbols. This two-stage structure lowers the complexity of the overall system by reducing the number of filter taps that need to be adapted, while incurring a slight loss in the steady-state BER.

1. Introduction

Maintaining reliable wireless communication performance is a challenging problem because of channel impairments such as fading, intersymbol interference (ISI), narrowband interference, and noise. Therefore, there is a need for innovative receivers which can mitigate these impairments rapidly, especially when the information is being transferred in small packets or short bursts.

There has been a considerable amount of work on mitigating the effects of ISI (see [1] and references therein) and fading channels (see [2] and references therein). The focus of this paper is on techniques that can quickly mitigate strong narrowband interference. Narrowband interference typically occurs because of nonlinearities in the mixer or by other communication systems radiating in the same frequency band (as occurs in many of the unlicensed bands, e.g., Bluetooth is a narrowband interferer for WLAN systems). A strong interferer can make recovering the transmitted information quite challenging.

Several methods for suppressing narrowband interference have been discussed in the literature. A linear equalizer (LE) and a decision-feedback equalizer (DFE) were studied in [3]. It was shown that the performance of the DFE is better than that of the LE. The LE seen in both systems removes the interference, while the additional feedback taps of the DFE enable the cancellation of the post-cursor ISI that is induced by the LE. Linear prediction [4, 5] is another common technique that has been used in direct-sequence CDMA systems [68] when the processing gain does not provide enough immunity to the interference. When the signal of interest is wideband compared to the bandwidth of the interferer, linear prediction predicts the current value of the interference from past samples. When the structure is implemented as a prediction-error filter, the estimate of the interference is removed at the cost of some signal distortion. A further review of interference suppression techniques can be found in [9, 10].

When the statistics of the interference are known, the weights of these systems are found by minimizing the mean-squared error [11] (or equivalently by solving the Wiener-Hopf equation). In practice, however, this type of a priori information is not available. Thus, these systems are best implemented adaptively. Of the many algorithms available, we focus on a low-complexity method, specifically the least-mean square (LMS) algorithm [11]. The LMS algorithm is also noted for its robustness and improved tracking performance [11, 12]. The drawback of this particular algorithm is its slow convergence when there is a large disparity in the eigenvalues of the input signal [11]. Slow convergence leads to the need for a large number of training symbols. These symbols do not transmit any new information, reducing the overall throughput of the system. Conventional analyses of adaptive algorithms use the mean-squared error (MSE) as the metric when investigating the convergence. However, since BER is a more definitive performance metric for analyzing communication systems, the convergence is viewed in terms of the BER with the aid of a sliding window. Convergence is defined as the number of symbols needed to attain a certain BER.

Although it has been shown that alternate adaptive algorithms, such as the recursive least squares (RLS) algorithm [11], provide improved convergence relative to the LMS algorithm in cases of high eigenvalue disparity, there are many reasons why LMS is chosen for practical communications system applications. Hassibi discusses [12] some of the fundamental differences in the performance of gradient-based estimators such as the LMS algorithm and time-averaged recursive estimators such as the RLS algorithm in the cases of modeling errors and incomplete statistical information concerning the input signal, interference, and noise parameters. Hassibi [12] examines the conditions for which LMS can be shown to be more robust to variations and uncertainties in the signaling environment than RLS. LMS has also been shown to track more accurately than RLS because it is able to base the filter updates on the instantaneous error rather than the time-averaged error [1316]. The improved tracking performance of LMS over RLS for a linear chirp input is well established [11, 16]. In [17] it is shown that an extended RLS filter that estimates the chirp rate of the input signal can minimize the tracking errors associated with the RLS algorithm and provides performance that exceeds that of LMS. It should be noted, however, that the improved tracking performance requires a significant increase in computational complexity and knowledge that the underlying variations in the input signal can be accurately modeled by a linear FM chirp. For cases where the input is not accurately represented by the linear chirp model, performance can be expected to be significantly worse than simply using an LMS estimator, for the reasons discussed in [12]. The computational complexity of RLS, in particular for high-order systems, favors the use of LMS. The latter is also more robust in fixed-point implementations. In addition, the LMS estimator has been shown to provide nonlinear, time-varying weight dynamics that allow the LMS filter to perform significantly better than the time-invariant Wiener filter in several cases of practical interest [18, 19]. It is further shown that the improved performance associated with these non-Wiener effects is difficult to realize for RLS estimators due to the time averaging that is inherent in the estimation process [20].

In this paper, we first demonstrate that the LMS DFE possesses an extended convergence time (greater than 10,000 symbols for the cases investigated here) when severe narrowband interference (SIR  dB) is present, due to the fact that the equalizer does not have a true reference for the interference. To reduce the convergence time and the number of training symbols needed, we propose a two-stage system that uses an LMS prediction-error filter (PEF) as a prefilter to the LMS DFE-only. For strong interference the PEF generates a direct reference for the interference from past samples and mitigates it prior to equalization.

A two-stage system employing a linear predictor has been previously investigated [21, 22] in combination with the constant modulus algorithm (CMA). The prediction filter is employed to mitigate the interference and ensure that the CMA locks on to the signal of interest. The prediction filter is not used specifically for its convergence properties. The two-stage structure in this paper uses a supervised algorithm for the adaptation of the second structure and is developed with the goal of improving the convergence of the overall system.

The second contribution of this paper is to show that the two-stage system reduces the number of training symbols required to reach a BER of by two orders of magnitude without substantially degrading the steady-state BER performance as compared to the LMS DFE-only case. All comparisons will be made under the condition that the LMS DFE-only and the two-stage structure have the same total number of taps. The two-stage system's adaptive implementation is superior due to the fact that the prediction-error filter utilizes the narrowband nature of the interference to obtain a beneficial initialization point. On the other hand, the LMS DFE-only employs only the training symbols which have no knowledge of the statistical characteristics of the interference.

Finally, the two-stage system may be implemented in a manner that does not require any training symbols. The PEF is inherently a blind algorithm because the error signal is determined from the current sample and the past samples. A relationship between the PEF weights and the DFE feedback weights is obtained, allowing the DFE to be operated in decision-directed mode after convergence of the PEF weights. This technique outperforms the nonblind decision-directed implementation when a small number of training symbols is used. The nonblind decision-directed implementation suffers because the feedback weights lie far from their steady-state values prior to the switch to decision-directed mode. This blind method also allows for a reduction in the complexity of the system (i.e., fewer weights that need to be adapted) at the cost of a slight increase in steady-state BER.

The paper is organized as follows. Section 2 describes the system model. The LMS algorithm and its convergence properties are reviewed in Section 3. In Section 4, the previous approaches of the DFE and the PEF are discussed. The proposed two-stage system is revealed in Section 5 along with its relation to the DFE. A blind implementation for the proposed system is also presented in Section 5. In Section 6, the convergence and steady-state BER results are presented. Concluding remarks are given in Section 7.

2. System Model

A complex baseband representation of a single-carrier communication system is depicted in Figure 1. The signal of interest, , is composed of i.i.d. symbols, drawn from an arbitrary QAM constellation, with average power equal to . It is passed through a pulse shaping filter that is necessary for bandlimited transmission. This signal is corrupted by narrowband interference, , modeled as a pure complex exponential and additive white Gaussian noise. A matched filter is employed at the receiver to maximize the signal-to-noise ratio (SNR) at the output of the filter. Note that the overall frequency response of the pulse shape and the matched filter is assumed to satisfy Nyquist's criterion for no intersymbol interference (ISI) and the filters operate at the symbol rate.

Figure 1: Discrete-time system model.

The signal at the input to the equalizer, , is defined as (1) where is the symbol duration, is the interferer power, is the angular frequency of the interferer, and is a random phase that is uniformly distributed between and . The additive noise, , is modeled as a zero-mean Gaussian random process with variance . The signal-to-noise ratio is defined as SNR = and the signal-to-interference ratio is defined as SIR = .

It is assumed that the communication signal, interferer, and noise are uncorrelated to each other. The autocorrelation function of the input, , is defined as (2) where is the expectation operator, indicates conjugation, and is the Kronecker delta function.

3. LMS Algorithm

The LMS algorithm [11] is defined by the following three equations: (3) where is the input vector to the equalizer, is the vector of adapted tap weights, is the desired signal, is the output of the decision-device when is its input, is the error signal, is the step-size parameter, and represents conjugate (Hermitian) transpose.

Note that there are two stages associated with the adaptive algorithm. The first stage is the training phase, where known training symbols are used to push the filter in the direction of the optimal weights. After the training symbols have been exhausted, the algorithm switches to decision-directed mode. The output of the decision device is used as the desired symbol when calculating the error signal. Ideally, at the end of the training phase the output of the filter is close to the desired signal.

3.1. LMS Convergence

In conventional analyses, convergence refers to the asymptotic progress of either the adaptive weights or the MSE toward the optimal solutions. The convergence (as well as the stability) of the system is dependent on the step-size. The step-size parameter is chosen in a manner to guarantee convergence in the mean-square sense, namely, (4) where is the maximum eigenvalue of the input autocorrelation matrix.

Assuming that the adaptive weights and the input vector are independent, Shensa [23] showed that the convergence of the weight vector can be expressed as (5) where are the eigenvalues and are the eigenvectors of the input autocorrelation matrix. The optimal Wiener solution is represented by . A similar equation arises for the convergence of the mean-square error (MSE) [24], when gradient noise (on the order of ) is neglected (6) Letting the learning curve be approximated by a single exponential allows a time constant [11] to be defined for each mode, (7) The maximum modal time constant is associated with the minimum eigenvalue, (8)

This maximal time constant can be seen to be a conservative estimate by examining (5) more closely. The convergence will be influenced only by those eigenvalues for which the projection of the corresponding eigenvector on the optimal weights is large. Lastly, it can be seen for the case of , that it is possible for the convergence of the filter output (mean-square error) to be faster than the convergence of the filter weights. This is because there may be fewer modes controlling the MSE convergence (i.e., when ).

The equations above provide excellent insight into the convergence of the LMS algorithm; however, in this paper, we are interested in the convergence in a limited time interval when the metric of interest is BER. Therefore, we define the convergence to be the average number of training symbols needed to achieve a BER of . This value is consistent with the notion that the BER should be less than when switching from training to decision-directed mode [25]. Additionally, using a convolutional code with an input BER equal to is equivalent to a BER of at the output of the decoder [26].

3.2. Sliding BER Window

As mentioned above, the convergence of an adaptive filter is viewed by the ensemble average learning curve [11], a plot of the MSE versus iteration. Note that in this work, each iteration of the adaptive algorithm occurs at the symbol rate. To examine the convergence of the BER here, we employ a sliding window of symbols. For example, the first BER value corresponds to the average number of bit errors over symbols 1 through 100; the second value corresponds to the average number of bit errors over symbols 2 through 101; and so forth. These values are then averaged for trials. A general formula for BPSK modulation can be seen as (9) where is the transmitted symbol of the packet and is the decision of the symbol of the packet. Note that the minimum nonzero BER value will be equal to .

4. Previous Approaches

4.1. Decision-Feedback Equalizer
4.1.1. Equalizer Structure

The DFE is composed of a transversal feedforward filter with -taps (one main tap and side taps) and a feedback filter that has -taps. A block diagram of the DFE is shown in Figure 2. The output of the filter, , with inputs and is (10) where is the estimate of the symbol out of the decision device. Note that are the tap weights associated with the feedforward filter, and are the tap weights associated with the feedback filter. During the training phase, in (10) equals .

Figure 2: Decision-feedback equalizer block diagram.

The feedback taps allow the equalizer to cancel out post-cursor ISI based on the estimated decisions without enhancing the noise. The BER analysis of the DFE with error propagation can be accomplished utilizing Markov chains to model the term as contents of a shift register and the assumption that the fed back decisions are perfect [3, 2729]. The number of states in the Markov chain grows exponentially with the number of feedback taps.

4.1.2. DFE Optimal Weights

The optimal weights under the minimum mean-square error (MMSE) criterion can be found using the orthogonality principle [11]. equations are obtained, and the weights can be found using the method described in [3, 30]. The optimal DFE tap weights are given by (11)(12)(13)(14)

Observe that the weight of the feedback taps (14) is the negative of the feedforward side taps (12) when . This implies that if the data fed back is perfect, the ISI caused by the previous data symbols will be completely canceled. Also note that (13) is a scaled multiple of (12). This scaling value effectively removes the influence of the associated data symbols that can not be canceled by the feedback taps. For the special case of , it can be seen that if the data fed back is perfect, the ISI caused by the feedforward equalizer will be completely canceled, leaving only the symbol of interest.

4.1.3. DFE SINR Calculation

The signal-to-interference-plus-noise ratio (SINR) at the input to the decision device of the DFE can be found using (10) and the optimal weights given in (11)–(14) to be (15)

4.1.4. Autocorrelation Structure

The input to the decision-feedback equalizer is the concatenation of the received input to the equalizer and the fed back decisions, given by , where is the transpose operator. The vector, , is composed of the fed back decisions that are assumed to be correct, and is thus defined as (16) The autocorrelation matrix for the -tap feedforward and -tap feedback equalizer is defined as (17) The autocorrelation matrix seen in (17) is partitioned into 4 submatrices. The matrices on the diagonal are the autocorrelation matrix of the received input to the equalizer and the autocorrelation matrix of the data symbols, respectively. The values in the upper left submatrix are given by (2). The cross-correlation matrix between the received input to the equalizer and the data symbols is located on the off-diagonal.

4.1.5. Eigenvalues

There is no closed form expression for determining the eigenvalues of the correlation matrix defined in (17). A method to bound the eigenvalues of positive-definite Toeplitz matrices can be found in [31] and its application to the correlation matrix given in (17) can be found in [32]. However, for the case of and , the minimum and maximum eigenvalues are found to be (18) and the eigenvalue spread is (19) Note that the eigenvalues given in (18) are not a function of .

4.1.6. Convergence Properties

The projection of any of the eigenvectors on the optimal weight vector is nonzero. This implies that the time constant (8) is inversely proportional to the minimum eigenvalue . The delay in convergence can be attributed to the fact that the DFE does not have a direct reference for the interferer during adaptation and is thus forced to converge on the basis of the training data only. The feedback taps converge slower than the feedforward taps because the DFE is designed such that the interferer is canceled by the feedforward taps, while the feedback taps attempt to cancel out the signal distortion caused by the feedforward taps [3].

4.2. Prediction Filter
4.2.1. Predictor Structure

The linear predictor (LP) is a structure that uses the correlation between past samples to form an estimate of the current sample [11, 25, 33]. A variant of this filter, the prediction-error filter (PEF), has the property that it removes the correlation between samples, thereby whitening the spectrum. A common example of this property is seen when determining the parameters of an autoregressive (AR) process. The prediction-error filter (assuming a sufficient filter order) of such an input provides both the AR parameters and a white output sequence that is equal to the innovations process.

This technique has also been used to remove narrowband interference in many applications [68, 29, 30]. The filter is able to predict the interferer due to its narrowband properties. A block diagram of the prediction-error filter is shown in Figure 3. The PEF is a transversal filter with taps. The decorrelation delay () ensures that the signal of interest at the current sample is decorrelated from the samples in the filter when calculating the error term. Because the data is i.i.d., is a sufficient choice, giving the one-step predictor. The linear combination of the weighted input samples, , forms an estimate of the interferer, given by (20) where are the tap weights of the predictor. The output of the PEF, , is defined as the subtraction of the estimate of the interference given in (20) from the current input sample (21) Note that is also the error term of the structure. This implies that the PEF is in fact a blind algorithm. It does not require any training symbols when calculating the error term.

Figure 3: Prediction-error filter block diagram.
4.2.2. Predictor Optimal Weights

The optimal tap weights can be found in a way similar to those for the equalizer above [3, 30]. Using the orthogonality principle, equations are obtained and the weights of the PEF are given by (22) where is equal to (23) For the scenario of interest in this paper, the interference power is much larger than both the signal power and the noise power. Therefore, the SIR and the noise-to-interference ratio (NIR) can be assumed to be very small (i.e., SIR  dB, NIR  dB [3]) and can be approximated as (24)

4.2.3. Sensitivity to Additive Noise

The PEF has been shown to be sensitive to additive noise when used for channel estimation [34, 35]. An algorithm was proposed in [36] to provide adaptive estimation of unbiased linear predictors with the goal of obtaining a consistent estimate of an ISI single-input multiple-output (SIMO) channel. To examine the effect of the additive noise on the PEF for this problem, we are interested in the noise-free predictor weights, given by (25) where is equal to (26) We compare (25) with the biased predictor weights given in (22) and look at the norm of the difference (bias), (27) This bias can be approximated using the assumptions that the SIR and NIR are very small to give (28) The value in (28) is quite small due to the assumption that the NIR is small. Thus, in this work, the bias in the linear predictor does not substantially affect the system's performance.

4.2.4. Autocorrelation Structure

The input autocorrelation matrix for the PEF is defined as (29) where the components of the matrix are given by (2).

4.2.5. Eigenvalues

The eigenvalues for the correlation matrix given by (2) and (29) can be found [7, 23, 37] to be equal to (30) The eigenvalue spread is defined [11] as (31)

4.2.6. Convergence Properties

In this case the eigenvectors corresponding to the minimum eigenvalues are orthogonal to the optimal weight vector, hence these eigenvalues do not affect the convergence [23]. Thus the time constant is dependent only upon the maximum eigenvalue .

4.2.7. Output Autocorrelation

The whitening property of the PEF can be seen more clearly through the autocorrelation function of the output of the PEF, which is derived to be (32) An approximation for the output autocorrelation function in (32) can be found using the approximation given in (24), (33) Finally, letting the filter order increase toward infinity shows that the output spectrum is approximately white, (34)

4.2.8. Eigenvalue Spread

The effect of the PEF is that the interference is removed, which then results in the reduction of the eigenvalue spread. This can be seen in Figure 4 for  dB, SIR  dB, and . Also in the plot is the eigenvalue spread of the received data given by (31). Note that it is assumed that . It is clearly seen that the spread has been reduced, and the modes of this input to the LMS DFE will converge in similar amounts of time.

Figure 4: Eigenvalue spread of input to DFE-only and output of PEF for  dB, SIR  dB, and .

5. Two-Stage System

As discussed in Section 4.2.7, the PEF provides an approximately white output spectrum when an infinite number of filter taps is used. Each additional tap provides an increase in spectral resolution when notching out the narrowband interference. However, the implementation of a large number of taps is not generally feasible and some distortion in the form of postcursor ISI will be present. To combat the distortion induced by the PEF, the DFE is a simple structure that removes the ISI without enhancing the noise. This leads to a simple two-stage structure that uses the PEF for rapid convergence and the DFE for removing postcursor ISI as a system to mitigate narrowband interference.

A similar approach is discussed in [25, pages 364-365] when deriving the zero-forcing decision-feedback equalizer. Barry et al. demonstrate that the optimal DFE precursor equalizer is related to optimal linear prediction. Consider transmitting data through a channel that induces ISI. This distortion can be removed by employing a linear zero-forcing equalizer, while causing the noise samples at the output of the equalizer to be correlated. This correlation can be subsequently removed with a PEF, at the expense of postcursor ISI. Finally, a zero-forcing feedback postcursor equalizer removes the ISI without enhancing the noise.

We now consider the performance of the PEF followed by the DFE, which will be abbreviated as PEF + DFE. A block diagram of the two-stage structure is shown in Figure 5. The PEF is tasked with whitening the spectrum by removing the interference, but due to its limited length it will introduce postcursor ISI; this ISI is then removed by the DFE. The DFE is designed to have a one tap feedforward section and an -tap feedback section. In general, there is no need for a feedforward section, because the input is distorted with only postcursor ISI that can be resolved by the feedback equalizer portion. We have chosen to include the one tap to compensate for any phase shifts that might exist because of phase errors, and/or gain mismatch between the transmitter and receiver.

Figure 5: Two-stage structure (PEF + DFE) block diagram.
5.1. Feedback Filter Order Estimation

We can estimate the optimal feedback filter order by looking at the output of the DFE. Assuming that the feedforward filter weight is equal to and the decisions fed back are perfect, let the output be defined as (35) We would like to find the weights that minimize the error, (36) Taking the derivative of the expected value term and setting this result to zero, the optimal weights are given by (37) When , the optimal choice for the feedback filter order is . This ensures that the ISI caused by the PEF is removed. With these choices and the assumption that the interference is canceled by the PEF, the output of the DFE is given by (38)

5.2. Optimal Equalizer Weights After Prediction-Error Filtering

The DFE possesses a -tap feedforward section and an -tap feedback section. The optimal weights for the DFE are found by solving the Wiener-Hopf equations [11, 19]. The feedforward weight is equal to . The output autocorrelation matrix reduces to a scalar value due to the 1-tap feedforward filter and is defined as (39) The latter term is given in (32). is defined as (40) where the components of are given by (41) Finally, is defined as (42) The feedback weights are defined as .

5.3. Steady-State Equivalence

The two-stage structure can be viewed in a different manner when operating in steady-state. Based on linear system theory, two linear time-invariant (LTI) systems can be combined into one LTI structure [38, pages 107-108]. For example, the PEF weights given in (22) and the feedforward weight of the subsequent DFE () can be combined to form an extended feedforward filter () of a DFE with one main tap and side taps. This is accomplished by (43) where “” represents linear convolution. The feedback taps remain the same, that is . Observe that and are the weights of a DFE operating in steady state. The case of interest is when and (as postulated in Section 5.1).

Solving, and for the weights gives (44)(45) The extended feedforward filter weights can be found according to (43), (46)(47)(48) Note that the feedback weights remain the same, namely (45) is equal to (48).

As mentioned previously in Section 4.2.2, the scenario of interest occurs when the interference dominates the signal of interest and the noise. Equations (46)–(48) can be approximated in this region using (24) to give (49)

As a comparison to (49), the DFE-only weights described by (11)–(14) need to be approximated for the assumption of small SIR and NIR as well. Letting , so that there are taps in the feedforward section and taps in the feedback section, the DFE-only weights are approximated as (50)

Comparing (49) and (50), it can be seen that combining the two-stage weights approximates the weights of the DFE-only.

5.4. Blind Implementation

The previous sections established a relationship between the PEF weights, the feedforward weight, and the feedback weights. Note that in Section 5.1 the feedback weights are equal to the PEF weights associated with past data symbols scaled by the feedforward tap weighting. Also, recall that the weights of the PEF rapidly converge and the structure does not require knowledge of training symbols. With and , the two-stage system can be implemented in a manner where the feedback tap weights are not adapted. After the PEF weights have converged, the multiplication of the PEF weights and the feedforward weight is used as the feedback weights. The feedforward tap is initialized to unity and is adapted in decision-directed mode. Thus, no explicit training symbols are required during the adaptation process. This method also reduces the complexity of the system; only of the total tap weights are adapted. In the scenario where there is a phase and/or gain error, the system requires the use of either training symbols to adapt the feedforward weight or a phase locked loop (PLL) and automatic gain control (AGC). Observe that these two components can be implemented in a decision-directed manner with no need for training symbols.

6. Results

6.1. Simulation Parameters

In the simulation results to follow, a QPSK constellation is utilized and the  dB. For convergence results, a 100-symbol window was used and the BER values are averaged over 1000 runs. The interferer frequency is located at DC (). All of the data were considered as training data, unless specified otherwise. The step-sizes are chosen to ensure convergence toward the steady-state BER. The DFE steady-state BER results in the convergence plots are given by , where is the Q-function [29, page 40] and the SINR is given in (15). The simulation results demonstrating complete agreement with this theory-based result are omitted to avoid unnecessary clutter in the figures to follow.

The DFE adapted with the RLS algorithm [11] is also simulated as a benchmark for the LMS DFE and the LMS PEF + DFE. The forgetting factor and the regularization factor were found through trial and error and set to , , respectively, for all simulations.

The adaptive weights are initialized such that the main tap is set to one, resulting in the desired symbol being part of the output of the equalizer. The remaining taps are set to zero.

6.2. Convergence Results

In previous works [3, 39], the convergence has been viewed through the adaptive weights, even though they may not be unique [18]. As mentioned above in Section 3.1, the convergence of the weights may lag behind the MSE convergence if the eigenvalues are small. Similarly, the weight convergence does not provide an indication of how the BER behaves during the transient period. Thus, the convergence results are shown in terms of a sliding BER window, discussed in Section 3.2.

Figure 6 demonstrates the convergence of the LMS DFE, the LMS PEF + DFE, and the RLS DFE in relation to the steady-state BER for SIR = −20 dB. The number of taps is set such that , and the step-sizes for each structure are . The LMS PEF + DFE is seen to converge significantly faster than the LMS DFE. Specifically, the LMS PEF + DFE converges to a BER of in approximately 450 symbols (or iterations, as adaptation takes place at the symbol rate), while the LMS DFE converges in approximately 20 000 symbols. An improvement of two orders of magnitude is obtained by implementing the LMS PEF + DFE structure instead of the LMS DFE structure for this particular scenario. In the case of the RLS DFE, convergence to a BER of occurs in 150 symbols. As expected, RLS provides faster convergence because it whitens the input by using the inverse correlation matrix. This improved convergence comes at the cost of higher complexity. For example, in the context of echo cancellation, it has been shown that the implementation of RLS in floating point on the 32 bit, 16 MIPS, 1 serial port, TMS320C31 requires 20 times the number of machine cycles that LMS does [40].

Figure 6: Convergence comparison of the LMS DFE, the LMS PEF + DFE, and the RLS DFE for SNR = 9 dB, SIR = −20 dB, .

Figure 7 is a plot of the convergence for the above scenario when the SIR = −30 dB. The step-sizes for this case are . Again, the time required for convergence of the LMS PEF + DFE is dramatically less than for the convergence of the LMS DFE. The LMS PEF + DFE converges in 3000 symbols, while the LMS DFE requires 200 000 symbols. The RLS DFE requires 160 symbols to converge for this case.

Figure 7: Convergence comparison of the LMS DFE, the LMS PEF + DFE, and the RLS DFE for SNR = 9 dB, SIR = −30 dB, .

Finally, Figure 8 shows the convergence of the two systems when the number of filter coefficients for each stage is doubled, namely, and SIR = −20 dB. The step-sizes for this scenario are . The LMS PEF + DFE converges in 300 symbols and the LMS DFE converges in 10 000 symbols. The RLS DFE converges in 130 symbols. Doubling the complexity reduces the convergence time of the LMS DFE and the LMS PEF + DFE more than that of the RLS DFE. Note that increasing the order will eventually lead to a degradation in the performance due to the increase of gradient noise. This degradation is observed when increasing the number of taps from (in Figure 6) to (in Figure 8).

Figure 8: Convergence comparison of the LMS DFE, the LMS PEF + DFE, and the RLS DFE for SNR = 9 dB, SIR = −20 dB, .
6.2.1. Blind Implementation

In this section, we examine the convergence of the blind implementation discussed in Section 5.4. This algorithm allows the LMS PEF to converge before the LMS DFE that follows it is turned on. Let represent the number of symbols that is allocated to allow for PEF convergence. This system is compared to two other cases. The first is the scenario where all the transmitted symbols are considered as training data (similar to the results shown above). The second scenario demonstrates the convergence when a subset of the symbols is used for training, while the adaptive algorithm operates in decision-directed mode for the remaining symbols. We refer to this case as the decision-directed algorithm. The number of training symbols used for this case will also be equal to .

Figure 9 demonstrates the BER convergence of the three discussed cases in relation to the steady-state BER for SIR = −20 dB and symbols. The number of taps is set such that , and the step-sizes for each structure are and . The performance of both the blind algorithm and the decision-directed algorithm deviates from the case of using all training data. This is due to propagation of feedback errors that cause more errors. Observe that the blind algorithm produces faster convergence and better BER performance than the decision-directed algorithm.

Figure 9: Convergence comparison of the different LMS PEF + DFE implementations for SNR = 9 dB, SIR = −20 dB, .

Figure 10 demonstrates the BER convergence of the three discussed cases in relation to the steady-state BER for SIR = −20 dB, however now symbols. The blind algorithm outperforms the decision-directed algorithm in terms of both convergence and BER. The degradation of the decision-directed algorithm arises from the fact that the number of training symbols used does not allow the feedback weights to approach their steady-state values before switching to decision-directed mode.

Figure 10: Convergence comparison of the different LMS PEF + DFE implementations for SNR = 9 dB, SIR = −20 dB, .
6.3. BER Results

Figure 11 is a plot of the steady-state BER results for the DFE and PEF + DFE for SIR = −20 dB and varying filter orders. The performance of ideal QPSK is plotted as a reference. The performance of the PEF + DFE is seen to be approximately the same as the performance of the DFE when both structures are operating in steady state. This validates the analysis performed in Section 5.3. It is also seen that the performance of the systems improves as the number of filter taps is increased, approaching the performance of QPSK. The improvement results from the increased spectral resolution provided by the larger number of taps in the feedforward section of each system.

Figure 11: Steady-state BER results of the DFE and the PEF + DFE for SIR = −20 dB and . DFE results obtained using optimal weights given in (11)–(14), PEF + DFE results obtained using optimal weights given in (22), (44), and (45).

Figure 12 demonstrates the BER results of the LMS PEF + DFE blind implementation in comparison to the steady-state PEF + DFE results. For the blind implementation, the DFE is turned on after symbols and the BER is calculated over the last 2500 symbols. The step-sizes are chosen for convergence to the steady-state BER and are noted in Table 1. This table also gives the average number of symbols required to obtain a BER of for the blind implementation when SNR = 10 dB. A convergence value equal to indicates that the blind algorithm has converged to the target BER after the first windowed calculation. It is clear that there is a small degradation in the BER when implementing the blind version of the PEF + DFE algorithm. This degradation is attributed to the combination of the misadjustment of the adaptive algorithm and the presence of uncanceled interference that causes feedback errors. Note that this degradation in BER becomes smaller as the number of parameters is increased. This occurs because a larger number of taps allows for more of the interference to be canceled, thereby reducing the number of feedback errors.

Table 1: Step-sizes and Convergence (at SNR = 10 dB) for LMS PEF + DFE Blind Implementation.
Figure 12: Steady-state BER results of PEF + DFE and the BER for the LMS blind implementation for SIR = −20 dB and . PEF + DFE steady-state results obtained using optimal weights given in (22), (44), and (45).

7. Conclusion

We investigated the response of the LMS DFE in the presence of severe narrowband interference. Due to the absence of a reference for the interference, the convergence time for this equalizer may be unacceptably slow for use in some realistic applications. The proposed system of an LMS PEF as a prefilter to the equalizer is shown to provide a solution to this problem. This two-stage system was shown to reduce the convergence time, in terms of reaching a BER of , by two orders of magnitude. An added benefit is that the steady-state BER for the two-stage system approximates that of the LMS DFE-only. Thus, it is possible to improve the convergence results of the LMS DFE, by splitting the system into an LMS prediction-error filter and a separate LMS DFE while not significantly degrading the steady-state BER results. The convergence results were also benchmarked against the DFE adapted with the RLS algorithm, which demonstrated faster convergence at the cost of higher complexity. A blind implementation (i.e., no training symbols are needed) that reduces complexity at the cost of a small degradation in the steady-state BER is also discussed.

Acknowledgments

A portion of the material in this paper was presented at the European Signal Processing Conference (EUSIPCO), Florence, Italy, September, 2006. This work was supported by the Office of Naval Research, Code 313, SPAWAR Systems Center, San Diego, and the UCSD Center for Wireless Communications (UCDG Grant no. Com 06-10216). The authors would like to thank the two anonymous reviewers whose comments and suggestions greatly improved the presentation of the material in this paper.

References

  1. S. U. H. Qureshi, “Adaptive equalization,” Proceedings of the IEEE, vol. 73, no. 9, pp. 1349–1387, 1985.
  2. B. Sklar, “Rayleigh fading channels in mobile digital communication systems—I: Characterization,” IEEE Communications Magazine, vol. 35, no. 7, pp. 90–100, 1997.
  3. L.-M. Li and L. Milstein, “Rejection of CW interference in QPSK systems using decision-feedback filters,” IEEE Transactions on Communications, vol. 31, no. 4, pp. 473–483, 1983.
  4. J. Makhoul, “Linear prediction: A tutorial review,” Proceedings of the IEEE, vol. 63, no. 4, pp. 561–580, 1975.
  5. J. R. Zeidler, “Performance analysis of LMS adaptive prediction filters,” Proceedings of the IEEE, vol. 78, no. 12, pp. 1781–1806, 1990.
  6. L.-M. Li and L. B. Milstein, “Rejection of narrow-band interference in PN spread-spectrum systems using transversal filters,” IEEE Transactions on Communications, vol. 30, no. 5, pp. 925–928, 1982.
  7. M. J. Reed and B. Liu, “An analysis of LMS adaptive two-sided transversal filters,” in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP '91), vol. 3, pp. 2145–2148, Toronto, Ont, Canada, April 1991.
  8. F. Symons, Jr., “Narrow-band interference rejection using the complex linear prediction filter,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 1, pp. 94–98, 1978.
  9. L. B. Milstein, “Interference rejection techniques in spread spectrum communications,” Proceedings of the IEEE, vol. 76, no. 6, pp. 657–671, 1988.
  10. J. D. Laster and J. H. Reed, “Interference rejection in digital wireless communications,” IEEE Signal Processing Magazine, vol. 14, no. 3, pp. 37–62, 1997.
  11. S. Haykin, Adaptive Filter Theory, Prentice Hall, Upper Saddle River, NJ, USA, 4th edition, 2002.
  12. B. Hassibi, “On the robustness of LMS filters,” in Least-Mean-Square Adaptive Filters, S. Haykin and B. Widrow, Eds., pp. 105–144, John Wiley & Sons, Hoboken, NJ, USA, 2003, chapter 4.
  13. O. M. Macchi and N. J. Bershad, “Adaptive recovery of a chirped sinusoid in noise—I: Performance of the RLS algorithm,” IEEE Transactions on Signal Processing, vol. 39, no. 3, pp. 583–594, 1991.
  14. N. J. Bershad and O. M. Macchi, “Adaptive recovery of a chirped sinusoid in noise—II: Performance of the LMS algorithm,” IEEE Transactions on Signal Processing, vol. 39, no. 3, pp. 595–602, 1991.
  15. O. M. Macchi, N. J. Bershad, and M. Mboup, “Steady-state superiority of LMS over LS for time-varying line enhancer in noisy environment,” IEE Proceedings, Part F: Radar and Signal Processing, vol. 138, no. 4, pp. 354–360, 1991.
  16. P. C. Wei, J. Han, J. R. Zeidler, and W. H. Ku, “Comparative tracking performance of the LMS and RLS algorithms for chirped narrowband signal recovery,” IEEE Transactions on Signal Processing, vol. 50, no. 7, pp. 1602–1609, 2002.
  17. S. Haykin, A. H. Sayed, J. R. Zeidler, P. Yee, and P. C. Wei, “Adaptive tracking of linear time-variant systems by extended RLS algorithms,” IEEE Transactions on Signal Processing, vol. 45, no. 5, pp. 1118–1128, 1997.
  18. A. A. Beex and J. R. Zeidler, “Steady-state dynamic weight behavior in (N)LMS adaptive filters,” in Least-Mean-Square Adaptive Filters, S. Haykin and B. Widrow, Eds., pp. 335–443, John Wiley & Sons, Hoboken, NJ, USA, 2003, chapter 9.
  19. M. Reuter and J. R. Zeidler, “Nonlinear effects in LMS adaptive equalizers,” IEEE Transactions on Signal Processing, vol. 47, no. 6, pp. 1570–1579, 1999.
  20. A. A. Beex and J. R. Zeidler, “Non-Wiener effects in recursive least squares adaptation,” in Proceedings of the 17th International Symposium on Signal Processing and Its Applications (ISSPA '03), vol. 2, pp. 595–598, Paris, France, July 2003.
  21. E. H. Satorius, S. Krishnan, X.-L. Yu, L. J. Griffiths, I. S. Reed, and T.-K. Truong, “Suppression of narrowband interference via single channel adaptive preprocessing,” in Proceedings of the 22nd Asilomar Conference on Signals, Systems and Computers, vol. 1, pp. 270–273, Pacific Grove, Calif, USA, October 1988.
  22. R. P. Gooch and B. Daellenbach, “Prevention of interference capture in a blind (CMA-based) adaptive receive filter,” in Proceedings of the 23rd Annual Asilomar Conference on Signals, Systems and Computers, vol. 2, pp. 898–902, Pacific Grove, Calif, USA, October 1989.
  23. M. J. Shensa, “Time constants and learning curves of LMS adaptive filters,” Tech. Rep. 312, Naval Ocean Systems Center, San Diego, Calif, USA, 1978.
  24. M. J. Shensa, “The spectral dynamics of evolving LMS adaptive filters,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '79), vol. 4, pp. 950–953, Washington, DC, USA, April 1979.
  25. J. R. Barry, E. A. Lee, and D. G. Messerschmitt, Digital Communication, Kluwer Academic Press, Boston, Mass, USA, 3rd edition, 2004.
  26. J. P. Odenwalder, “Error control coding handbook,” Linkabit, San Diego, Calif, USA, July 1976.
  27. P. Monsen, “Adaptive equalization of the slow fading channel,” IEEE Transactions on Communications, vol. 22, no. 8, pp. 1064–1075, 1974.
  28. J. E. Smee and N. C. Beaulieu, “Error-rate evaluation of linear equalization and decision feedback equalization with error propagation,” IEEE Transactions on Communications, vol. 46, no. 5, pp. 656–665, 1998.
  29. J. G. Proakis, Digital Communications, McGraw-Hill, Boston, Mass, USA, 4th edition, 2001.
  30. J. R. Zeidler, E. Satorius, D. Chabries, and H. Wexler, “Adaptive enhancement of multiple sinusoids in uncorrelated noise,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 3, pp. 240–254, 1978.
  31. A. Dembo, “Bounds on the extreme eigenvalues of positive-definite Toeplitz matrices,” IEEE Transactions on Information Theory, vol. 34, no. 2, pp. 352–355, 1988.
  32. F. Riera-Palou, J. M. Noras, and D. G. M. Cruickshank, “Analysis of the decision delay effect on the convergence of gradient recursive decision feedback equalizers,” in Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP '02), vol. 3, pp. 2665–2668, Orlando, Fla, USA, May 2002.
  33. J. G. Proakis and D. G. Manolakis, Introduction to Digital Signal Processing, MacMillian, New York, NY, USA, 1998.
  34. D. T. M. Slock, “Blind fractionally-spaced equalization, perfect-reconstruction filter banks and multichannel linear prediction,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94), vol. 4, pp. 585–588, Adelaide, Australia, April 1994.
  35. K. Abed-Meraim, E. Moulines, and P. Loubaton, “Prediction error method for second-order blind identification,” IEEE Transactions on Signal Processing, vol. 45, no. 3, pp. 694–705, 1997.
  36. D. Gesbert and P. Duhamel, “Unbiased blind adaptive channel identification and equalization,” IEEE Transactions on Signal Processing, vol. 48, no. 1, pp. 148–158, 2000.
  37. J. R. Treichler, “Transient and convergent behavior of the adaptive line enhancer,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 27, no. 1, pp. 53–62, 1979.
  38. A. V. Oppenheim, A. S. Willsky, and S. H. Nawab, Signals & Systems, Prentice-Hall, Upper Saddle River, NJ, USA, 2nd edition, 1997.
  39. A. Batra, J. R. Zeidler, and A. A. Beex, “Mitigation of narrowband interference using adaptive equalizers,” in Proceedings of the 14th European Signal Processing Conference (EUSIPCO '06), Florence, Italy, September 2006.
  40. S. G. Sankaran, Implementation and evaluation of echo cancellation algorithms, M.S. thesis, Virginia Tech, Blacksburg, VA, USA, December 1996.