Abstract
It has previously been shown that a least-mean-square (LMS) decision-feedback filter can mitigate the effect of narrowband
interference (L.-M. Li and L. Milstein, 1983). An adaptive implementation of the filter was shown to converge relatively quickly for mild interference. It is shown here, however, that in the case of severe narrowband interference, the LMS decision-feedback equalizer (DFE) requires
a very large number of training symbols for convergence, making it unsuitable for some types of communication systems. This
paper investigates the introduction of an LMS prediction-error filter (PEF) as a prefilter to the equalizer and demonstrates that it
reduces the convergence time of the two-stage system by as much as two orders of magnitude. It is also shown that the steady-state bit-error rate (BER) performance of the proposed system is still approximately equal to that attained in steady-state by the LMS DFE-only. Finally, it is shown that the two-stage system can be implemented without the use of training symbols. This two-stage structure lowers the complexity of the overall system by reducing the number of filter taps that need to be adapted, while incurring a slight loss in the steady-state BER.
1. Introduction
Maintaining reliable wireless communication
performance is a challenging problem because of channel impairments such as
fading, intersymbol interference (ISI), narrowband interference, and noise.
Therefore, there is a need for innovative receivers which can mitigate these
impairments rapidly, especially when the information is being transferred in
small packets or short bursts.
There has been a considerable amount of work on mitigating
the effects of ISI (see [1] and references therein)
and fading channels (see [2] and references therein).
The focus of this paper is
on techniques that can quickly mitigate strong narrowband interference.
Narrowband interference typically occurs because of nonlinearities in the mixer
or by other communication systems radiating in the same frequency band (as
occurs in many of the unlicensed bands, e.g., Bluetooth is a narrowband
interferer for WLAN systems). A strong interferer can make recovering the
transmitted information quite challenging.
Several
methods for suppressing narrowband interference have been discussed in the
literature. A linear equalizer (LE) and a decision-feedback equalizer (DFE)
were studied in [3].
It was shown that the
performance of the DFE is better than that of the LE. The LE seen in both
systems removes the interference, while the additional feedback taps of the DFE
enable the cancellation of the post-cursor ISI that is induced by the LE.
Linear prediction [4, 5] is another common technique
that has been used in direct-sequence CDMA systems [6–8] when the processing gain does
not provide enough immunity to the interference. When the signal of interest is
wideband compared to the bandwidth of the interferer, linear prediction
predicts the current value of the interference from past samples. When the
structure is implemented as a prediction-error filter, the estimate of the
interference is removed at the cost of some signal distortion. A further review
of interference suppression techniques can be found in [9, 10].
When the statistics of the interference are known, the weights of these systems are
found by minimizing the mean-squared error [11] (or equivalently by solving
the Wiener-Hopf equation). In practice, however, this type of a priori
information is not available. Thus, these systems are best implemented
adaptively. Of the many algorithms available, we focus on a low-complexity
method, specifically the least-mean square (LMS) algorithm [11].
The LMS algorithm is also
noted for its robustness and improved tracking performance [11, 12]. The drawback of this
particular algorithm is its slow convergence when there is a large disparity in
the eigenvalues of the input signal [11]. Slow convergence leads to
the need for a large number of training symbols. These symbols do not transmit
any new information, reducing the overall throughput of the system.
Conventional analyses of adaptive algorithms use the mean-squared error (MSE)
as the metric when investigating the convergence. However, since BER is a more definitive
performance metric for analyzing communication systems, the convergence is
viewed in terms of the BER with the aid of a sliding window. Convergence is
defined as the number of symbols needed to attain a certain BER.
Although
it has been shown that alternate adaptive algorithms, such as the recursive
least squares (RLS) algorithm [11], provide improved convergence
relative to the LMS algorithm in cases of high eigenvalue disparity, there are
many reasons why LMS is chosen for practical communications system
applications. Hassibi discusses [12] some
of the fundamental
differences in the performance of gradient-based estimators such as the LMS
algorithm and time-averaged recursive estimators such as the RLS algorithm in
the cases of modeling errors and incomplete statistical information concerning
the input signal, interference, and noise parameters. Hassibi [12] examines the conditions for
which LMS can be shown to be more robust to variations and uncertainties in the
signaling environment than RLS. LMS has also been shown to track more
accurately than RLS because it is able to base the filter updates on the
instantaneous error rather than the time-averaged error [13–16]. The improved tracking
performance of LMS over RLS for a linear chirp input is well established [11, 16]. In [17] it is shown that an extended
RLS filter that estimates the chirp rate of the input signal can minimize the
tracking errors associated with the RLS algorithm and provides performance that
exceeds that of LMS. It should be noted, however, that the improved tracking
performance requires a significant increase in computational complexity and
knowledge that the underlying variations in the input signal can be accurately
modeled by a linear FM chirp. For cases where the input is not accurately
represented by the linear chirp model, performance can be expected to be
significantly worse than simply using an LMS estimator, for the reasons
discussed in [12]. The computational complexity
of RLS, in particular for high-order systems, favors the use of LMS. The latter
is also more robust in fixed-point implementations. In addition, the LMS
estimator has been shown to provide nonlinear, time-varying weight dynamics
that allow the LMS filter to perform significantly better than the
time-invariant Wiener filter in several cases of practical interest [18, 19].
It is further shown that the
improved performance associated with these non-Wiener effects is difficult to
realize for RLS estimators due to the time averaging that is inherent in the
estimation process [20].
In
this paper, we first demonstrate that the LMS DFE possesses an extended
convergence time (greater than 10,000 symbols for the cases investigated here)
when severe narrowband interference (SIR
dB) is present, due to the fact that the
equalizer does not have a true reference for the interference. To reduce the
convergence time and the number of training symbols needed, we propose a
two-stage system that uses an LMS prediction-error filter (PEF) as a prefilter
to the LMS DFE-only. For strong interference the PEF generates a direct reference
for the interference from past samples and mitigates it prior to equalization.
A
two-stage system employing a
linear predictor has been
previously investigated [21, 22] in combination with the
constant modulus algorithm (CMA). The prediction filter is employed to mitigate
the interference and ensure that the CMA locks on to the signal of interest.
The prediction filter is not used specifically for its convergence properties.
The two-stage structure in this paper uses a supervised algorithm for the
adaptation of the second structure and is developed with the goal of improving
the convergence of the overall system.
The
second contribution of this paper is to show that the two-stage system reduces
the number of training symbols required to reach a BER of
by two orders of magnitude without
substantially degrading the steady-state BER performance as compared to the LMS
DFE-only case. All comparisons will be made under the condition that the LMS
DFE-only and the two-stage structure have the same total number of taps. The two-stage
system's adaptive implementation is superior due to the fact that the
prediction-error filter utilizes the narrowband nature of the interference to
obtain a beneficial initialization point. On the other hand, the LMS DFE-only
employs only the training symbols which have no knowledge of the statistical
characteristics of the interference.
Finally, the two-stage system may be
implemented in a manner that does not require any training symbols. The PEF is
inherently a blind algorithm because the error signal is determined from the
current sample and the past samples. A relationship between the PEF weights and
the DFE feedback weights is obtained, allowing the DFE to be operated in
decision-directed mode after convergence of the PEF weights. This technique
outperforms the nonblind decision-directed implementation when a small number
of training symbols is used. The nonblind decision-directed implementation
suffers because the feedback weights lie far from their steady-state values
prior to the switch to decision-directed mode. This blind method also allows
for a reduction in the complexity of the system (i.e., fewer weights that need
to be adapted) at the cost of a slight increase in steady-state BER.
The
paper is organized as follows. Section 2 describes the system model. The LMS
algorithm and its convergence properties are reviewed in Section 3. In Section
4, the previous approaches of the DFE and the PEF are discussed. The proposed
two-stage system is revealed in Section 5 along with its relation to the DFE. A
blind implementation for the proposed system is also presented in Section 5. In
Section 6, the convergence and steady-state BER results are presented.
Concluding remarks are given in Section 7.
2. System Model
A complex baseband representation of a single-carrier communication system is
depicted in Figure 1. The signal of interest,
,
is composed of i.i.d. symbols, drawn
from an arbitrary QAM constellation, with average power equal to
.
It is passed through a pulse shaping filter that is necessary for bandlimited
transmission. This signal is corrupted by narrowband interference,
, modeled as a pure complex exponential and additive white Gaussian noise. A
matched filter is employed at the receiver to maximize the signal-to-noise
ratio (SNR) at the output of the filter. Note that the overall frequency
response of the pulse shape and the matched filter is assumed to satisfy
Nyquist's criterion for no intersymbol interference (ISI) and the filters
operate at the symbol rate.
Figure 1: Discrete-time system model.
The signal at the input to the equalizer,
,
is defined as
(1)
where
is the symbol duration,
is the interferer power,
is the angular frequency of the interferer,
and
is a random phase that is uniformly distributed
between
and
.
The additive noise,
,
is modeled as a zero-mean Gaussian random process with
variance
.
The signal-to-noise ratio is defined as SNR =
and the signal-to-interference ratio is
defined as SIR =
.
It
is assumed that the communication signal, interferer, and noise are uncorrelated
to each other. The autocorrelation function of the input,
,
is defined as
(2)
where
is the expectation operator,
indicates conjugation, and
is the Kronecker delta function.
3. LMS Algorithm
The LMS algorithm [11] is defined by the following three
equations:
(3)
where
is the input vector to the equalizer,
is the vector of adapted tap weights,
is the desired signal,
is the output of the decision-device when
is its input,
is the error signal,
is the step-size parameter, and
represents conjugate (Hermitian) transpose.
Note
that there are two stages associated with the adaptive algorithm. The first
stage is the training phase, where known training symbols are used to push the
filter in the direction of the optimal weights. After the training symbols have
been exhausted, the algorithm switches to decision-directed mode. The output of
the decision device is used as the desired symbol when calculating the error
signal. Ideally, at the end of the training phase the output of the filter is
close to the desired signal.
3.1. LMS Convergence
In conventional analyses, convergence refers to the asymptotic progress of either
the adaptive weights or the MSE toward the optimal solutions. The convergence
(as well as the stability) of the system is dependent on the step-size. The
step-size parameter is chosen in a manner to guarantee convergence in the
mean-square sense, namely,
(4)
where
is the maximum eigenvalue of the input
autocorrelation matrix.
Assuming that the adaptive weights and the input vector are independent, Shensa [23] showed that the convergence
of the weight vector can be expressed as
(5)
where
are the eigenvalues and
are the eigenvectors of the input
autocorrelation matrix. The optimal Wiener solution is represented by
.
A similar equation arises for the convergence of the mean-square error (MSE) [24], when gradient noise (on the
order of
) is neglected
(6)
Letting the learning curve be approximated by
a single exponential allows a time constant [11] to be defined for each mode,
(7)
The maximum modal time constant is associated
with the minimum eigenvalue,
(8)
This maximal time constant can be seen to be a conservative estimate by examining (5) more closely. The
convergence will be influenced only by those eigenvalues for which the
projection of the corresponding eigenvector on the optimal weights is large.
Lastly, it can be seen for the case of
,
that it is possible for the convergence of the filter output (mean-square
error) to be faster than the convergence of the filter weights. This is because
there may be fewer modes controlling the MSE convergence (i.e., when
).
The equations above provide excellent insight into the convergence of the LMS
algorithm; however, in this paper, we are interested in the convergence in a
limited time interval when the metric of interest is BER. Therefore, we define
the convergence to be the average number of training symbols needed to achieve
a BER of
.
This value is consistent with the notion that the BER should be less than
when switching from training to
decision-directed mode [25]. Additionally, using a convolutional code with an input BER equal to
is equivalent to a BER of
at the output of the decoder [26].
3.2. Sliding BER Window
As mentioned above, the convergence of an
adaptive filter is viewed by the ensemble average learning curve [11], a plot of the MSE versus
iteration. Note that in this work, each iteration of the adaptive algorithm
occurs at the symbol rate. To examine the convergence of the BER here, we
employ a sliding window of
symbols. For example, the first BER value
corresponds to the average number of bit errors over symbols 1 through 100; the
second value corresponds to the average number of bit errors over symbols 2
through 101; and so forth. These values are then averaged for
trials. A general formula for BPSK modulation
can be seen as
(9)
where
is the
transmitted symbol of the
packet and
is the decision of the
symbol of the
packet. Note that the minimum nonzero BER
value will be equal to
.
4. Previous Approaches
4.1. Decision-Feedback Equalizer
4.1.1. Equalizer Structure
The DFE is composed of a transversal feedforward filter with
-taps (one main tap and
side taps) and a feedback filter that has
-taps. A block diagram of the DFE is shown in
Figure 2. The output of the filter,
,
with inputs
and
is
(10)
where
is the estimate of the symbol
out of the decision device. Note that
are the tap weights associated with the
feedforward filter, and
are the tap weights associated with the
feedback filter. During the training phase,
in (10) equals
.
Figure 2: Decision-feedback equalizer block diagram.
The
feedback taps allow the equalizer to cancel out post-cursor ISI based on the
estimated decisions without enhancing the noise. The BER analysis of the DFE
with error propagation can be accomplished utilizing Markov chains to model the
term
as contents of a shift register and the
assumption that the fed back decisions are perfect [3, 27–29]. The number of states in the
Markov chain grows exponentially with the number of feedback taps.
4.1.2. DFE Optimal Weights
The optimal weights under the minimum mean-square error (MMSE) criterion can be
found using the orthogonality principle [11].
equations are obtained, and the weights can be
found using the method described in [3, 30]. The optimal DFE tap weights
are given by
(11)
(12)
(13)
(14)
Observe
that the weight of the feedback taps (14) is the negative of the
feedforward side taps (12) when
.
This implies that if the data fed back is perfect, the ISI caused by the
previous data symbols will be completely
canceled. Also note that (13) is a scaled
multiple of (12). This scaling value
effectively removes the influence of the associated data symbols that can not
be canceled by the feedback taps. For the special case of
,
it can be seen that if the data fed back is perfect, the ISI caused by the
feedforward equalizer will be completely canceled, leaving only the symbol of
interest.
4.1.3. DFE SINR Calculation
The signal-to-interference-plus-noise ratio (SINR) at the input to the decision
device of the DFE can be found using (10) and the optimal weights given
in (11)–(14) to be
(15)
4.1.4. Autocorrelation Structure
The input to the decision-feedback equalizer is the concatenation of the received
input to the equalizer and the fed back decisions, given by
,
where
is the transpose operator. The vector,
,
is composed of the fed back decisions that are assumed to be correct, and is
thus defined as
(16)
The autocorrelation matrix for the
-tap feedforward and
-tap feedback equalizer is defined as
(17)
The autocorrelation matrix seen in (17) is partitioned into 4
submatrices. The matrices on the diagonal are the autocorrelation matrix of the
received input to the equalizer and the autocorrelation matrix of the data
symbols, respectively. The values in the upper left submatrix are given by (2). The cross-correlation matrix
between the received input to the equalizer and the data symbols is located on
the off-diagonal.
4.1.5. Eigenvalues
There is no closed form expression for determining the eigenvalues of the correlation
matrix defined in (17). A method to bound the
eigenvalues of positive-definite Toeplitz matrices can be found in
[31] and its application to the
correlation matrix given in (17) can be found in [32]. However, for the case of
and
,
the minimum and maximum eigenvalues are found to be
(18)
and the eigenvalue spread is
(19)
Note that the eigenvalues given in (18) are not a function of
.
4.1.6. Convergence Properties
The projection of any of the eigenvectors on
the optimal weight vector is nonzero. This implies that the time constant (8) is inversely proportional to the
minimum eigenvalue
.
The delay in convergence can be attributed to the fact that the DFE does not
have a direct reference for the interferer during adaptation and is thus forced
to converge on the basis of the training data only. The feedback taps converge
slower than the feedforward taps because the DFE is designed such that the
interferer is canceled by the feedforward taps, while the feedback taps attempt
to cancel out the signal distortion caused by the feedforward taps [3].
4.2. Prediction Filter
4.2.1. Predictor Structure
The linear predictor (LP) is a structure that uses the correlation between past
samples to form an estimate of the current sample [11, 25, 33]. A variant of this filter,
the prediction-error filter (PEF), has the property that it removes the
correlation between samples, thereby whitening the spectrum. A common example
of this property is seen when determining the parameters of an autoregressive
(AR) process. The prediction-error filter (assuming a sufficient filter order)
of such an input provides both the AR parameters and a white output sequence
that is equal to the innovations process.
This
technique has also been used to remove narrowband interference in many
applications [6–8, 29, 30]. The filter is able to
predict the interferer due to its narrowband properties. A block diagram of the
prediction-error filter is shown in Figure 3. The PEF is a transversal
filter with
taps. The decorrelation delay (
) ensures that the signal of interest at the
current sample is decorrelated from the samples in the filter when calculating
the error term. Because the data is i.i.d.,
is a sufficient choice, giving the one-step
predictor. The linear combination of the weighted input samples,
,
forms an estimate of the interferer, given by
(20)
where
are the tap weights of the predictor. The
output of the PEF,
, is defined as the subtraction of the estimate
of the interference given in (20) from
the current input sample
(21)
Note that
is also the error term of the structure. This
implies that the PEF is in fact a blind algorithm. It does not require any
training symbols when calculating the error term.
Figure 3: Prediction-error filter block diagram.
4.2.2. Predictor Optimal Weights
The optimal tap weights can be found in a way
similar to those for the equalizer above [3, 30]. Using the orthogonality
principle,
equations are obtained and the weights of the
PEF are given by
(22)
where
is equal to
(23)
For the scenario of interest in this paper,
the interference power is much larger than both the signal power and the noise
power. Therefore, the SIR and the noise-to-interference ratio (NIR) can be
assumed to be very small (i.e., SIR
dB, NIR
dB [3]) and
can be approximated as
(24)
4.2.3. Sensitivity to Additive Noise
The PEF has been shown to be sensitive to additive noise when used for channel
estimation [34, 35]. An algorithm was proposed in [36] to provide adaptive
estimation of unbiased linear predictors with the goal of obtaining a
consistent estimate of an ISI single-input multiple-output (SIMO) channel. To
examine the effect of the additive noise on the PEF for this problem, we are
interested in the noise-free predictor weights, given by
(25)
where
is equal to
(26)
We compare (25) with the biased predictor
weights given in (22) and look at the norm of the
difference (bias),
(27)
This bias can be approximated using the
assumptions that the SIR and NIR are very small to give
(28)
The value in (28) is quite small due to the
assumption that the NIR is small. Thus, in this work, the bias in the linear
predictor does not substantially affect the system's performance.
4.2.4. Autocorrelation Structure
The
input autocorrelation matrix for the PEF is
defined as
(29)
where the components of the matrix are given
by (2).
4.2.5. Eigenvalues
The eigenvalues for the correlation matrix given by (2) and (29) can be found [7, 23, 37] to be equal to
(30)
The eigenvalue spread is defined [11] as
(31)
4.2.6. Convergence Properties
In this case the
eigenvectors corresponding to the minimum
eigenvalues are orthogonal to the optimal weight vector, hence these
eigenvalues do not affect the convergence [23]. Thus the time constant is
dependent only upon the maximum eigenvalue
.
4.2.7. Output Autocorrelation
The whitening property of the PEF can be seen more clearly through the autocorrelation function of the output of the PEF, which is derived to be
(32)
An approximation for the output
autocorrelation function in (32) can be found using the
approximation given in (24),
(33)
Finally, letting the filter order increase
toward infinity shows that the output spectrum is approximately white,
(34)
4.2.8. Eigenvalue Spread
The effect of the PEF is that the interference is removed, which then results in
the reduction of the eigenvalue spread. This can be seen in Figure 4 for
dB, SIR
dB, and
.
Also in the plot is the eigenvalue spread of the received data given by (31). Note that it is assumed that
.
It is clearly seen that the spread has been reduced, and the modes of this
input to the LMS DFE will converge in similar amounts of time.
Figure 4: Eigenvalue spread
of input to DFE-only and output of PEF for

dB, SIR

dB, and

.
5. Two-Stage System
As discussed in Section 4.2.7, the PEF provides an
approximately white output spectrum when an infinite number of filter taps is
used. Each additional tap provides an increase in spectral resolution when
notching out the narrowband interference. However, the implementation of a
large number of taps is not generally feasible and some distortion in the form
of postcursor ISI will be present. To combat the distortion induced by the PEF,
the DFE is a simple structure that removes the ISI without enhancing the noise.
This leads to a simple two-stage structure that uses the PEF for rapid
convergence and the DFE for removing postcursor ISI as a system to mitigate
narrowband interference.
A similar approach is discussed in [25, pages 364-365] when deriving the zero-forcing
decision-feedback equalizer. Barry et al. demonstrate that the optimal DFE
precursor equalizer is related to optimal linear prediction. Consider
transmitting data through a channel that induces ISI. This distortion can be
removed by employing a linear zero-forcing equalizer, while causing the noise
samples at the output of the equalizer to be correlated. This correlation can
be subsequently removed with a PEF, at the expense of postcursor ISI. Finally,
a zero-forcing feedback postcursor equalizer removes the ISI without enhancing
the noise.
We now consider the performance of the PEF followed by the DFE, which will be
abbreviated as PEF + DFE. A block diagram of the two-stage structure is shown in
Figure 5. The PEF is tasked with
whitening the spectrum by removing the interference, but due to its limited
length it will introduce postcursor ISI; this ISI is then removed by the DFE.
The DFE is designed to have a one tap feedforward section and an
-tap feedback section. In general, there is no
need for a feedforward section, because the input is distorted with only
postcursor ISI that can be resolved by the feedback equalizer portion. We have
chosen to include the one tap to compensate for any phase shifts that might
exist because of phase errors, and/or gain mismatch between the transmitter and
receiver.
Figure 5: Two-stage structure (PEF + DFE) block diagram.
5.1. Feedback Filter Order Estimation
We can estimate the optimal feedback filter order by looking at the output of the
DFE. Assuming that the feedforward filter weight is equal to
and the decisions fed back are perfect, let
the output be defined as
(35)
We would like to find the weights that
minimize the error,
(36)
Taking the derivative of the expected value
term and setting this result to zero, the optimal weights are given by
(37)
When
,
the optimal choice for the feedback filter order is
.
This ensures that the ISI caused by the PEF is removed. With these choices and
the assumption that the interference is canceled by the PEF, the output of the
DFE is given by
(38)
5.2. Optimal Equalizer Weights After Prediction-Error Filtering
The DFE possesses a
-tap feedforward section and an
-tap feedback section. The optimal weights for
the DFE are found by solving the Wiener-Hopf equations [11, 19]. The feedforward weight is
equal to
.
The output autocorrelation matrix
reduces to a scalar value due to the 1-tap
feedforward filter and is defined as
(39)
The latter term is given in (32).
is defined as
(40)
where the components of
are given by
(41)
Finally,
is defined as
(42)
The feedback weights are defined as
.
5.3. Steady-State Equivalence
The two-stage structure can be viewed in a different manner when operating in
steady-state. Based on linear system theory, two linear time-invariant (LTI)
systems can be combined into one LTI structure [38, pages 107-108]. For example, the PEF weights
given in (22) and the feedforward weight of
the subsequent DFE (
) can be combined to form an extended
feedforward filter (
) of a DFE with one main tap and
side taps. This is accomplished by
(43)
where “
” represents linear convolution. The feedback
taps remain the same, that is
.
Observe that
and
are the weights of a DFE operating in steady
state. The case of interest is when
and
(as postulated in Section 5.1).
Solving,
and
for the weights gives
(44)
(45)
The extended feedforward filter weights can
be found according to (43),
(46)
(47)
(48)
Note that the feedback weights remain the
same, namely (45) is equal to (48).
As mentioned previously in
Section 4.2.2, the scenario of interest
occurs when the interference dominates the signal of interest and the noise.
Equations (46)–(48) can be approximated in this
region using (24) to give
(49)
As a comparison to (49), the DFE-only weights
described by (11)–(14) need to be approximated for
the assumption of small SIR and NIR as well. Letting
,
so that there are
taps in the feedforward section and
taps in the feedback section, the DFE-only
weights are approximated as
(50)
Comparing (49) and (50), it can be seen that
combining the two-stage weights approximates the weights of the DFE-only.
5.4. Blind Implementation
The previous sections established a relationship between the PEF weights, the
feedforward weight, and the feedback weights. Note that in Section 5.1 the feedback weights are
equal to the PEF weights associated with past data symbols scaled by the
feedforward tap weighting. Also, recall that the weights of the PEF rapidly
converge and the structure does not require knowledge of training symbols. With
and
,
the two-stage system can be implemented in a manner where the feedback tap
weights are not adapted. After the PEF weights have converged, the
multiplication of the PEF weights and the feedforward weight is used as the
feedback weights. The feedforward tap is initialized to unity and is adapted in
decision-directed mode. Thus, no explicit training symbols are required during
the adaptation process. This method also reduces the complexity of the system;
only
of the total
tap weights are adapted. In the scenario where
there is a phase and/or gain error, the system requires the use of either
training symbols to adapt the feedforward weight or a phase locked loop (PLL)
and automatic gain control (AGC). Observe that these two components can be
implemented in a decision-directed manner with no need for training symbols.
6. Results
6.1. Simulation Parameters
In the simulation results to follow, a QPSK constellation is utilized and the
dB. For convergence results, a 100-symbol
window was used and the BER values are averaged over 1000 runs. The interferer
frequency is located at DC (
). All of the data were considered as training
data, unless specified otherwise. The step-sizes are chosen to ensure
convergence toward the steady-state BER. The DFE steady-state BER results in
the convergence plots are given by
,
where
is the Q-function [29, page 40] and the SINR is given in (15). The simulation results
demonstrating complete agreement with this theory-based result are omitted to
avoid unnecessary clutter in the figures to follow.
The
DFE adapted with the RLS algorithm [11] is also simulated as a
benchmark for the LMS DFE and the LMS PEF + DFE. The forgetting factor and the
regularization factor were found through trial and error and set to
,
,
respectively, for all simulations.
The
adaptive weights are initialized such that the main tap is set to one,
resulting in the desired symbol being part of the output of the equalizer. The
remaining taps are set to zero.
6.2. Convergence Results
In previous works [3, 39], the convergence has been
viewed through the adaptive weights, even though they may not be unique [18]. As mentioned above in
Section 3.1, the convergence of the
weights may lag behind the MSE convergence if the eigenvalues are small.
Similarly, the weight convergence does not provide an indication of how the BER
behaves during the transient period. Thus, the convergence results are shown in
terms of a sliding BER window, discussed in Section 3.2.
Figure 6 demonstrates the convergence
of the LMS DFE, the LMS PEF + DFE, and the RLS DFE in relation to the
steady-state BER for SIR = −20 dB. The number of taps is set such that
,
and the step-sizes for each structure are
.
The LMS PEF + DFE is seen to converge significantly faster than the LMS DFE.
Specifically, the LMS PEF + DFE converges to a BER of
in approximately 450 symbols (or iterations,
as adaptation takes place at the symbol rate), while the LMS DFE converges in
approximately 20 000 symbols. An improvement of two orders of magnitude is
obtained by implementing the LMS PEF + DFE structure instead of the LMS DFE
structure for this particular scenario. In the case of the RLS DFE, convergence
to a BER of
occurs in 150 symbols. As expected, RLS
provides faster convergence because it whitens the input by using the inverse
correlation matrix. This improved convergence comes at the cost of higher
complexity. For example, in the context of echo cancellation, it has been shown
that the implementation of RLS in floating point on the 32 bit, 16 MIPS, 1
serial port, TMS320C31 requires 20 times the number of machine cycles that LMS
does [40].
Figure 6: Convergence
comparison of the LMS DFE, the LMS PEF + DFE, and the RLS DFE for SNR = 9 dB, SIR
= −20 dB,


.
Figure 7 is a plot of the convergence
for the above scenario when the SIR = −30 dB. The step-sizes for this case are
.
Again, the time required for convergence of the LMS PEF + DFE is dramatically
less than for the convergence of the LMS DFE. The LMS PEF + DFE converges in 3000
symbols, while the LMS DFE requires 200 000 symbols. The RLS DFE requires 160
symbols to converge for this case.
Figure 7: Convergence
comparison of the LMS DFE, the LMS PEF + DFE, and the RLS DFE for SNR = 9 dB, SIR
= −30 dB,


.
Finally, Figure 8 shows the convergence of the
two systems when the number of filter coefficients for each stage is doubled,
namely,
and SIR = −20 dB. The step-sizes for this
scenario are
.
The LMS PEF + DFE converges in 300 symbols and the LMS DFE converges in 10
000 symbols. The
RLS DFE converges in 130 symbols. Doubling the complexity reduces the convergence time of the LMS DFE and the LMS PEF + DFE more than that of the RLS DFE. Note that increasing the order will
eventually lead to a degradation in the performance due to the increase of
gradient noise. This degradation is observed when increasing the number of taps
from
(in Figure 6) to
(in Figure 8).
Figure 8: Convergence
comparison of the LMS DFE, the LMS PEF + DFE, and the RLS DFE for SNR = 9 dB, SIR
= −20 dB,


.
6.2.1. Blind Implementation
In this section, we examine the convergence of the blind implementation discussed
in Section 5.4. This algorithm allows the
LMS PEF to converge before the LMS DFE that follows it is turned on. Let
represent the number of symbols that is
allocated to allow for PEF convergence. This system is compared to two other
cases. The first is the scenario where all the transmitted symbols are
considered as training data (similar to the results shown above). The second
scenario demonstrates the convergence when a subset of the symbols is used for
training, while the adaptive algorithm operates in decision-directed mode for
the remaining symbols. We refer to this case as the decision-directed
algorithm. The number of training symbols used for this case will also be equal
to
.
Figure 9 demonstrates the BER
convergence of the three discussed cases in relation to the steady-state BER
for SIR = −20 dB and
symbols. The number of taps is set such that
,
and the step-sizes for each structure are
and
.
The performance of both the blind algorithm and the decision-directed algorithm
deviates from the case of using all training data. This is due to propagation
of feedback errors that cause more errors. Observe that the blind algorithm
produces faster convergence and better BER performance than the
decision-directed algorithm.
Figure 9: Convergence comparison of the different LMS PEF + DFE implementations for SNR = 9 dB, SIR = −20 dB,

.
Figure 10 demonstrates the BER convergence
of the three discussed cases in relation to the steady-state BER for SIR = −20 dB, however now
symbols. The blind algorithm outperforms the
decision-directed algorithm in terms of both convergence and BER. The
degradation of the decision-directed algorithm arises from the fact that the
number of training symbols used does not allow the feedback weights to approach
their steady-state values before switching to decision-directed mode.
Figure 10: Convergence comparison of the different LMS PEF + DFE implementations for SNR = 9 dB, SIR = −20 dB,

.
6.3. BER Results
Figure 11 is a plot of the steady-state
BER results for the DFE and PEF + DFE for SIR = −20 dB and varying filter orders.
The performance of ideal QPSK is plotted as a reference. The performance of the
PEF + DFE is seen to be approximately the same as the performance of the DFE when
both structures are operating in steady state. This validates the analysis
performed in Section 5.3. It is also seen that the
performance of the systems improves as the number of filter taps is increased,
approaching the performance of QPSK. The improvement results from the increased
spectral resolution provided by the larger number of taps in the feedforward
section of each system.
Figure 11: Steady-state BER results
of the DFE and the PEF + DFE for SIR = −20 dB and

.
DFE results obtained using optimal weights given in (
11)–(
14), PEF + DFE results obtained using
optimal weights given in (
22), (
44), and (
45).
Figure 12 demonstrates the BER results
of the LMS PEF + DFE blind implementation in comparison to the steady-state
PEF + DFE results. For the blind implementation, the DFE is turned on after
symbols and the BER is calculated over the
last 2500 symbols. The step-sizes are chosen for convergence to the
steady-state BER and are noted in Table 1. This table also gives the
average number of symbols required to obtain a BER of
for the blind implementation when SNR = 10 dB.
A convergence value equal to
indicates that the blind algorithm has
converged to the target BER after the first windowed calculation. It is clear
that there is a small degradation in the BER when implementing the blind
version of the PEF + DFE algorithm. This degradation is attributed to the
combination of the misadjustment of the adaptive algorithm and the presence of
uncanceled interference that causes feedback errors. Note that this degradation
in BER becomes smaller as the number of parameters is increased. This occurs
because a larger number of taps allows for more of the interference to be
canceled, thereby reducing the number of feedback errors.
Table 1: Step-sizes and Convergence (at SNR = 10 dB)
for LMS PEF + DFE Blind Implementation.
Figure 12: Steady-state BER results of
PEF + DFE and the BER for the LMS blind implementation for SIR = −20 dB and

.
PEF + DFE steady-state results obtained using optimal weights given in (
22), (
44), and (
45).
7. Conclusion
We investigated the response of the LMS DFE in
the presence of severe narrowband interference. Due to the absence of a
reference for the interference, the convergence time for this equalizer may be
unacceptably slow for use in some realistic applications. The proposed system
of an LMS PEF as a prefilter to the equalizer is shown to provide a solution to
this problem. This two-stage system was shown to reduce the convergence time,
in terms of reaching a BER of
,
by two orders of magnitude. An added benefit is that the steady-state BER for
the two-stage system approximates that of the LMS DFE-only. Thus, it is
possible to improve the convergence results of the LMS DFE, by splitting the
system into an LMS prediction-error filter and a separate LMS DFE while not
significantly degrading the steady-state BER results. The convergence results
were also benchmarked against the DFE adapted with the RLS algorithm, which
demonstrated faster convergence at the cost of higher complexity. A blind
implementation (i.e., no training symbols are needed) that reduces complexity at
the cost of a small degradation in the steady-state BER is also discussed.
Acknowledgments
A portion of the material in this paper was
presented at the European Signal Processing Conference (EUSIPCO), Florence, Italy,
September, 2006. This work was supported by the Office of Naval Research, Code
313, SPAWAR Systems Center, San Diego, and the UCSD Center for Wireless
Communications (UCDG Grant no. Com 06-10216). The
authors would like to thank the two anonymous reviewers whose comments and
suggestions greatly improved the presentation of the
material in this paper.
References
- S. U. H. Qureshi, “Adaptive equalization,” Proceedings of the IEEE, vol. 73, no. 9, pp. 1349–1387, 1985.
- B. Sklar, “Rayleigh fading channels in mobile digital communication systems—I: Characterization,” IEEE Communications Magazine, vol. 35, no. 7, pp. 90–100, 1997.
- L.-M. Li and L. Milstein, “Rejection of CW interference in QPSK systems using decision-feedback filters,” IEEE Transactions on Communications, vol. 31, no. 4, pp. 473–483, 1983.
- J. Makhoul, “Linear prediction: A tutorial review,” Proceedings of the IEEE, vol. 63, no. 4, pp. 561–580, 1975.
- J. R. Zeidler, “Performance analysis of LMS adaptive prediction filters,” Proceedings of the IEEE, vol. 78, no. 12, pp. 1781–1806, 1990.
- L.-M. Li and L. B. Milstein, “Rejection of narrow-band interference in PN spread-spectrum systems using transversal filters,” IEEE Transactions on Communications, vol. 30, no. 5, pp. 925–928, 1982.
- M. J. Reed and B. Liu, “An analysis of LMS adaptive two-sided transversal filters,” in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP '91), vol. 3, pp. 2145–2148, Toronto, Ont, Canada, April 1991.
- F. Symons, Jr., “Narrow-band interference rejection using the complex linear prediction filter,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 1, pp. 94–98, 1978.
- L. B. Milstein, “Interference rejection techniques in spread spectrum communications,” Proceedings of the IEEE, vol. 76, no. 6, pp. 657–671, 1988.
- J. D. Laster and J. H. Reed, “Interference rejection in digital wireless communications,” IEEE Signal Processing Magazine, vol. 14, no. 3, pp. 37–62, 1997.
- S. Haykin, Adaptive Filter Theory, Prentice Hall, Upper Saddle River, NJ, USA, 4th edition, 2002.
- B. Hassibi, “On the robustness of LMS filters,” in Least-Mean-Square Adaptive Filters, S. Haykin and B. Widrow, Eds., pp. 105–144, John Wiley & Sons, Hoboken, NJ, USA, 2003, chapter 4.
- O. M. Macchi and N. J. Bershad, “Adaptive recovery of a chirped sinusoid in noise—I: Performance of the RLS algorithm,” IEEE Transactions on Signal Processing, vol. 39, no. 3, pp. 583–594, 1991.
- N. J. Bershad and O. M. Macchi, “Adaptive recovery of a chirped sinusoid in noise—II: Performance of the LMS algorithm,” IEEE Transactions on Signal Processing, vol. 39, no. 3, pp. 595–602, 1991.
- O. M. Macchi, N. J. Bershad, and M. Mboup, “Steady-state superiority of LMS over LS for time-varying line enhancer in noisy environment,” IEE Proceedings, Part F: Radar and Signal Processing, vol. 138, no. 4, pp. 354–360, 1991.
- P. C. Wei, J. Han, J. R. Zeidler, and W. H. Ku, “Comparative tracking performance of the LMS and RLS algorithms for chirped narrowband signal recovery,” IEEE Transactions on Signal Processing, vol. 50, no. 7, pp. 1602–1609, 2002.
- S. Haykin, A. H. Sayed, J. R. Zeidler, P. Yee, and P. C. Wei, “Adaptive tracking of linear time-variant systems by extended RLS algorithms,” IEEE Transactions on Signal Processing, vol. 45, no. 5, pp. 1118–1128, 1997.
- A. A. Beex and J. R. Zeidler, “Steady-state dynamic weight behavior in (N)LMS adaptive filters,” in Least-Mean-Square Adaptive Filters, S. Haykin and B. Widrow, Eds., pp. 335–443, John Wiley & Sons, Hoboken, NJ, USA, 2003, chapter 9.
- M. Reuter and J. R. Zeidler, “Nonlinear effects in LMS adaptive equalizers,” IEEE Transactions on Signal Processing, vol. 47, no. 6, pp. 1570–1579, 1999.
- A. A. Beex and J. R. Zeidler, “Non-Wiener effects in recursive least squares adaptation,” in Proceedings of the 17th International Symposium on Signal Processing and Its Applications (ISSPA '03), vol. 2, pp. 595–598, Paris, France, July 2003.
- E. H. Satorius, S. Krishnan, X.-L. Yu, L. J. Griffiths, I. S. Reed, and T.-K. Truong, “Suppression of narrowband interference via single channel adaptive preprocessing,” in Proceedings of the 22nd Asilomar Conference on Signals, Systems and Computers, vol. 1, pp. 270–273, Pacific Grove, Calif, USA, October 1988.
- R. P. Gooch and B. Daellenbach, “Prevention of interference capture in a blind (CMA-based) adaptive receive filter,” in Proceedings of the 23rd Annual Asilomar Conference on Signals, Systems and Computers, vol. 2, pp. 898–902, Pacific Grove, Calif, USA, October 1989.
- M. J. Shensa, “Time constants and learning curves of LMS adaptive filters,” Tech. Rep. 312, Naval Ocean Systems Center, San Diego, Calif, USA, 1978.
- M. J. Shensa, “The spectral dynamics of evolving LMS adaptive filters,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '79), vol. 4, pp. 950–953, Washington, DC, USA, April 1979.
- J. R. Barry, E. A. Lee, and D. G. Messerschmitt, Digital Communication, Kluwer Academic Press, Boston, Mass, USA, 3rd edition, 2004.
- J. P. Odenwalder, “Error control coding handbook,” Linkabit, San Diego, Calif, USA, July 1976.
- P. Monsen, “Adaptive equalization of the slow fading channel,” IEEE Transactions on Communications, vol. 22, no. 8, pp. 1064–1075, 1974.
- J. E. Smee and N. C. Beaulieu, “Error-rate evaluation of linear equalization and decision feedback equalization with error propagation,” IEEE Transactions on Communications, vol. 46, no. 5, pp. 656–665, 1998.
- J. G. Proakis, Digital Communications, McGraw-Hill, Boston, Mass, USA, 4th edition, 2001.
- J. R. Zeidler, E. Satorius, D. Chabries, and H. Wexler, “Adaptive enhancement of multiple sinusoids in uncorrelated noise,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 26, no. 3, pp. 240–254, 1978.
- A. Dembo, “Bounds on the extreme eigenvalues of positive-definite Toeplitz matrices,” IEEE Transactions on Information Theory, vol. 34, no. 2, pp. 352–355, 1988.
- F. Riera-Palou, J. M. Noras, and D. G. M. Cruickshank, “Analysis of the decision delay effect on the convergence of gradient recursive decision feedback equalizers,” in Proceedings of the IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP '02), vol. 3, pp. 2665–2668, Orlando, Fla, USA, May 2002.
- J. G. Proakis and D. G. Manolakis, Introduction to Digital Signal Processing, MacMillian, New York, NY, USA, 1998.
- D. T. M. Slock, “Blind fractionally-spaced equalization, perfect-reconstruction filter banks and multichannel linear prediction,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94), vol. 4, pp. 585–588, Adelaide, Australia, April 1994.
- K. Abed-Meraim, E. Moulines, and P. Loubaton, “Prediction error method for second-order blind identification,” IEEE Transactions on Signal Processing, vol. 45, no. 3, pp. 694–705, 1997.
- D. Gesbert and P. Duhamel, “Unbiased blind adaptive channel identification and equalization,” IEEE Transactions on Signal Processing, vol. 48, no. 1, pp. 148–158, 2000.
- J. R. Treichler, “Transient and convergent behavior of the adaptive line enhancer,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 27, no. 1, pp. 53–62, 1979.
- A. V. Oppenheim, A. S. Willsky, and S. H. Nawab, Signals & Systems, Prentice-Hall, Upper Saddle River, NJ, USA, 2nd edition, 1997.
- A. Batra, J. R. Zeidler, and A. A. Beex, “Mitigation of narrowband interference using adaptive equalizers,” in Proceedings of the 14th European Signal Processing Conference (EUSIPCO '06), Florence, Italy, September 2006.
- S. G. Sankaran, Implementation and evaluation of echo cancellation algorithms, M.S. thesis, Virginia Tech, Blacksburg, VA, USA, December 1996.