EURASIP Journal on Advances in Signal Processing
Volume 2010 (2010), Article ID 172751, 12 pages
doi:10.1155/2010/172751
Research Article

Optimized Paraunitary Filter Banks for Time-Frequency Channel Diagonalization

Communications Laboratory, University of Kassel, Wilhelmshöher Allee 73, 34121 Kassel, Germany

Received 17 June 2009; Revised 7 November 2009; Accepted 31 December 2009

Academic Editor: Faouzi Bader

Copyright © 2010 Ziyang Ju et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

We adopt the concept of channel diagonalization to time-frequency signal expansions obtained by DFT filter banks. As a generalization of the frequency domain channel representation used by conventional orthogonal frequency-division multiplexing receivers, the time-frequency domain channel diagonalization can be applied to time-variant channels and aperiodic signals. An inherent error in the case of doubly dispersive channels can be limited by choosing adequate windows underlying the filter banks. We derive a formula for the mean-squared sample error in the case of wide-sense stationary uncorrelated scattering (WSSUS) channels, which serves as objective function in the window optimization. Furthermore, an enhanced scheme for the parameterization of tight Gabor frames enables us to constrain the window in order to define paraunitary filter banks. We show that the design of windows optimized for WSSUS channels with known statistical properties can be formulated as a convex optimization problem. The performance of the resulting windows is investigated under different channel conditions, for different oversampling factors, and compared against the performance of alternative windows. Finally, a generic matched filter receiver incorporating the proposed channel diagonalization is discussed which may be essential for future reconfigurable radio systems.

1. Introduction

Motivated by the heterogeneity of today's world of wireless communications—which includes cellular mobile radio systems of the second and third generations and beyond, wireless local and personal area networks, broadband wireless access systems, digital audio and video broadcast, emerging peer-to-peer radio, and so forth—particular attention is given to reconfigurable radio architectures. Essential in this context are radio resource management solutions on the higher layers and the ability to comply with a range of different air interfaces on the physical layer. Devices comprising the logic for handling multiple air interfaces in the form of parallel implementations are widely available. However, in view of the still increasing number of standards, monolithic transceiver architectures are desirable which enable a uniform processing of different signals by means of reconfigurable multipurpose signal processing units.

A major challenge in the design of a universal baseband receiver architecture is posed by the dispersive radio channel. For dealing with signal dispersion, fundamentally different approaches are followed in traditional radios depending on the type of modulation. Receivers for single-carrier signals typically model the channel as a tapped delay line. For known coefficients of the delay line, the information in the transmitted signal can be recovered by means of a matched filtering followed by a sequence detector or using instead an equalizer followed by a simple detector. The complexity of the coefficient estimation and detection schemes increases with the delay dispersion and thus with the number of taps. Orthogonal frequency-division multiplexing (OFDM) can evade the need for complex equalizers in high data rate systems. The cyclic extensions in OFDM signals facilitate a frequency domain representation of the multipath channel in the form of parallel single-tap lines. On the basis of the frequency domain signal description resulting from the block-wise Discrete Fourier Transform (DFT), the signal mapping by multipath channels can be represented as diagonal matrices. This channel diagonalization enables straightforward demodulation and coefficient estimation and has, along with the availability of Fast Fourier Transform (FFT) algorithms, led to the popularity of OFDM.

The aforementioned approach for a simple channel inversion based on a frequency domain description is not limited to OFDM receivers. Single-carrier modulation with frequency domain equalization (FDE) can achieve similar performance as OFDM if a proper cyclic prefix is appended to each block of signals [1]. In [2] the computational complexities of time and frequency domain equalizers are compared and it is shown that FDE is simpler when the length of the stationary channel impulse response exceeds the sample time by a factor of 5 or more. Processing signals without cyclic prefix result in errors at the block boundaries. These errors have a limited impact at sufficiently large block sizes, which makes FDE an interesting alternative for code-division multiple access receivers [3, 4].

The limitations of OFDM receivers and FDE to time-invariant channels and certain signal formats can be overcome by resorting to alternative signal representations. A natural choice for the signal transform is the discrete-time Gabor expansion [5] based on a system of time-frequency (TF) shifted versions of a certain window function. Even though a TF domain channel diagonalization based on such a Gabor expansion is approximative in the general case of time-variant channels and aperiodic signals, for the typical underspread channels encountered in mobile radio scenarios the inherent model error can be limited to a usually acceptable level by choosing an adequate window underlying the signal transform [6].

The transform of discrete-time signals into the TF domain can be accomplished by DFT filter banks, for which similarly efficient FFT-based implementations are available as for plain DFTs [7]. There is plenty of literature on filter bank design in the context of generalized multicarrier/multitone modulation in wireless/wired communications. Replacing the block-wise inverse DFT and DFT in the transmitter and receiver, respectively, by more general filter banks is a way to get rid of the rigid framework of rectangular windows and cyclic prefixes in OFDM systems. Interference between adjacent sub-bands or multicarrier symbols can be avoided, or at least limited, by choosing appropriate transmit pulses. Filter banks for transmission over dispersive channels with limited interchannel and intersymbol interference are designed in [814].

The optimization of filter banks for specific objective functions and constraints can sometimes be formulated as a convex optimization (CO) problem [12]. In [15], CO methods are employed for the design of a two-channel multirate filter bank, in [16] for the design of pulse shapes which minimize intercarrier interference due to frequency offsets in OFDM systems, in [17] for finding optimized prototype filters for filtered multitone modulation used in digital subscriber line systems, and in [18] for the design of filter banks for sub-band signal processing under minimal aliasing and induced distortion. Semidefinite programming (SDP), a branch of CO for which efficient numerical solution methods are available, was employed in [19] for the design of a linear phase prototype filter with high stopband attenuation for cosine-modulated filter banks. In [20] two-channel filter banks are optimized under similar criteria by SDP.

In this paper we are not concerned with the design of transmit pulses. Rather, we optimize filter banks in the context of channel diagonalization. We are interested exclusively in paraunitary filter banks, which are related to the concept of tight Gabor frames [21]. The signal transform associated with discrete-time tight Gabor frames fulfills Parseval's identity. This property is crucial for flexible receivers as it lets the correlation between two time domain signals be computed based on the respective TF signal representations. A main concern of this paper is the design of tight Gabor frames facilitating TF domain channel diagonalization with minimal model error for given channel conditions. More specifically, we minimize the mean-squared error (MSE) resulting from the diagonalization of random channels with known second-order statistical properties, complying with the wide-sense stationary uncorrelated scattering (WSSUS) model, with respect to the TF window function. As we showed in [6], window functions minimizing the MSE appearing in the TF domain can be computed by SDP. In this paper we directly focus on the more relevant MSE in the time domain signal. We show that for weak assumptions on the channel statistics, the optimization problem can likewise be turned into a tractable form through semidefinite relaxation. In order to be able to constrain the windows to constitute tight frames, we extend the parameterization of tight Gabor frames presented in [22]. Optimized windows can then be computed off-line for different channel conditions encountered by reconfigurable receivers, such as the generic matched filter-based inner receiver discussed in this paper.

1.1. Outline of This Paper

In Section 2, the mathematical concepts for TF representation and processing of signals are introduced. A parameterization of tight Gabor frames, needed for the constrained optimization in Section 5 is presented in Section 3. In Section 4, TF domain channel diagonalization is discussed, resulting in a certain error in the case of doubly dispersive channels. As shown in Section 5, semidefinite relaxation lets the window design problem be formulated as a CO problem. Numerical results are shown in Section 6 for different channel conditions. In Section 7, a generic matched filter architecture incorporating the channel diagonalization is presented. Finally, conclusions are drawn in Section 8.

1.2. Notation

We enclose the arguments of functions defined on a discrete domain Λ in square brackets in order to distinguish them from functions defined on 𝑛 . The Hilbert space of the square summable functions 𝑓 Λ is denoted as 𝐿 2 ( Λ ) , and the associated inner product 𝑓 , 𝑔 and 𝐿 2 -norm 𝑓 are given by 𝑖 Λ 𝑓 [ 𝑖 ] 𝑔 [ 𝑖 ] and 𝑓 , 𝑓 , respectively, where the asterisk in the superscript denotes complex conjugation. Furthermore, we use to denote convolution, and for the one-by-one multiplication of two compatible functions 𝑓 and 𝑔 , that is, = 𝑓 𝑔 corresponds to [ 𝑖 ] = 𝑓 [ 𝑖 ] 𝑔 [ 𝑖 ] for all 𝑖 Λ . Vectors and matrices are denoted by boldface characters. The transpose and Hermitian transpose of a matrix 𝐗 are denoted as 𝐗 𝑇 and 𝐗 𝐻 , respectively, 𝐗 ( 𝑧 ) stands for the paraconjugate of a polynomial matrix 𝐗 ( 𝑧 ) ( 𝐗 ( 𝑧 ) is obtained from 𝐗 ( 𝑧 ) by transposing it, conjugating all of the coefficients of the rational functions in 𝐗 ( 𝑧 ) , and replacing 𝑧 by 𝑧 1 [7].), t r ( ) for the trace, and 𝐈 𝑁 denotes the identity matrix of size 𝑁 . The 𝑛 th element of the 𝑚 th row of a matrix 𝐗 is represented as [ 𝐗 ] 𝑚 , 𝑛 . Also, 𝐸 [ ] denotes the expected value, ( ) and ( ) represent the real and imaginary parts, respectively, of complex arguments, m o d the modulo operation, 𝚥 1 , and 𝑥 m a x { 𝑛 𝑛 𝑥 } .

2. DFT Filter Banks and Discrete-Time Gabor Frames

In this section, we introduce signal representation concepts needed subsequently. Some important properties of discrete-time Gabor frames are recapitulated with an emphasis on tight frames and the relationship to DFT filter banks. For more insight into Gabor analysis and filter bank theory the reader is referred to the rich literature, for instance [7, 2326].

Let 𝑁 and 𝐾 be two positive integer constants and Λ × { 0 , , 𝐾 1 } . Given a window function 𝑔 𝐿 2 ( ) , the set

𝑔 , 𝑚 [ 𝑘 ] ( , 𝑚 ) Λ ( 1 ) with

𝑔 , 𝑚 [ 𝑘 ] 𝑔 [ 𝑘 𝑁 ] e x p 𝚥 2 𝜋 ( 𝑘 𝑁 ) 𝑚 𝐾 ( 2 ) is referred to as a Gabor system in 𝐿 2 ( ) . The elements of the Gabor system can be associated with the grid points { ( 𝑁 , 2 𝜋 𝑚 / 𝐾 ) ( , 𝑚 ) Λ } of a lattice overlaying the TF plane × [ 0 , 2 𝜋 ) . If there exist two positive constants 𝐴 0 and 𝐵 0 such that

𝐴 0 𝑥 2 ( , 𝑚 ) Λ | | 𝑥 , 𝑔 , 𝑚 | | 2 𝐵 0 𝑥 2 𝑥 𝐿 2 ( ) , ( 3 ) then (1) represents a discrete-time Gabor frame. A necessary condition for (3) is that 𝑁 / 𝐾 1 .

For an arbitrary signal 𝑥 𝐿 2 ( ) the inner products of 𝑥 [ 𝑘 ] with every element of the system (1) form a linear TF representation. In the following, the corresponding transform onto 𝐿 2 ( Λ ) is represented by the analysis operator

𝒢 𝑥 𝑋 , 𝑋 [ , 𝑚 ] = 𝑥 , 𝑔 , 𝑚 , ( , 𝑚 ) Λ . ( 4 ) The mapping (4) can be implemented by a 𝐾 -channel DFT (analysis) filter bank with a prototype filter with impulse response 𝑔 [ 𝑘 ] followed by a down-sampling by a factor 𝑁 [21]. Conversely, a synthesis operator 𝒢 can be defined based on (1) which maps an arbitrary TF representation 𝑌 𝐿 2 ( Λ ) onto an element of 𝐿 2 ( ) according to

𝒢 𝑌 ( , 𝑚 ) Λ 𝑌 [ , 𝑚 ] 𝑔 , 𝑚 [ 𝑘 ] . ( 5 ) The signal synthesis (5) can be implemented by an up-sampling by a factor 𝑁 followed by a 𝐾 -channel DFT (synthesis) filter bank with a prototype filter with impulse response 𝑔 [ 𝑘 ] .

If (3) holds with 𝐴 0 = 𝐵 0 = 1 then (1) represents a (normalized) tight Gabor frame and 𝒢 ( 𝒢 𝑥 ) = 𝑥 for all 𝑥 𝐿 2 ( ) . These special Gabor frames obey a generalized Parseval's identity

𝑥 2 = 𝒢 𝑥 2 𝑥 𝐿 2 ( ) . ( 6 ) Furthermore, the inner product 𝑥 , 𝑦 of any two 𝑥 , 𝑦 𝐿 2 ( ) can be computed on the basis of the respective TF representations 𝒢 𝑥 and 𝒢 𝑦 , that is,

𝑥 , 𝑦 = 𝒢 𝑥 , 𝒢 𝑦 𝑥 , 𝑦 𝐿 2 ( ) . ( 7 ) Henceforth we assume that (1) represents a tight Gabor frame. We note that the range 𝑔 { ( 𝒢 𝑥 ) [ 𝑘 ] 𝑥 𝐿 2 ( ) } of the operator 𝒢 is a subspace of 𝐿 2 ( Λ ) , and the mapping 𝒢 𝐿 2 ( ) 𝑔 is an isometry. If 𝑁 / 𝐾 < 1 the operator 𝒢 𝒢 represents the orthogonal projection from 𝐿 2 ( Λ ) onto 𝑔 . As a direct consequence,

𝒢 𝑋 2 𝑋 2 𝑋 𝐿 2 ( Λ ) ( 8 ) and 𝑔 2 = 𝑁 / 𝐾 .

Tight Gabor frames are associated with paraunitary DFT filter banks. To enable the design of windows with favorable properties, for instance in regard to TF concentration, it is often necessary to indeed choose 𝑁 < 𝐾 , resulting in oversampled filter banks. Besides of available efficient implementations of paraunitary DFT filter banks, the properties (6) and (7) are of prime interest for reconfigurable baseband receivers since they allow operations for the signal demodulation, such as signal energy computations and crosscorrelations with reference waveforms, to be performed directly in the TF domain.

3. Parameterization of Tight Gabor Frames

The conditions under which (1) represents a tight Gabor frame can be formulated via the polyphase representation. Let 𝑀 denote the least common multiple of 𝑁 and 𝐾 , and define 𝐿 and 𝐽 such that 𝐿 𝑁 = 𝑀 and 𝐽 𝐾 = 𝑀 . The 𝑀 -component polyphase representation of the 𝑧 -transform 𝐺 ( 𝑧 ) = 𝑘 = 𝑔 [ 𝑘 ] 𝑧 𝑘 of the window 𝑔 [ 𝑘 ] reads

𝐺 ( 𝑧 ) = 𝑀 1 𝑗 = 0 𝑧 𝑗 𝑅 𝑗 𝑧 𝑀 , ( 9 ) where

𝑅 𝑗 ( 𝑧 ) = 𝑘 𝑔 [ 𝑗 + 𝑘 𝑀 ] 𝑧 𝑘 . ( 1 0 ) Furthermore, the polyphase matrix 𝐆 P ( 𝑧 ) of size 𝐾 × 𝑁 associated with the DFT filter bank implementing (4) can be expressed as [22]

𝐆 P ( 𝑧 ) = 𝐅 𝐾 𝐕 ( 𝑧 ) ( 1 1 ) with 𝐅 𝐾 denoting the DFT matrix of size 𝐾 (defined as [ 𝐅 𝐾 ] 𝑚 , 𝑛 = 𝑒 𝚥 2 𝜋 ( 𝑚 1 ) ( 𝑛 1 ) / 𝐾 ) and

𝐕 ( 𝑧 ) 𝐈 𝐾 𝐈 𝐾 𝐽 d i a g 𝑅 0 𝑧 𝐿 , , 𝑅 𝑀 1 𝑧 𝐿 𝐈 𝑁 𝑧 1 𝐈 𝑁 𝑧 ( 𝐿 1 ) 𝐈 𝑁 . ( 1 2 ) Here, d i a g ( 𝑑 1 , , 𝑑 𝑁 ) is the diagonal matrix with diagonal elements 𝑑 1 , , 𝑑 𝑁 . The Gabor system (1) represents a tight frame in 𝐿 2 ( ) if and only if the polyphase matrix 𝐆 P ( 𝑧 ) is paraunitary with 𝐆 P ( 𝑧 ) 𝐆 P ( 𝑧 ) = 𝐈 𝑁 . Or, equivalently, if and only if the polynomial matrix 𝐕 ( 𝑧 ) is paraunitary with 𝐕 ( 𝑧 ) 𝐕 ( 𝑧 ) = 𝐾 1 𝐈 𝑁 , since 𝐅 𝐻 𝐾 𝐅 𝐾 = 𝐾 𝐈 𝐾 .

We observe that [ 𝐕 ( 𝑧 ) ] 𝑚 , 𝑛 = 0 if ( ( 𝑚 𝑛 ) m o d 𝐵 ) 0 , where 𝐵 = 𝑁 / 𝐽 = 𝐾 / 𝐿 . Consequently, 𝐕 ( 𝑧 ) is paraunitary if and only if the 𝐵 matrices 𝐕 0 ( 𝑧 ) , , 𝐕 𝐵 1 ( 𝑧 ) of size 𝐿 × 𝐽 , which comprise the possibly nonzero elements of 𝐕 ( 𝑧 ) according to [ 𝐕 𝑏 ( 𝑧 ) ] 𝑚 , 𝑛 = [ 𝐕 ( 𝑧 ) ] 1 + 𝐵 ( 𝑚 1 ) + 𝑏 , 1 + 𝐵 ( 𝑛 1 ) + 𝑏 , are all paraunitary. As follows from (12) the elements of the 𝐵 matrices are given as

𝐕 𝑏 ( 𝑧 ) 𝑚 , 𝑛 = 𝑧 𝑓 ( 𝑚 , 𝑛 ) / 𝐽 𝑅 𝐵 𝑓 ( 𝑚 , 𝑛 ) + 𝑏 𝑧 𝐿 , 𝑏 = 0 , , 𝐵 1 ( 1 3 ) with 𝑓 ( 𝑚 , 𝑛 ) 𝐽 1 𝑗 = 0 𝐿 1 = 0 ( 𝑚 + 𝑗 𝐿 1 ) 𝛿 𝑚 + 𝑗 𝐿 , 𝑛 + 𝐽 and 𝛿 𝑖 , 𝑗 denoting the Kronecker delta.

Note that if the sequences ( 𝑓 ( 𝑚 , 𝑛 ) / 𝐽 ) 𝑚 = 1 , , 𝐿 were identical for all column indices 𝑛 = 1 , , 𝐽 except for differing offsets, then the factor 𝑧 𝑓 ( 𝑚 , 𝑛 ) / 𝐽 could be omitted in (13) without affecting the condition 𝐕 𝑏 ( 𝑧 ) 𝐕 𝑏 ( 𝑧 ) = 𝐾 1 𝐈 𝐽 . Replacing some 𝑅 𝑚 ( 𝑧 𝐿 ) by the equivalent 𝑧 𝐿 𝑅 𝑀 + 𝑚 ( 𝑧 𝐿 ) is a way to align the sequences. Having this in mind, we define 𝐵 matrices 𝐖 0 ( 𝑧 ) , , 𝐖 𝐵 1 ( 𝑧 ) of size 𝐿 × 𝐽 according to

𝐖 𝑏 ( 𝑧 ) 𝑚 , 𝑛 = 𝑅 𝐵 𝑓 ( 𝑚 , 𝑛 ) + 𝑏 ( 𝑧 ) , 𝑏 = 0 , , 𝐵 1 ( 1 4 ) with the index map

𝑓 ( 𝑚 , 𝑛 ) 𝑓 ( 𝑚 , 𝑛 ) i f 𝑓 ( 𝑚 , 𝑛 ) 𝑓 ( 1 , 𝑛 ) 𝑓 ( 𝑚 , 𝑛 ) + 𝑀 𝐵 i f 𝑓 ( 𝑚 , 𝑛 ) < 𝑓 ( 1 , 𝑛 ) . ( 1 5 ) Since the polynomial matrices 𝐕 0 ( 𝑧 ) , , 𝐕 𝐵 1 ( 𝑧 ) are paraunitary if and only if the modified matrices 𝐖 0 ( 𝑧 ) , , 𝐖 𝐵 1 ( 𝑧 ) are paraunitary, the Gabor system (1) represents a tight frame in 𝐿 2 ( ) if and only if

𝐖 𝑏 ( 𝑧 ) 𝐖 𝑏 ( 𝑧 ) = 𝐾 1 𝐈 𝐽 𝑏 { 0 , , 𝐵 1 } . ( 1 6 )

We note that the size of each polynomial matrix 𝐖 𝑏 ( 𝑧 ) , their number 𝐵 , and the index map 𝑓 ( 𝑚 , 𝑛 ) are fully determined by 𝑁 and 𝐾 . Given the latter two constants, any tight Gabor frame is uniquely defined by an instance of 𝐖 0 ( 𝑧 ) , , 𝐖 𝐵 1 ( 𝑧 ) satisfying (16), where the association of the elements of the 𝐵 matrices with the samples of the window 𝑔 [ 𝑘 ] is defined by (14) and (10). The length of the window is related to the polynomial orders of the matrices 𝐖 0 ( 𝑧 ) , , 𝐖 𝐵 1 ( 𝑧 ) . We define 𝑃 as the maximal polynomial order of the 𝐵 matrices plus 1 . Thus, in the case 𝑃 = 1 , all elements of the matrices are scalars, and the support of the representable functions 𝑔 [ 𝑘 ] is limited to { 𝐵 𝑓 ( 𝑚 , 𝑛 ) + 𝑏 𝑚 = 1 , , 𝐿 ; 𝑛 = 1 , , 𝐽 ; 𝑏 = 0 , , 𝐵 1 } . This set is usually not of the form [ 𝑎 0 , 𝑏 0 ] for some 𝑎 0 𝑏 0 but exhibits “gaps” as illustrated in the example of Figure 1. By increasing 𝑃 longer windows can be found.

172751.fig.001
Figure 1: Support of the window functions 𝑔 [ 𝑘 ] representable by matrices 𝐖 0 ( 𝑧 ) , , 𝐖 𝐵 1 ( 𝑧 ) with maximal polynomial order 𝑃 1 for 𝐽 = 3 , 𝐿 = 4 , 𝑃 = 1 , 2 .

4. Time-Frequency Channel Diagonalization

The mapping 𝐿 2 ( ) 𝐿 2 ( ) of an input signal 𝑥 [ 𝑘 ] onto the signal 𝑦 [ 𝑘 ] ( 𝑥 ) [ 𝑘 ] at the output of a linear time-variant channel can be expressed as

𝑦 [ 𝑘 ] = 𝑞 = 0 𝑐 [ 𝑘 , 𝑞 ] 𝑥 [ 𝑘 𝑞 ] , ( 1 7 ) where 𝑐 [ 𝑘 , 𝑞 ] denotes the time-variant impulse response. We consider random channels where 𝑐 [ 𝑘 , 𝑞 ] represents a two-dimensional zero-mean random process complying with the WSSUS model. The second-order statistics of 𝑐 [ 𝑘 , 𝑞 ] are determined by the time correlation function 𝜙 𝑡 [ 𝑘 Δ ] and the delay power spectrum 𝑆 d e l a y [ 𝑞 ] according to

𝐸 𝑐 [ 𝑘 , 𝑞 ] 𝑐 𝑘 , 𝑞 = 𝜙 𝑡 𝑘 𝑘 𝑆 d e l a y [ 𝑞 ] 𝛿 𝑞 , 𝑞 . ( 1 8 ) The delay power spectrum is related to the frequency correlation function 𝜙 𝑓 ( 𝜔 Δ ) through

𝜙 𝑓 𝜔 Δ = 𝑞 = 0 𝑆 d e l a y [ 𝑞 ] 𝑒 𝚥 𝜔 Δ 𝑞 . ( 1 9 )

Of interest in the context of TF signal processing is the time-variant transfer function

𝐶 ( 𝑘 , 𝜔 ) = 𝑞 = 0 𝑐 [ 𝑘 , 𝑞 ] 𝑒 𝚥 𝜔 𝑞 , ( 2 0 ) reflecting the TF selectivity of a channel realization. In a digital receiver a realization of a doubly dispersive channel can be represented by a sampled version 𝐻 [ , 𝑚 ] of 𝐶 ( 𝑘 , 𝜔 ) , defined by

𝐻 [ , 𝑚 ] = 𝐶 𝑁 , 2 𝜋 𝑚 𝐾 , ( , 𝑚 ) Λ . ( 2 1 ) For compatibility with the TF signal representations introduced in Section 2, the sampling intervals 𝑁 and 2 𝜋 / 𝐾 are chosen in line with those for the Gabor system (1). The time-variant transfer function represents the complex-valued channel gain over time and frequency. Hence, given the TF representation 𝑋 𝒢 𝑥 of a signal 𝑥 [ 𝑘 ] at the channel input, it is straightforward to approximate the signal 𝑦 [ 𝑘 ] at the channel output as

̂ 𝑦 = 𝒢 ( 𝐻 𝑋 ) . ( 2 2 ) The approximation of a linear operator by 𝒢 ( 𝐻 𝒢 ( ) ) , that is, a concatenation of an analysis operation, an element-wise multiplication, and a synthesis operation, appears in the literature under the name Gabor multiplier [27]. Such an approximation is suitable for operators that do not involve TF shifts of large magnitude (i.e., underspread operators). Figure 2 shows an implementation of (22) by filter banks, where 𝐺 ( 𝑧 ) denotes the 𝑧 -transform of 𝑔 [ 𝑘 ] . The TF channel diagonalization offers several advantages. The flexibility in the choice of the sampling intervals 𝑁 and 2 𝜋 / 𝐾 can be used for the adaptation to different channel conditions or signal formats, or the limitation of the effort for the coefficient estimation in certain receivers. Furthermore, the channel diagonalization facilitates scalable and efficient receiver processing known from OFDM.

172751.fig.002
Figure 2: TF domain channel diagonalization with 𝐺 𝑘 ( 𝑧 ) 𝐺 ( 𝑧 𝑒 𝚥 2 𝜋 𝑘 / 𝐾 ) , 𝑘 = 0 , , 𝐾 1 .

As a result of the sampling of 𝐶 ( 𝑘 , 𝜔 ) the model (22) is usually only approximative, and ̂ 𝑦 [ 𝑘 ] is an approximation of the channel output. The accuracy of ̂ 𝑦 [ 𝑘 ] depends on the channel characteristics and the underlying Gabor frame. We may expect the model error to be limited if every elementary function 𝑔 , 𝑚 [ 𝑘 ] is concentrated around ( 𝑁 , 2 𝜋 𝑚 / 𝐾 ) in the TF plane such that 𝐶 ( 𝑘 , 𝜔 ) is essentially constant within the sphere of 𝑔 , 𝑚 [ 𝑘 ] . Window functions fulfilling this can be designed for the typical underspread channels encountered in mobile radio scenarios by CO, as shown in Section 5.

The error from the channel diagonalization is given by

̂ 𝑦 [ 𝑘 ] 𝑦 [ 𝑘 ] = 𝒢 ( 𝐻 𝒢 𝑥 ) [ 𝑘 ] ( 𝑥 ) [ 𝑘 ] . ( 2 3 ) In order to remain general in regard to signal and channel properties, we consider the error signal under the assumptions of

(i)a white random signal at the channel input,(ii)a random channel complying with the WSSUS model and unit average channel gain (i.e., 𝜙 𝑓 ( 0 ) = 𝜙 𝑡 [ 0 ] = 1 ).

To formulate the resulting MSE, we introduce the random signal 𝑥 𝑄 [ 𝑘 ] being subject to 𝐸 [ 𝑥 𝑄 [ 𝑘 ] ] = 0 and

𝐸 𝑥 𝑄 [ 𝑘 ] 𝑥 𝑄 𝑘 = 𝛿 𝑘 , 𝑘 f o r 𝑘 , 𝑘 𝑄 2 , 𝑄 2 0 o t h e r w i s e ( 2 4 ) with 𝑄 an even integer. The error signal corresponding to the truncated white random input signal 𝑥 𝑄 [ 𝑘 ] reads

𝜖 𝑄 [ 𝑘 ] 𝒢 𝐻 𝒢 𝑥 𝑄 [ 𝑘 ] 𝑥 𝑄 [ 𝑘 ] . ( 2 5 ) The error signal sample energy relative to the unit average sample energy of the desired signal, in the following termed relative mean-squared sample error (RMSSE), can be expressed as

𝜀 R M S S E ( 𝑔 ) = l i m 𝑄 𝐸 1 𝑄 𝑄 / 2 𝑘 = 𝑄 / 2 | | 𝜖 𝑄 [ 𝑘 ] | | 2 = l i m 𝑄 𝐸 1 𝑄 𝑄 / 2 𝑘 = 𝑄 / 2 | | | | | ( , 𝑚 ) Λ 𝐻 [ , 𝑚 ] 𝑥 𝑄 , 𝑔 , 𝑚 × 𝑔 , 𝑚 [ 𝑘 ] 𝑥 𝑄 [ 𝑘 ] | | | | 2 . ( 2 6 ) Making use of the above assumptions, the RMSSE can be written as

𝜀 R M S S E ( 𝑔 ) = 1 + 𝐾 𝑁 ( , 𝑚 ) Λ 𝜙 𝑡 [ 𝑁 ] 𝜙 𝑓 2 𝜋 𝑚 𝐾 | | 𝑔 , 𝑔 , 𝑚 | | 2 2 𝑔 𝑆 d e l a y 𝜙 𝑡 , 𝑔 ( 2 7 ) as shown in the appendix. Having formulated both conditions for the window 𝑔 [ 𝑘 ] to define a tight Gabor frame (in Section 3) and the error resulting from the channel diagonalization based on 𝑔 [ 𝑘 ] , we can now turn to window optimization.

5. Window Design

Let us represent the window to be optimized in vector form 𝐠 [ 𝑔 [ 𝑎 0 ] 𝑔 [ 𝑏 0 ] ] T , choosing 𝑎 0 , 𝑏 0 such that [ 𝑎 0 , 𝑏 0 ] comprises the support of 𝑔 [ 𝑘 ] expressed in Section 3. We consider only real-valued windows. Additionally, in order to eventually arrive at a CO problem, we impose the following restrictions on the channel statistics.

(i)The time correlation function is subject to 𝜙 𝑡 [ ] 0 for all , as being the case for two-sided exponentially decaying and many other symmetrical Doppler power spectra.(ii)The frequency correlation function fulfills 𝜙 𝑓 ( 2 𝜋 𝑚 / 𝐾 ) + 𝜙 𝑓 ( 2 𝜋 𝑚 / 𝐾 ) 0 for all 𝑚 { 0 , , 𝐾 1 } , as, for instance, in the case of exponentially decaying delay power spectra.

We note that | 𝑔 , 𝑔 , 𝑚 | 2 can be expressed as ( 𝐠 𝑇 ( 𝐂 , 𝑚 ) 𝐠 ) 2 + ( 𝐠 𝑇 ( 𝐂 , 𝑚 ) 𝐠 ) 2 and ( ( 𝑔 𝑆 d e l a y ) 𝜙 𝑡 , 𝑔 ) as 𝐠 𝑇 ( 𝐃 0 ) 𝐠 with appropriate square matrices 𝐂 , 𝑚 and 𝐃 0 . As a consequence, the objective function (27) can be expressed in the form

𝜀 R M S S E ( 𝐠 ) = 𝐹 𝑘 = 1 𝑐 𝑘 𝐠 𝑇 𝐂 𝑘 𝐠 2 + 𝐠 𝑇 𝐃 𝐠 + 1 ( 2 8 ) for some 𝐹 depending on the support of 𝑔 [ 𝑘 ] , where 𝐂 1 , , 𝐂 𝐹 , 𝐃 are real matrices and the constants 𝑐 1 , , 𝑐 𝐹 are positive given the above restrictions.

Next, we need to incorporate the constraints under which 𝜀 R M S S E ( 𝐠 ) will be minimized. In order to formulate the constraints (16) on the window in the time domain, it is helpful to permute the samples in 𝐠 . Let us introduce a window ( [ 0 ] , , [ 𝑇 1 ] ) of length 𝑇 = 𝐿 𝐽 𝑃 𝐵 defined as

[ 𝑘 ] = 𝑔 𝐵 𝑓 ( 𝑚 , 𝑛 ) + 𝑏 + 𝑀 𝑝 , 𝑘 = 0 , , 𝑇 1 ( 2 9 ) with 𝑚 = ( 𝑘 m o d 𝐿 ) + 1 , 𝑛 = ( 𝑘 m o d 𝐿 𝐽 ) / 𝐿 + 1 , 𝑝 = ( 𝑘 m o d 𝐿 𝐽 𝑃 ) / ( 𝐿 𝐽 ) , and 𝑏 = 𝑘 / ( 𝐿 𝐽 𝑃 ) . The matrices 𝐖 0 ( 𝑧 ) , , 𝐖 𝐵 1 ( 𝑧 ) and the samples of the permuted window are related through

𝐖 𝑏 ( 𝑧 ) 𝑚 , 𝑛 = 𝑃 1 𝑝 = 0 [ 𝐿 ( 𝐽 ( 𝑏 𝑃 + 𝑝 ) + 𝑛 1 ) + 𝑚 1 ] 𝑧 𝑝 . ( 3 0 ) With (30) we can now translate the polyphase domain constraints (16) into constraints on the permuted window defined by 𝐡 [ [ 0 ] [ 𝑇 1 ] ] 𝑇 .

( 1 ) Case 𝐵 = 1 . There are 𝐽 constraints of form 𝐡 𝑇 𝐀 𝐡 = 𝐾 1 . The th diagonal matrix 𝐀 of size 𝐿 𝐽 𝑃 × 𝐿 𝐽 𝑃 is defined as

𝐀 𝑚 , 𝑛 = 1 i f 𝑚 = 𝑛 , 𝑚 𝑝 = 0 , , 𝑃 1 { ( 𝑝 𝐽 + 1 ) 𝐿 + 1 , , ( 𝑝 𝐽 + ) 𝐿 } , 0 o t h e r w i s e , ( 3 1 ) with { 1 , , 𝐽 } . Additionally, there are 𝐽 2 𝑃 ( 𝐽 + 1 ) 𝐽 / 2 constraints of form 𝐡 𝑇 𝐀 𝐡 = 0 . The corresponding matrices 𝐀 𝐽 + 1 , , 𝐀 𝐽 2 𝑃 ( 𝐽 1 ) 𝐽 / 2 can be defined as the elements of the set resulting from deleting duplicate elements and zero-matrices from

𝐀 𝐿 𝐽 𝑃 × 𝐿 𝐽 𝑃 g i v e n a s [ 𝐀 ] 𝑚 , 𝑛 = 𝐀 𝑚 𝑗 𝐿 , 𝑛 + 𝐀 𝑚 , 𝑛 𝑗 𝐿 𝑗 = 1 , , 𝐽 𝑃 ; = 1 , , 𝐽 ( 3 2 ) where in (32) we let [ 𝐀 ] 𝑚 , 𝑛 equal zero if either 𝑚 { 1 , , 𝐿 𝐽 𝑃 } or 𝑛 { 1 , , 𝐿 𝐽 𝑃 } .

( 2 ) Case 𝐵 > 1 . From each of the above-defined matrices 𝐀 1 , , 𝐀 𝐽 2 𝑃 ( 𝐽 1 ) 𝐽 / 2 , 𝐵 unique block diagonal matrices of dimension 𝑇 × 𝑇 are reproduced which contain the original matrix as one of the 𝐵 diagonal blocks of dimension 𝐿 𝐽 𝑃 × 𝐿 𝐽 𝑃 . Hence, there are 𝑊 𝐵 ( 𝐽 2 𝑃 ( 𝐽 1 ) 𝐽 / 2 ) constraints in total. The constraint matrices are mutually orthogonal in the sense that t r ( 𝐀 𝐀 𝑇 𝑚 ) = 0 for 𝑚 .

We can now formulate the optimization problem in the form

m i n 𝐡 𝑇 𝐹 𝑘 = 1 𝑐 𝑘 𝐡 𝑇 𝐂 𝑘 𝐡 2 + 𝐡 𝑇 𝐃 𝐡 s u b j e c t t o 𝐡 𝑇 𝐀 𝐡 = 𝑑 , = 1 , , 𝑊 , ( 3 3 ) where 𝐂 1 , , 𝐂 𝐹 , 𝐃 are the matrices resulting from 𝐂 1 , , 𝐂 𝐹 , 𝐃 by permuting the rows and columns in accordance with (29), and 𝑑 { 𝐾 1 , 0 } . This problem is difficult to tackle for large 𝑇 . Let us thus introduce 𝐇 𝐡 𝐡 𝑇 and reformulate the optimization problem as

m i n 𝐇 𝒮 𝑇 𝐹 𝑘 = 1 𝑐 𝑘 t r 2 𝐇 𝐂 𝑘 + t r 𝐇 𝐃 s u b j e c t t o t r 𝐇 𝐀 = 𝑑 , = 1 , , 𝑊 r a n k ( 𝐇 ) = 1 , ( 3 4 ) where 𝒮 𝑇 denotes the vector space of symmetric matrices of dimension 𝑇 × 𝑇 . In (34) we have a convex objective function, however, the set { 𝐇 𝒮 𝑇 r a n k ( 𝐇 ) = 1 } is nonconvex. Resorting to semidefinite relaxation, we obtain

m i n 𝐇 𝒮 𝑇 𝐹 𝑘 = 1 𝑐 𝑘 t r 2 𝐇 𝐂 𝑘 + t r 𝐇 𝐃 s u b j e c t t o t r 𝐇 𝐀 = 𝑑 , = 1 , , 𝑊 𝐇 0 ( 3 5 ) with 𝐇 0 denoting that 𝐇 is positive semidefinite. Since { 𝐇 𝒮 𝑇 𝐇 0 } is a convex subset of 𝒮 𝑇 , we now have a CO problem [28]. Having found a matrix 𝐇 0 𝒮 𝑇 corresponding to a global minimum of (35), we have two possible cases. If r a n k ( 𝐇 0 ) = 1 , a solution 𝐡 0 of (33) is readily obtainable from 𝐡 0 𝐡 𝑇 0 = 𝐇 0 and the optimal window 𝑔 C O [ 𝑘 ] is found through (29). If r a n k ( 𝐇 0 ) > 1 , which we observe in most of the cases, rank reduction methods must be employed. We compute a possibly suboptimal window 𝑔 C O [ 𝑘 ] by the following three steps.

(i)In order to reduce the rank to 1 , we resort to the matrix 𝐇 0 = ( 𝑁 / 𝐾 ) 𝐯 0 𝐯 𝐻 0 composed by the dominant eigenvector 𝐯 0 of 𝐇 0 , since 𝐇 0 is the matrix nearest to 𝐇 0 in terms of the Frobenius norm [29].(ii)We translate 𝑁 / 𝐾 𝐯 0 into a window ̂ 𝑔 [ 𝑘 ] taking the sample permutation defined in (29) into account.(iii)We finally obtain 𝑔 C O [ 𝑘 ] by the algorithm [30], which yields a window defining a tight frame and at the same time minimizes the distance to a given window (i.e., ̂ 𝑔 [ 𝑘 ] ) in terms of the 𝐿 2 -norm.

Employing steepest descent methods for solving (35) may result in very slow convergence, whereas alternative methods may not be applicable when the number of dimensions is large. Neglecting the quadratic terms in the objective function leads to the simplified optimization problem

m i n 𝐇 𝒮 𝑇 t r 𝐇 𝐃 s u b j e c t t o t r 𝐇 𝐀 = 𝑑 , = 1 , , 𝑊 𝐇 0 . ( 3 6 ) As shown in [6], the linear objective function t r ( 𝐇 𝐃 ) reflects the mean-squared deviation of 𝐻 𝒢 𝑥 from 𝒢 ( 𝑥 ) , that is, the model error in the TF domain. Problems of the form (36) are dealt with by SDP, a subfield of CO. For the efficient solution of these optimization problems a number of sophisticated software packages are widely available. However, because generally ( 𝐻 𝒢 𝑥 𝒢 ( 𝑥 ) ) 𝑔 the windows resulting from solving (36) do not minimize the time domain error signal, the magnitude of which determines the performance of the channel diagonalization.

6. Numerical Results

We consider a WSSUS channel with an exponentially decaying delay power spectrum, the sampled version of which reads

𝑆 d e l a y [ 𝑞 ] = 𝑢 ( 𝑞 ) 1 e x p 1 𝜏 R M S e x p 𝑞 𝜏 R M S ( 3 7 ) with 𝑢 ( 𝑞 ) denoting the unit step function and 𝜏 R M S the root mean-squared (RMS) delay spread [31]. As for the Doppler power spectrum, a two-sided exponentially decaying shape is assumed, which results in the time correlation function

𝜙 𝑡 𝑘 Δ = 1 1 + 2 𝜋 2 𝜈 2 R M S | | 𝑘 Δ | | 2 , ( 3 8 ) where 𝜈 R M S represents the RMS Doppler spread. Since choosing an oversampling factor 𝐾 / 𝑁 larger than one increases the degrees of freedom in the window design, we restrict our attention to scenarios with 𝐾 > 𝑁 , involving oversampled filter banks. Figure 3 shows optimized window functions for different channel conditions and their Fourier transforms. The waveforms were obtained numerically by solving (35) using interior point methods [28] for 𝑁 = 2 4 , 𝐾 = 3 2 , 𝑃 = 2 amounting to a window length of 240 samples. An RMS delay spread 𝜏 R M S of 3 samples and an RMS Doppler spread 𝜈 R 𝑀 𝑆 of 0.001 s a m p l e s 1 were assumed in Figure 3(a), while 𝜏 R M S = 3 , 𝜈 R M S = 0 . 0 1 in Figure 3(b). The two shown optimized windows achieve RMSSEs (27) of 1 6 . 0 1  dB and 8 . 4 4  dB. Figures 3(c) and 3(d) show the Fourier transforms of the optimized pulses in (a) and (b), respectively, versus the normalized frequency 𝜔 / 2 𝜋 . Obviously, the optimized waveforms become more concentrated in time domain as the Doppler spread increases (see Figure 3(b) versus Figure 3(a)). For increasing Doppler spreads the coherence time of the channel decreases, and the temporal support of the optimized window is reduced in order to limit the RMSSE.

fig3
Figure 3: Examples of optimized window functions in time domain ((a) and (b)) and in frequency domain ((c) and (d)) for different channel statistics: 𝜏 R M S = 3 , 𝜈 R M S = 0 . 0 0 1 (in (a) and (c)), 𝜏 R M S = 3 , 𝜈 R M S = 0 . 0 1 (in (b) and (d)).

The RMSSEs (27) achievable by optimized windows are shown in Figure 4 for the same lattice constants and similar types of delay/Doppler power spectra. The RMS delay spread 𝜏 R 𝑀 𝑆 ranges between 0.5 and 8 samples while the RMS Doppler spread 𝜈 R M S equals 0.01 sample s 1 . For every considered 𝜏 R M S a window 𝑔 C O [ 𝑘 ] was obtained by numerically solving the CO problem (35), and a window 𝑔 S D P [ 𝑘 ] by solving (36) through SDP, where both approaches required the above-mentioned additional steps for rank reduction. The global minimum of the objective function in (35) at 𝐇 = 𝐇 0 , that is prior to the rank reduction, serves as a lower bound in the figure. The offsets of 𝜀 R M S S E ( 𝑔 C O ) and 𝜀 R M S S E ( 𝑔 S D P ) from the lower bound reflect the impact of the rank reduction. Additionally, the figure shows the RMSSEs resulting from choosing a window 𝑔 R R C [ 𝑘 ] with a root-raised-cosine (RRC) shaped magnitude spectrum with width 2 𝜋 / 𝐾 and roll-off factor 𝐾 / 𝑁 1 . We choose this window function for comparison because it does constitute a tight Gabor frame while exhibiting superior TF localization properties compared to rectangularly shaped windows for instance. Finally, for the verification of 𝜀 R M S S E ( 𝑔 C O ) the signals 𝑦 = 𝑥 and ̂ 𝑦 = 𝒢 ( 𝐻 ( 𝒢 𝑥 ) ) were also obtained by simulations involving filter banks based on the optimized windows 𝑔 C O [ 𝑘 ] and random signal and WSSUS channel generators, and the resulting error signal analyzed.

172751.fig.004
Figure 4: Model errors by windows 𝑔 C O [ 𝑘 ] and 𝑔 S D P [ 𝑘 ] optimized through CO and SDP, respectively, and by window 𝑔 R R C [ 𝑘 ] with RRC-shaped magnitude spectrum versus 𝜏 R M S at 𝜈 R M S = 1 0 2 .

Obviously, solving (35) leads to better windows than solving (36). The considerable offset of the RMSSEs from the lower bound for smaller 𝜏 R M S indicates that here the rank reduction has a significant impact on the windows. We observe that rank reduction generally has a limited effect when the delay and Doppler spreads are of similar extent, that is, when in the TF plane the delay spread relative to the sampling interval in time (i.e., 𝜏 R M S / 𝑁 ) is of the same order of magnitude as the Doppler spread relative to the sampling interval in frequency (i.e., 𝜈 R M S / 𝐾 1 ).

The relatively high RMSSEs found in Figure 4 are a result of the product 𝜏 R M S 𝜈 R M S being in the order of 1 0 2 , a much larger value than encountered in typical mobile radio scenarios. In environments with such severe dispersion in both time and frequency, the model error performance can actually be improved by increasing the oversampling factor 𝐾 / 𝑁 . This can be seen in Table 1, showing some 𝜀 R M S S E ( 𝑔 C O ) observed under the same conditions as above except for choosing different lattice constants. An RMS delay spread of 1 sample is assumed here. The performance clearly improves with the oversampling factor.

tab1
Table 1: Model error for different oversampling factors.

7. Generic Matched Filter Receiver

The considered TF channel diagonalization does not rely on a particular signal format, making it suitable for application in multimode receivers [32]. The burst structures defined in the various standards for wireless communications differ substantially. However, commonly the bursts incorporate preamble signals for the channel estimation along with information-bearing signals which are usually subject to a linear modulation scheme. The transmitted baseband signals generally follow the form 𝑡 [ 𝑘 ] = 𝐼 𝑖 = 1 𝑠 𝑖 𝑧 𝑖 [ 𝑘 ] with 𝑧 1 [ 𝑘 ] , , 𝑧 𝐼 [ 𝑘 ] representing 𝐼 elementary waveforms, possibly complex exponentials such as in the case of OFDM, or pseudo-noise sequences as in the case of direct-sequence spread-spectrum systems. For performing channel estimation and information recovery the receiver needs to estimate the signals 𝑠 1 , , 𝑠 𝐼 on the basis of the known waveforms 𝑧 1 [ 𝑘 ] , , 𝑧 𝐼 [ 𝑘 ] . To this end, the inner receiver correlates the received signal 𝑟 [ 𝑘 ] with the elementary signals as appearing at the channel output, resulting in

𝑢 𝑖 = 𝑟 , 𝑧 𝑖 , 𝑖 = 1 , , 𝐼 .