Abstract

Recent discussions on viable technologies for 5G emphasize on the need for waveforms with better spectral containment per subcarrier than the celebrated orthogonal frequency division multiplexing (OFDM). Filter bank multicarrier (FBMC) is an alternative technology that can serve this need. Subcarrier waveforms are built based on a prototype filter that is designed with this emphasis in mind. This paper presents a broad review of the research work done in the wireless laboratory of the University of Utah in the past 15 years. It also relates this research to the works done by other researchers. The theoretical basis based on which FBMC waveforms are constructed is discussed. Also, various methods of designing effective prototype filters are presented. For completeness, polyphase structures that are used for computationally efficient implementation of FBMC systems are introduced and their complexity is contrasted with that of OFDM. The problems of channel equalization as well as synchronization and tracking methods in FBMC systems are given a special consideration and a few outstanding research problems are identified. Moreover, this paper brings up a number of appealing features of FBMC waveforms that make them an ideal choice in the emerging areas of multiuser and massive MIMO networks.

1. Introduction

In the past, orthogonal frequency division multiplexing (OFDM) has enjoyed its dominance as the most popular signaling method in broadband wired [1, 2] and wireless [3, 4] channels. OFDM has been adopted in the broad class of DSL standards as well as in the majority of wireless standards, for example, variations of IEEE 802.11 and IEEE 802.16, 3GPP-LTE, and LTE-Advanced. OFDM is known to be a perfect choice for point-to-point communications, for example, from a base station to a mobile node and vice versa. It offers a minimum complexity and achieves very high bandwidth efficiency. However, it has been noted that OFDM has to face many challenges when considered for adoption in more complex networks. For instance, the use of OFDM in the uplink of multiuser networks, known as OFDMA (orthogonal frequency division multiple access), requires full synchronization of the users’ signals at the base station input. Such synchronization was found to be very difficult to establish, especially in mobile environments where Doppler shifts of different users are hard to predict/track. Morelli et al. [5] have noted that carrier and timing synchronization represents the most challenging task in OFDMA systems. To combat the problem, some researchers have relaxed on the need for a close to perfect carrier synchronization among users and have proposed multiuser interference cancellation methods [69]. These methods are generally very complex to implement. Their implementation increases the receiver complexity by orders of magnitude [10]. Hence, one of the main advantages of OFDM, the low complexity, will be lost.

Another limitation of OFDM appears when attempt is made to transmit over a set of noncontiguous frequency bands, known as carrier aggregation. The poor response of the subcarrier filters in IFFT/FFT filter banks of OFDM introduces significant out-of-band egress noise to other users and also picks up significant ingress noise from them. The same problem appears if one attempts to adopt OFDM for filling in the spectrum holes in cognitive radios. Methods of reducing OFDM spectral leakage prove to be very limited in performance and may add significant complexity to the transmitter. For instance, the side lobe suppression techniques, like those proposed in [1113], can achieve an out-of-band emission suppression of only 5 to 10 dB, while they may add significant complexity to the transmitter and they will incur some loss in bandwidth efficiency.

Filter bank multicarrier (FBMC) is an alternative transmission method that resolves the above problems by using high quality filters that avoid both ingress and egress noises. Also, because of the very low out-of-band emission of subcarrier filters, application of FBMC in the uplink of multiuser networks is trivial [14, 15]. It can be deployed without synchronization of mobile user nodes signals. In the application of cognitive radios, the filter bank that is used for multicarrier data transmission can also be used for spectrum sensing [1620]. On the other hand, compared to OFDM, FBMC falls short in handing multiple-input multiple-output (MIMO) channels, although a few solutions to adopt FBMC in MIMO channels have been reported in the literature; for example, see [2123]. Nevertheless, as our recent research study has shown (see Section 8, below), in the emerging area of massive MIMO, FBMC is found as powerful as OFDM and in some cases superior to OFDM.

In the past, many attempts have been made to adopt FBMC in various standards. Apparently, the earliest proposal to use FBMC for multicarrier communications is a contribution from Tzannes et al. of AWARE Inc., in one of the asymmetric digital subscriber lines (ADSL) standard meetings in 1993 [24]. The proposed method that was called discrete wavelet multitone (DWMT) was further studied in [25, 26]. Despite enthusiasm from the research community (see [27] and the cited references therein), DWMT was not adopted in the ADSL standard. This was partly because of the perceived complexity of this method as compared to its rival, DMT (discrete multitone, an equivalent name for OFDM in DSL literature). Indeed, the DWMT structure proposed by Tzannes et al. was significantly more complex than that of DMT. The major part of the complexity of DWMT came from the equalization method that was adopted. The detailed discussion presented in [26] assumed that one needs an equalizer that combines signals from each subcarrier band and its adjacent bands. Typical equalizer lengths suggested in [26] were 21 real-valued taps per subcarrier. It was later noted by the author of this paper that if each subcarrier is sufficiently narrow such that it can be approximated by a flat gain, two real taps per subcarrier would be sufficient for equalization [27]. This observation led to further study of DWMT. In power line communications (PLC) community, it has been named wavelet OFDM and was adopted in the IEEE P1901 standard [28]. The main motivation for use of DWMT in DSL and its adoption later in PLC was to deal with ingress and egress noises, since both DSL and PLC use unshielded copper lines that are subject to strong radio interference. Moreover, in 1999, an FBMC method with nonoverlapping subcarrier bands was proposed as a solution for filtering the narrow-band interferences in very high-speed DSL (VDSL) channels [29]. The proposed method was called filtered multitone (FMT). This proposal that was included as an annex in one of the initial draft documents of VDSL [30] was further developed by a number of researchers [3134]. However, to avoid incompatibility with ADSL, FMT was not included in the final document of the VDSL standard [35]. Another unsuccessful story is an attempt by France Telecom [36] to introduce FBMC in the IEEE 802.22, a cognitive radio standard to access TV bands in wireless rural area networks (WRAN). Up to now, apparently, the only standard for radio transmission that uses FBMC is the TIAs Digital Radio Technical Standard [37].

Recent discussions on the fifth generation (5G) wireless communications have initiated a much stronger wave of interest in deviating from the main stream of OFDM systems. This shift of interest is clearly due to limitations of OFDM in the more dynamic and multiuser networks of future. A number of proposals have been made to adopt new waveforms with improved spectral containment. A good example of such activity is the 5GNOW project in Europe which challenges LTE and LTE-Advanced in coping with the dynamic needs of 5G. The 5GNOW has identified four alternative choices of waveforms to better serve 5G needs. These waveforms that are all built based on some sort of filtering may be thought as adoptions of FBMC method to suit different needs of various applications. We refer interested readers to the documents available at http://www.5gnow.eu for the details of the proposed waveforms by the 5GNOW group. Another major activity that has performed a broad study of FBMC is due to the PYHDYAS project (also in Europe). The PHYDYAS contributors have published heavily on various aspects of FBMC, including prototype filter design, equalization, synchronization, and application to MIMO channels. A summary of the major findings of the PHYDYAS contributors is presented in [21], and a complete list of their publications can be found at http://www.ict-phydyas.org.

The main thrust of this paper is to present a point of view of FBMC and its future applications as seen by the author. While we acknowledge the presence of a large body of works on FBMC, the paper details are geared towards the research outcomes of the author and his students in past 15 years. The paper emphasis is on the recent works of the author and his students. Many shortcomings of OFDM in dealing with the requirement of the next generation of wireless systems are discussed and it is shown how FBMC overcomes these problems straightforwardly. We present a derivation of FBMC systems that reveals the relationships among different forms of FBMC. A method of designing FBMC systems for a near-optimum performance in doubly dispersive channels is presented and its superior performance over OFDM is shown. The example considered is an underwater acoustic channel. Application of FBMC technique to massive MIMO communications is introduced and its advantages in this emerging technology are revealed. Last, but not the least, the problems of channel equalization and synchronization in FBMC systems are also given a special treatment and a number of outstanding research problems in this field, for future studies, are identified.

This paper begins with a historical overview of FBMC methods in Section 2. In order to keep the presentation in this paper a complement to those we have recently reported in [19, 38], the rest of this paper is organized as follows. A summary of the theoretical background based on which the various forms of FBMC are built are presented in Section 3. Methods of designing prototype filters for FBMC are presented in Section 4. Polyphase structures that are used for efficient implementation of FBMC systems are reviewed in Section 5. A few comments on complexity comparison of FBMC and OFDM are also made in this section. Channel equalization in FBMC systems is discussed in Section 6. Methods of carrier and symbol timing acquisition and tracking in FBMC systems are reviewed in Section 7. Here, a few particular features of FBMC systems that need special attention for their successful implementation are highlighted. Section 8 reminds the reader of a number of applications in the literature where FBMC has been found to be a good fit. This section also discusses the opportunities offered by FBMC in the emerging area of massive MIMO. The concluding remarks of the paper are made in Section 9.

Notations. The presentations in this paper follow a mix of continuous-time and discrete-time formulations, as appropriate. While refers to a continuous function of time, , is used to refer to its discrete-time version, with denoting the time index. The notation is used as frequency variable in , the Fourier transform of the continuous-time signal , and also as normalized frequency in , the Fourier transform of the discrete-time signal . We use to denote linear convolution, and the superscript to denote complex conjugate.

2. Review of FBMC Methods

FBMC communication techniques were first developed in the mid-1960s. Chang [39] presented the conditions required for signaling a parallel set of PAM symbol sequences through a bank of overlapping filters within a minimum bandwidth. To transmit PAM symbols in a bandwidth-efficient manner, Chang proposed vestigial sideband (VSB) signaling for subcarrier sequences. Saltzberg [40] extended the idea and showed how Chang’s method could be modified for transmission of QAM symbols in a double-sideband- (DSB-) modulated format. In order to keep the bandwidth efficiency of this method similar to that of Chang’s signaling, Saltzberg noted that the in-phase and quadrature components of each QAM symbol should be time staggered by half a symbol interval. Efficient digital implementation of Saltzberg’s multicarrier system through polyphase structures was first introduced by Bellanger and Daguet [41] and later studied by Hirosaki [42, 43]. Another key development appeared in [44], where the authors noted that Chang’s/Saltzberg’s method could be adopted to match channel variations in doubly dispersive channels and, hence, minimize intersymbol interference (ISI) and intercarrier interference (ICI).

Saltzberg’s method has received a broad attention in the literature and has been given different names. Most authors have used the name offset QAM (OQAM) to reflect the fact that the in-phase and quadrature components are transmitted with a time offset with respect to each other. Moreover, to emphasize the multicarrier feature of the method, the suffix OFDM has been added, hence, the name OQAM-OFDM. Others have chosen to call it staggered QAM (SQAM), equivalently SQAM-OFDM. In [38] we introduced the shorter name staggered multitone (SMT).

Chang’s method [39], on the other hand, has received very limited attention. Those who have cited [39] have only acknowledged its existence without presenting much detail, for example [41, 45, 46]. In particular, Hirosaki who has extensively studied and developed digital structures for the implementation of Saltzberg’s method [43, 47] has made a brief reference to Chang’s method and noted that since it uses VSB modulation, and thus its implementation requires a Hilbert transformation, it is more complex to implement than Saltzberg’s method. This statement is inaccurate, since as we have demonstrated in [38] Chang’s and Saltzberg’s methods are equivalent and, thus, with a minor modification, an implementation for one can be applied to the other. More on this is presented in Section 5. Also, as noted earlier, a vast literature in digital signal processing has studied a class of multicarrier systems that has been referred to as DWMT. It was later noted in [27, 48] that DWMT uses the same analysis and synthesis filter banks as the cosine modulated filter banks (CMFB) [49]. CMFB, on the other hand, may be viewed as a reinvention of Chang’s method, with a very different application in mind [38]. In [38], we also introduced the shorter name cosine modulated multitone (CMT) to be replaced for DWMT and/or CMFB.

One more interesting observation is that another class of filter banks which were called modified DFT (MDFT) filter bank has appeared in the literature [50]. Careful study of MDFT reveals that this, although derived independently, is in effect a reformulation of Saltzberg’s filter bank in discrete-time and with emphasis on compression/coding. The literature on MDFT begins with the pioneering works of Fliege [51] and later has been extended by others, for example [5255].

Finally, before we proceed with the rest of our presentation, it should be reiterated that we identified three types of FBMC systems: (i) CMT: built based on the original idea of Chang [39]; (ii) SMT: built based on the extension made by Saltzberg [40]; and (iii) FMT: built based on the conventional method frequency division multiplexing (FDM) [34].

3. Theory

The theory of FBMC, particularly those of CMT and SMT, has evolved over the past five decades by many researchers who have studied them from different angles. Early studies by Chang [39] and Saltzberg [40] have presented their finding in terms of continuous-time signals. The more recent studies have presented the formulations and conditions for ISI and ICI cancellation in discrete-time, for example, [46, 50]. On the other hand, a couple of recent works [19, 38], from the author of this paper and his group, have revisited the more classical approach and presented the theory of CMT and SMT in continuous time. It is believed that this formulation greatly simplifies the essence of the theoretical concepts behind the theory of CMT and SMT and how these two waveforms are related. It also facilitates the design of prototype filters that are used for realization of CMT and SMT systems. Thus, here, also, we follow the continuous-time approach of [19, 38]. Moreover, to give a complement presentation to those of [19, 38], an attempt is made to discuss the underlying theory mostly through the time-frequency phase-space, with minimum involvement in mathematical details. It is believed that this presentation also provides a new intuition into the relationship between SMT and CMT. Interested readers who wish to see the mathematical details are referred to [20, 38].

3.1. CMT

In CMT, data symbols are from a pulse amplitude modulated (PAM) alphabet and, hence, are real-valued. To establish a transmission with the maximum bandwidth efficiency, PAM symbols are distributed in a time-frequency phase-space lattice with a density of two symbols per unit area. This is equivalent to one complex symbol per unit area. Moreover, because of the reasons that are explained below a 90-degree phase shift is introduced to the respective carriers among the adjacent symbols. These concepts are presented in Figure 1. Vestigial side-band (VSB) modulation is applied to cope with the carrier spacing . The pulse-shape used for this purpose at the transmitter as well as for matched filtering at the receiver is a square-root Nyquist waveform, , which has been designed such that is a Nyquist pulse with regular zero crossings at time intervals. Also, , by design, is a real-valued function of time and . These properties of , as demonstrated below, are instrumental for correct functionality of CMT.

Figure 2 presents a set of magnitude responses of the modulated versions of the pulse-shape for the data symbols transmitted at and . The colors used for the plots follow those in Figure 1 to reflect the respective phase shifts.

Let , denote symbol time index, let , denote symbol frequency index, let denote the data symbol in the time-frequency lattice, and let be the phase shift that is added to the carrier of . Accordingly, a CMT waveform that is constructed based on the pulse-shape/prototype filter is expressed as where

The synthesis of according to (1) has the following interpretations. The terms may be thought as a set of basis functions that are used to modulate the data symbols . The data symbols can be extracted from straightforwardly if are a set of orthogonal basis functions. The orthogonality for a pair of functions and , in general, is defined as For the case of interest here, where the data symbols are real-valued, the orthogonality definition (3) can be replaced by the more relaxed definition: where indicates the real part. Definition (4) is referred to as real orthogonality.

It is not difficult to show that and, hence, for any pair of and ,

To develop an in-depth understanding of the CMT signaling, it is instructive to explore a detailed derivation of (5). To this end, we begin with the definition and note that this can be rearranged as

When and , after a change of variable to , (8) reduces to where the second equality follows from the fact that is a real-valued square-root Nyquist pulse and .

When , , and , where is an integer, where the second equality follows since is a square-root Nyquist pulse, designed for a symbol spacing . On the other hand, when , but ,

Next, consider the case where and . In that case, one finds that where the last identity follows, by applying the change of variable and noting that the expression under the integral will reduce to an odd function of . Following similar procedures, it can be shown that the real orthogonality also holds, when and is an odd integer, and when and is either an even or odd integer. Finally, for the cases where , the real orthogonality is trivially confirmed by noting that the underlying basis functions correspond to filters that have no overlapping bands. The stop-band quality of the frequency response of the prototype filter determines the accuracy of the equality when .

It is also worth noting that some of the recent derivations of FBMC that are presented in discrete-time design the respective prototype filter such that the respective real orthogonality is perfectly satisfied for the cases where as well, for example, [56]. However, one may realize that, in practice, the presence of channel destroys the orthogonality of the basis functions. The orthogonality of the basis functions is commonly recovered at the receiver using per subcarrier equalizers; see Section 6, below. Such equalizers, unfortunately, will not be able to recover the orthogonality of basis functions that belong to nonadjacent subcarriers. Hence, the design of a that satisfies the real orthogonality for may be a waste.

To summarize, the above derivations revealed that the following settings of the CMT parameters lead to the real orthogonality of the basis functions and allow symbol placement in the time-frequency lattice at the maximum density of two real symbols per unit area.(1)The symbol spacing along the time axis should be matched with the subcarrier spacing along the frequency axis.(2)The pulse-shape/prototype filter must be a real-valued square-root Nyquist filter for a symbol spacing . It should be also a linear phase; hence, when viewed as a zero phase filter, it satisfies the condition . The latter is not a necessary condition but is satisfied in most of the designs; see [38] for a more relaxed condition.(3)The above constraints are applied to assure orthogonality of the basis functions that are within the same subcarrier or adjacent subcarriers only. The orthogonality of basis functions that belong to the nonadjacent subcarriers is guaranteed by virtue of the fact that they correspond to filters with nonoverlapping bands. A more advanced design presented in Section 4.2 allows overlapping of the nonadjacent bands and yet satisfies the orthogonality condition.(4)The phase shifts indicated in Figure 1 can be modified to other choices, as long as a phase difference is preserved between each pair of adjacent points in the lattice.(5)Although in CMT the position of the lattice points is fixed, these points can be moved in the time-frequency plane as long as their relative position and phase differences remain unchanged. We use this point below to arrive at the SMT waveform by applying a simple modification to the CMT waveform (1).

The above equations may be combined to arrive at the CMT transmitter and receiver structures that are presented in Figure 3. As shown a synthesis filter bank (SFB) is used to construct the transmit signal, and the received signal is passed through an analysis filter bank (AFB) to separate the data streams of different subcarrier bands. Here, it is assumed that there are subcarrier streams and the data stream of the th subcarrier at the input to the SFB is represented by the impulse train: It is also worth noting that, in practice, the analyzed signals at the output of the AFB should be equalized. Here, to keep the presentation simple, we have not included the equalizers. Equalizer details are presented in Section 6.

3.2. SMT

SMT may be thought of as an alternative to CMT. Its time-frequency phase-space lattice is obtained from that of CMT (Figure 1) through a frequency shift of the lattice points down by , scaling the time axis by a factor and hence the frequency axis by a factor 2. Moreover, to remain consistent with the past literature of SMT (equivalently, OQAM-OFDM), some adjustments to the carrier phases of the lattice points have been made. This leads to the time-frequency phase-space lattice that is presented in Figure 4. The magnitude responses of the SMT pulse-shaping filters at various subcarriers and time instants and are presented in Figure 5. Note that here the PAM symbols are spaced by and subcarriers are spaced by . In SMT, each pair of adjacent symbols along time in each subcarrier is treated as real and imaginary parts of a QAM symbol. This leads to the transmitter and receiver structures that are presented as in Figure 6. Here, each data symbol belongs to a QAM constellation and, thus, may be written in terms of its in-phase and quadrature components as . Accordingly, the inputs to the SFB in Figure 6 are

3.3. FMT

FMT waveforms are synthesized following the conventional method of frequency division multiplexing (FDM). The subcarrier channels have no overlap, and thus ICI is resolved through use of well-designed filters with high stopband attenuation. ISI may be compensated for by adopting the conventional method of square-root Nyquist filtering that is used in single-carrier communications. For doubly dispersive channels, we have adopted a more advanced design [57] that compensates for both ICI and ISI through an effective method which is presented in Section 4.2.

For comparison with the CMT and SMT structures in Figures 3 and 6, respectively, and also as a basis for further development in later parts of this paper, Figure 7 presents the structures of transmitter and receiver blocks in an FMT communication system. Also, the magnitude responses of the FMT pulse-shaping filters at various subcarriers are presented in Figure 8. Note that here the subcarrier spacing is equal to , where is the roll-off factor of the pulse-shaping/prototype filter. Accordingly, FMT has a symbol density of complex symbols per unit area in the time-frequency plane. Hence, FMT is less bandwidth efficient than CMT and SMT.

4. Prototype Filter Design

The results of the previous section suggest that linear phase square-root Nyquist filters that have been widely used for single carrier data transmission are the most trivial choice for prototype filter in FBMC systems. This indeed remains an accurate statement as long as the underlying channel is time-invariant or varies slowly. However, in cases where the channel is fast varying, a more general class of square-root Nyquist filters that satisfy the Nyquist condition both along the time and frequency axis should be adopted. In this section, we first review couple of classical Nyquist designs developed by us [58] and others [59], and then discuss an example of more advanced designs that also have been developed in our research group [57] and is more appropriate for time-varying channels.

4.1. Prototype Filters for Time-Invariant Channels

The classical prototype filter for FBMC systems that was suggested in [39, 40] was the square-root raised-cosine (SRRC) filter. More specifically, both [39, 40] suggested using a SRRC filter with the roll-off factor . In practice, where FBMC systems are implemented in discrete-time, SRRC response should be sampled and truncated. The truncation of the SRRC response may result in a filter with poor frequency response and, hence, makes SRRC a poor choice. Advancements in digital filters design have led to very effective methods for designing square-root Nyquist (SR-Nyquist) filters for any specified finite length. These designs, two of which are presented here, aim at balancing the stopband attenuation and the Nyquist property of the designs.

Martin [59] has proposed a design method whose goal is to satisfy the Nyquist criterion approximately while achieving good attenuation in the stopband. A nice property of this method is that when the number of samples per symbol interval is and the filter length is , where is an integer greater than 1, there are only parameters that need to be found for the optimization of the design. More specifically, the prototype filter is constructed as where the coefficients through are to be optimized. The optimum choices of the coefficients through , for different values of , are tabulated in [59] and, also, in [60]. Beside being a simple and effective method, it is important to note that Martin’s design for has been adopted by the PHYDYAS project [21].

More recently, we have developed an algorithm for designing SR-Nyquist filters that can balance between the accuracy of the Nyquist constraints and the filter stopband attenuation [58]. To select the samples of the zero-phase impulse response of the SR-Nyquist filter, this algorithm defines and minimizes the cost function with respect to the elements . The first two terms on the right-hand side of (17) are to enforce Nyquist property on , and the third term is included to minimize the stopband response of . Also, the length of is assumed to be equal to . One may note that since is a forth-order function of the parameters , it is a multimodal function and hence its minimization may not be straightforward. However, fortunately, if could be initialized near its optimum choice, an iterative solution may be applied to search for the local minimum of . The search algorithm proposed in [58] follows a similar procedure to the one originally suggested in [61].

Figure 9 presents the magnitude responses of three designs based on (i) a sampled and truncated version of the impulse response of a SRRC filter: SRRC4; (ii) a SR-Nyquist design obtained following [59]: Martin4; and (iii) a SR-Nyquist design obtained following [58]: Farhang4. All designs are based on the roll-off factor and are for an FBMC system with subcarriers, and the suffix on the names indicates that, for all designs, ; that is, the designed filters have a length . In addition, for Farhang4, the parameter (defined in [58]) is set equal to 0.1.

As seen, and one would expect, SRRC design performs significantly inferior to both of the SR-Nyquist designs. From the two SR-Nyquist designs, the one based on [58] achieves a higher attenuation in the first few sidelobes of its stopband than the design based on [59]. The first sidelobe of the former design is 10 dB lower. Also, a study of the time domain responses of the designs reveals that none of them satisfies the Nyquist property perfectly. Hence, all are near Nyquist designs. However, the design of [59] is the closest to Nyquist, followed by the design delivered by [58] and, then, the SRRC design. Nevertheless, all the three designs are very close to Nyquist; hence, the choice of one against the others may be dominantly determined by its superior magnitude response.

4.2. Prototype Filters for Time-Varying Channels

When a channel is subject to dispersion both in time (due to multipath effects) and in frequency (due to variation of the channel in time), we say the channel is doubly dispersive. We also note that while the dominant effect of time dispersion in a channel is ISI, the dominant effect of frequency dispersion is ICI. Hence, in order to design prototype filters that address both time and frequency dispersions, it has been argued [44] that one should choose a pulse-shape with similar behavior along time and frequency. In particular, if is selected to be a SR-Nyquist along the time axis, it should be also made sure that is a SR-Nyquist along the frequency axis. To this end, the pulse-shape, , with the following property may be adopted: that is, a function that has the same form in both time and frequency domains.

The parameter in (18) is related to the symbol spacing in time, , and frequency, , and is given by Also, as one may understand intuitively, and are, respectively, chosen proportional to time dispersion, , and frequency dispersion, , of the channel; that is, . Hence, the following identity also holds: The definitions for and are usually vague. The time dispersion may be thought of as a coarse estimate of the duration of the channel impulse response, equivalently the span of the multipaths of the channel. Similarly, the frequency dispersion may be thought of as a coarse estimate of the span of Doppler spread of the channel.

The design of the prototype filter for time-varying channels is closely tied to the ambiguity function [6264]: where is a time delay and is a frequency shift.

Let be a prototype filter and let be a time frequency translated version of it which is defined as We note that where indicates proportionate to. The proportionate factor is a phase shift due to the delays of and whose value is irrelevant to our discussion here. Using (23), one may note that of (22), for all choices of and , will form a set of complex-valued orthogonal basis functions, if is chosen such that

We note that the case where corresponds to the familiar (time) correlation function: which reduces to the Nyquist constraints: Also, one may note that the integral on the right-hand side of (25) is equal to the convolution of and its matched pair , evaluated at time . Thus, and, hence, (26) implies that is a SR-Nyquist pulse. Moreover, the ambiguity function may be thought of as a generalization of the correlation function where correlation is found between and its modulated version at the frequency . Accordingly, we refer to the set of constraints (24) as the generalized Nyquist constraints.

Another key point that one should consider in the selection of , for time-varying channels, is minimization of its duration. Le Floch et al. [44] have noted that to maximize the density of the basis functions in the time-frequency space and, hence, maximize the bandwidth efficiency of the transmission, one should choose a that has maximum compactness in the time-frequency space. Maximum compactness, on the other hand, is quantified by the product , where and are, respectively, the second-order moments of the functions and . Moreover, the Heisenberg-Gabor uncertainty principle states that [65] where the equality holds only when is the Gaussian pulse . Also, the Gaussian pulse has the interesting property that ; that is, it satisfies the desirable property (18), with . However, does not satisfy the orthogonality conditions (24).

Attempts to design filters that satisfy the orthogonality conditions (24) and at the same time approach the Heisenberg-Gabor uncertainty lower bound (27) as close as possible have been made and design methods have been developed [44, 66]. The design presented in [44] is called isotropic orthogonal transform algorithm (IOTA) filter. IOTA design/algorithm was first introduced in a patent by Alard [67]. The designs proposed in [66], on the other hand, are referred to as Hermite pulses, reflecting the fact that their construction is based on a linear combination of a set of Hermite functions. In the rest of this section, we limit our emphasis to the design of Hermite pulses and emphasize the flexibilities that these designs provide in adopting to doubly dispersive channels. More details on this topic can be found in [57] (also, see [68, 69]). Discussion on IOTA design can be found in [19, 20, 44, 70].

The design procedure proposed by Haas and Belfiore [66] constructs an isotropic filter according to the equation: where is the set of Hermite functions defined as Note that and, thus, it is an isotropic function with parameter . Moreover, it can be shown that the set of functions for , , are also isotropic, with the same parameter. This implies that the construction (28) for any set of coefficients leads to an isotropic function. In [66], the coefficients have been calculated to construct a filter that satisfies the set of constraints (24), for parameters .

Haas and Belfiore’s design [66] allows transmission of QAM symbols with a density of symbol per unit area in the time-frequency space. This design of may also be used as the prototype filter in a CMT or SMT structure to increase the density to PAM symbols per unit area, equivalent to one QAM symbol per unit area. Alternatively, one can aim for a design with larger density than 0.5 and stay with transmitting QAM symbols. Examples of both designs are presented later. In the rest of this section, we follow the approach of [57] to present a broad class of Hermite filter designs that includes the design presented in [66] as a special case.

The basic equations for the design of Hermite pulses are obtained by substituting (28) in (21). This gives where Defining (30) may be rearranged as

(1) Haas and Belfiore Design. In [66], it was proposed that the coefficients through can be determined by substituting (33) into (24) for and other significant choices of and solving the resulting system of equations. It was, numerically, demonstrated that this leads to very good designs. To further clarify this procedure and pave the way for additional developments in the sequel, we discuss the method of [66] in the context of a specific design. Figure 10 presents the grid of all choices of . Here, it is assumed that the constraints (24) are applied at the origin and the 12 nearest grid points to it. In this figure, three different sets of grid points are identified.(1)The significant points are the ones at which the constraints (24) should be imposed. There are four such points and these are indicated by red disks.(2)Once the desired constraints are imposed at the significant points defined in , it follows from the even symmetry and isotropic property of that the same constraints will automatically be imposed at the rest of the points indicated by green disks.(3)The remaining grid points, indicated by white disks, will satisfy the constraints (24), within a good approximation, since the designed pulse decays exponentially/fast as increases.

We note that to satisfy the constraints (24) at the origin and the 12 nearest grid points to it, it is sufficient to apply constraints to only 3 of the latter points. A unique design is thus obtained if the design is made based on , , , and , that is, the choice of in (30) through (33).

It should be also noted that since, here, the designed filter is isotropic, with the parameter , the identity should hold. Hence, given a desired density , the design must be for the parameters . Once is obtained based on these parameters, applying a time scaling factor, one can set to any desired value. This will set equal to .

(2) Hexagonal Lattice. The spread of data symbols in Figure 10 and other lattice grids that have been presented in this paper, so far, follows an orientation that is referred to as rectangular lattice. In [71], it is noted that better designs may be obtained by adopting the hexagonal orientation. The reason behind this improvement in performance follows the fact that the hexagonal orientation allows maximum separation of points for a given symbol density. Interested readers are referred to the detailed discussions in [19, 20, 71].

To design prototype filters for an orientation that follows the hexagonal lattice, we choose the constrained points according to those depicted in Figure 11. It should be noted that we set and for this choice it is sufficient to enforce design constraints at the points indicated by red disks. As in the case of rectangular lattice, once the constraints are applied to these points, similar constraints will be automatically imposed to the points indicated by green disks, following the symmetric and isotropic property of the designs. Moreover, as in the case of rectangular lattice, by design, the remaining grid points, indicated by white disks, will satisfy the constraints (24), within a good approximation. Finally, a time scaling can be applied to to set , and accordingly , to any desired value.

(3) Robust Design. Building on the above findings, we noted that, in a practical design, the time and frequency dispersion introduced by a channel smear the null points of the ambiguity function [57, 68, 69]. Hence, as a result of the channel dispersion, the nulls will convert to shallow nulls. It is, thus, argued that instead of designing to introduce perfect nulls in , a more robust design is obtained by aiming for a design that results in deep (but imperfect) null areas around the grid points . To this end, a robust Hermite pulse is designed by minimizing the cost function: where is an area around the origin and through are the null areas that are aimed for.

To develop a numerical method for the minimization of , the ambiguity function is sampled along the time axis and the frequency axis . This converts (34) to where and are the sampling periods along the time axis and frequency axis, respectively, and

Removing the third term on the right-hand side of (17), one may realize that the cost functions and are similar. Both are fourth-order functions of the parameters that we seek to optimize. Hence, the iterative method developed in [58] can be readily adopted here as well. For this purpose, as in [58], an initial guess for the elements of the parameter vector should be made. It has been noted in [57] that a proper initial choice for is the vector whose first element is 1 and the rest of its elements are 0. That is, one should start with the Gaussian pulse as a first guess and add the higher order Hermite functions in the subsequent iterations.

(4) Numerical Examples. To conclude this section, we present the results arising from a few designs of IOTA and Hermite prototype filters. Also, we compare the results with those of the SR-Nyquist design of [58].

Figure 12 presents the time domain and the magnitude of frequency domain responses of (i) an IOTA design [67], (ii) a Hermite design [66], and (iii) a SR-Nyquist design [58]. All designs have the same filter length of and designed to serve a CMT/SMT system with a maximum of subcarrier bands.

Considering the results presented in Figure 12, one may make the following observations.(1)Both IOTA and Hermite designs have time-domain responses that are more compact than the time-domain response of the SR-Nyquist filter.(2)In the frequency domain, on the other hand, the SR-Nyquist design gives a more compact response.(3)The compact responses of IOTA and Hermite designs in time will make them less prone to distortion introduced by channel variation with time.(4)The broader responses of IOTA and Hermite design in frequency, on the other hand, make them more prone to the channel frequency selectivity.(5)Considering and , a balance has to be made between the choice of the symbol interval, , and the carrier spacing, . Obviously, by applying a proper time scaling to the prototype filter , such balance can be made.(6)When channel is time-invariant or varies slowly, the choice of SR-Nyquist results in the maximum immunity to the channel frequency selectivity.

The robust design approach that was presented above was first introduced in [69] and was further studied in [57, 68]. Here, to emphasize on the significance of this design approach, we present Figure 13 from [57]. This figure presents a set of results that compare the signal-to-interference ratio (SIR) of the robust isotropic designs (named FMT-dd: FMT for doubly dispersive channels) with OFDM and an FMT design according to the SR-Nyquist design of [58] (named FMC-c: conventional FMT). To measure SIR, the channel noise is set equal to zero. Three sets of results, corresponding to density values , , and are presented. The robust isotropic designs are obtained for a channel model in which dispersion in time and frequency is uniform in the intervals and , respectively, and and . The design for each density is fixed and its performance is examined for varying values of in the range of 0 to 0.1. For FMT-c, the prototype filter is designed in each of the three cases with the aim of achieving a stopband attenuation of 60 dB or better. Also, following the basic principle of FMT-c, the roll-off factors of the prototype filters for density values , , and are set equal to 1, 0.5, and 0.25, respectively. For OFDM, the density is set by adjusting the ratio of cyclic prefix length, , over the length of FFT, . In particular, we note that since in OFDM and , and, thus, .

The results presented in Figure 13 clearly show the expected superior performance of the robust designs over OFDM and the conventional FMT design. The more detailed results presented in [57] for an underwater acoustic (UWA) communication further confirm these results in an at-sea experimental setting.

5. Polyphase Structures and Computational Complexity

Polyphase structures are commonly used to implement FBMC transmitter and receiver. Direct mimicking of the continuous-time structures of Figures 3 and 6 to implement CMT and SMT, respectively, may lead to structures whose complexity is not optimized. One polyphase structure per each set of real symbols, equivalent to two polyphase structures per each pair of real symbol sets, should be implemented. This section highlights the fact that the two polyphase structures at the transmitter side can be combined together and hence reduce the complexity to one half. At the receiver side, special attention has to be paid so that proper equalizers can be applied to the analyzed signal components. The presentation in this section, although results in structures with the same complexity to those already published, arrives at structures in more intuitive way and, thus, hopefully assists the reader to have a better grasp of the concepts.

Polyphase structures for FMT transmitter and receiver also need particular attention to take care of the fact that sampling rate changes in the AFB (at the transmitter side) and in the SFB (at the receiver side) is different from the size of IFFT/FFT block. A method that takes into account this change of sampling rate is presented in a later part of this section.

5.1. Polyphase Analysis and Synthesis Filter Banks

The basic polyphase SFB block that may be used to construct the SFB blocks in both CMT and SMT transmitter is presented in Figure 14. This block which is widely available and well developed in the literature, for example, see [49, 72], has the following characteristics.(1)The inputs are a set of data sequences with the nominal rate of unity.(2)The SFB output has a rate of or, equivalently, times faster than the rate at the input.(3)The synthesis is performed based on an IFFT of size , with the inputs of index and greater set equal to zero. This is to allocate some guard band between the generated baseband signal and its images and, hence, facilitate additional filtering in the further stages of the transmitter.(4)The filtering blocks through are the polyphase components of the prototype filter .(5)This structure is an efficient implementation of a bank of filters with the respective inputs and the transfer functions , where is the Fourier transform of .(6)The commutator at the output serializes the outputs of the polyphase component filters, after arrival of each set of data symbols at the input.

Figure 15 presents the basic polyphase AFB that matches with the SFB of Figure 14. Here, . In the literature, and are distinguished by referring to them as type I and type II polyphase components [49]. When is fed to its input, within the accuracy provided by the prototype filter , the data sequences appear at its first outputs. In other word, if is passed through an ideal channel (free of distortion and noise), the SFB may be used to recover the transmitted data symbols . Obviously, channel introduces distortion and thus the AFB outputs should pass through a bank of equalizers to recover the data symbols .

5.2. Polyphase Structures for CMT and SMT

CMT and SMT systems, as was demonstrated in Section 3, are effectively the same modulations. For instance, a CMT waveform can be generated using an SMT waveform generator and subsequently apply a positive spectral shift of one half of the subcarrier spacing to the result. Alternatively, an SMT waveform may be generated using a CMT waveform generator and subsequently apply a negative spectral shift of one half of the subcarrier spacing to the result. It turns out that the development of structures for SMT waveform is more straightforward than those of the CMT. We thus continue this section with development of a pair of transmitter and receiver structures for SMT. The relevant CMT structures will follow through simple modifications of the SMT structures following the details provided in Section 3.

(1) SMT Transmitter. To develop a computationally efficient polyphase structure for SMT transmitter, we begin with rearranging the transmitter part of Figure 6 as in Figure 16. This, clearly, is obtained by separating the phase and quadrature parts of the SFB, combining the phase shifts at different points in the structure and adding the combined results to the real data symbols, and , at the input. The delay of in the prototype filters in the quadrature paths has been shifted to the corresponding filter bank output.

The structure that is presented in Figure 16 consists of a pair for SFBs. Clearly, each of these filter banks may be implemented efficiently using the basic polyphase structure of Figure 14. This implementation is presented in Figure 17. It involves two separate polyphase structures. However, a closer look at the inputs to the polyphase structures reveals that they are modulated versions of two real vectors: As demonstrated next, one may take advantage of this property to combine the two SFBs into one.

Looking back at the synthesis polyphase structure of Figure 14, it may be argued that one can start with the real vectors and at the input to the IFFT block and apply the modulation effect at the output of the IFFT through a circular rotation of the results. In that case, one may start with the computation of IFFT of the real vectors and . It is well known that this pair of IFFTs can be implemented through a single IFFT with a computational complexity of complex multiplications. On the other hand, we note that since the coefficients of the prototype filter are real-valued, the complexity of implementation of two sets of polyphase component filters in Figure 17 involves real by complex multiplications, equivalent to complex multiplications. Adding these results, the total complexity of the SMT transmitter is measured as complex multiplications per each set of complex output symbols. Recall that the length of the prototype filter was assumed to be equal . Here, we have replaced this by for brevity of the expressions. Also, we are only counting the number of multiplications as the measure of complexity. This is because, in practice, each multiplication is usually followed by an addition. Hence, without involving ourselves with details, we argue that the number of additions in each implementation is about the same as the number of multiplications.

It is also worth noting that a number of other authors also have taken note of the same symmetry properties in SMT waveform and accordingly proposed methods of reducing the complexity [7375]. All these works have reached the complexity numbers that are similar to those presented here.

(2) SMT Receiver. In the absence of channel, the analysis of an SMT signal at the receiver can be performed with the same complexity as its synthesis counterpart at the transmitter [74, 75]. However, the presence of the channel destroys the symmetry property of SMT waveform. Hence, the implementation approach used at the transmitter cannot be extended to the receiver. In particular, we note that although the final goal is to extract the real-valued data symbols, the analyzed signals are the preequalized ones and, hence, are complex-valued.

Following the same line of thoughts that led to Figure 17, the receiver side of Figure 6 can be rearranged as in Figure 18, where the AFB blocks follow the structure presented in Figure 15. This structure, yet, does not include the channel equalizers that should be added at each output branch. Also, as noted in the previous section, the equalizers may be single-tap or multitap. Furthermore, one may note that since the pair of AFBs in Figure 18 are the same, but separated by sample delay, they may be combined as one AFB. Moreover, the phase rotations at the output side may be absorbed in the equalizers. Combining these points, we arrive at the SMT receiver structure presented in Figure 19.

The receiver structure presented in Figure 19 has a complexity of one AFB plus equalizers for each set of real symbols, or . Assuming that each equalizer has complex tap weights, and noting that at each equalizer output we need either the real or imaginary part of the result, the total complexity of the receiver structure of Figure 19 is obtained as complex multiplications per each set of real output symbols. Alternatively, we may say the total complexity of the receiver structure of Figure 19 is complex multiplications per each set of complex output symbols.

5.3. Polyphase Structures for FMT

The polyphase SFB that is presented in Figure 14 implements a bank of synthesis filters whose center frequencies are at the normalized frequencies , equivalent to the unnormalized frequencies , respectively. Also, the sampling rate at the SFB output is . Equivalently, the sampling interval at the SFB output is . The same is true for the AFB that is presented in Figure 15.

In FMT, the situation is different. While the symbol rate remains equal to , each subcarrier bandwidth and accordingly the FMT waveform bandwidth increase by a factor of . Therefore, the sampling frequency of the synthesized waveform should be increased to . This, in turn, implies that , and accordingly, in discrete-time, data symbols at the SFB input should be up-sampled fold (it is assumed that is chosen such that will be an integer number). Similarly, at the receiver side, the output of the AFB should be decimated fold. Nevertheless, it should be noted that the center frequency of subcarriers remains unchanged; that is, it will be at , , , .

The fact that introduces some irregularity in the polyphase components which needs a special care. Most of the work in the literature that have addressed this issue have looked at the samples of underlying continuous time and accordingly have discussed how the corresponding polyphase filter bank should be implemented. This treatment, unfortunately, has led to a set of equations that are often hard to follow. Here, we present a solution that directly works with sampled signals/sequences [34]. This solution that has been presented for the first time in [72] is believed to be easier to follow. Here, we quote our original formulation from [72].

(1) FMT Transmitter. Figure 20 presents a discrete-time equivalent of the SFB of Figure 7. Following this figure, one obtains

To proceed, we write the time index as where is the integer part of and , is the remainder of . Similarly, may be written as where is the integer part of and is the remainder of . Next, we use (39) to substitute for in the exponential term in (38) to obtain where we have noted that . We may also note that is the -point IFFT of input symbols for . Using (41) and substituting (40) into (38), we obtain

Equation (42) may be interpreted and accordingly implemented as follows. For each set of input symbols , samples of are calculated. This calculation begins with performing an -point IFFT on the input symbols . The output samples from the IFFT are passed through the polyphase components of the prototype filter , where the polyphase components are with respect to the decimation factor . This process is depicted in a block diagram form in Figure 21. The commutator at the output of Figure 21 takes steps after arrival of each set of input symbols. through are a set of time-varying filters that are picked up from the set of polyphase components through of the prototype filter . For a given time , is taken from the output of and , where and are the integers defined according to (39) and (40), respectively.

Following Figure 21, the complexity of implementation of FMT transmitter, for each set of data symbols, is obtained as complex multiplications.

(2) FMT Receiver. Following the same line of arguments that led to Figure 20, the discrete-time implementation of an FMT receiver is obtained as in Figure 22. From this figure, it follows that, for ,

To proceed, we define where is the integer part of and , is the remainder part of . Using (44), (43) may be rearranged as

According to Figure 22, the recovered symbol is obtained after -fold decimation of . Moreover, in practice, a timing recovery loop determines a timing phase at which should be decimated. Equalization is also necessary to combat the distortion introduced by the channel. A single-tap or a multitap equalizer per subcarrier may be used. In the latter case, one may use a fractionally spaced equalizer to relax on the sensitivity of the FMT receiver to timing phase. In such cases, the decimation factor is smaller than .

Here, we present a general polyphase structure for computation of the samples of , for an arbitrary value of ; hence, any arbitrary choice of decimation factor is made possible. To this end, we write where is the integer part of and , is the remainder of divided by . Using (46) into (45), we get

Following (47), one may propose the analysis polyphase structure that is presented in Figure 23. The commutator takes the successive samples of the received signal and distributes them across the inputs of the time-varying filters through . The filters through are picked up from the set of polyphase components through of the prototype filter . For a given time , . The reader should be reminded that while in Figure 21 the time-varying filters through are picked up from the polyphase components of with respect to the decimation factor , in Figure 23   through are picked up from the polyphase components of with respect to the decimation factor .

Following Figure 23, the complexity of implementation of FMT receiver, for computation of each set of output samples, is obtained as complex multiplications. In FMT, unlike CMT and SMT, the use of symbol spaced multitap equalizer is sufficient. If an tap symbol spaced equalizer is used, one will find the complexity number complex multiplications per each set of detected data symbols.

5.4. Complexity Comparison with OFDM

Being an alternative to the widely adopted OFDM in various standards, it is always of interest and important to compare FBMC methods against OFDM. As noted in Section 1, the much higher quality of the prototype filter in FBMC makes it an ideal choice for many applications where OFDM faces challenges because of the poor frequency response of its prototype filter. Here, we discuss the additional complexity brought to FBMC (compared to OFDM) because of the use of a higher quality prototype filter.

We count the complexity of OFDM as one IFFT, complex multiplications, at the transmitter, and one FFT and single-tap equalizers, complex multiplications, at the receiver. Thus, recalling the complexity numbers that were presented above, one finds that where and , respectively, denote the relative complexity of SMT transmitter and receiver against OFDM. These numbers remain about the same in the case of CMT. To get a better feel of these complexity ratios in practical cases, consider a case where , (i.e., 20% of subcarriers are left as guard band), , and . For this case, we find and . For , the most likely value of if a multitap equalizer is used, . Hence, one may say, CMT and SMT have a complexity that is roughly two to three times that of the OFDM.

Similarly, for FMT, we find that

Comparing (48) and (50), one will find that SMT/CMT and FMT transmitters have the same complexity. On the other hand, comparing (49) and (51), one will find that FMT receiver has a lower complexity than SMT/CMT. In any case, it still remains fair to say FBMC systems, in general, are two to three times more complex than OFDM. Nevertheless, it should be noted that this comparison is with a basic OFDM. It does not consider the cases where advanced signal processing methods may have to be adopted to compensate for other channel effects, for example, carrier frequency offset among different users in an OFDMA network. In such cases, one often finds that OFDM is more complex than FBMC [10].

6. Equalization

In FBMC receivers, equalization is performed at the output of the analysis filter banks. It is often assumed that each subcarrier has a small bandwidth; hence, the channel may be assumed to be flat over each subcarrier band. In that case, a single-tap equalizer per subcarrier would suffice. In cases where the flat gain approximation may be insufficient, a multitap equalizer per subcarrier band may be necessary. For CMT and SMT systems, it is necessary to use a fractionally spaced equalizer. A tap-spacing of half symbol interval is the most convenient choice and, hence, has been suggested in the literature, for example, [43]. FMT on the other hand allows the use of symbol spaced equalizers.

Although, in many scenarios it may appear that a single-tap equalizer per subcarrier is sufficient, in practice, where carrier and clock mismatch between the transmitter and receiver is inevitable, a multitap equalizer can be instrumental. In particular, the difference between the clock used at transmitter to pass the signal samples to a digital-to-analog converter (DAC) and the one used at the receiver to control the rate of samples taken by an analog-to-digital converter (ADC) at the receiver introduces a constant drift in the timing phase of sampled signal. This drift, as explained in the next section, can significantly deteriorate the receiver performance. The use of a multitap equalizer can resolve this problem to a great extent. This is demonstrated through a numerical example in the next section.

Another point that should be emphasized here is the following. Most researchers, who have published in the area of FBMC, have limited their results to the use of one single-tap equalizer per subcarrier band. This has been to emphasize on the simplicity of equalization in FBMC systems and also to stay in par with OFDM. Alternatively, one may look at the use of multitap equalizers in FBMC systems as a mechanism that brings with itself a number of advantages. Firstly, the use of multitap equalizers removes any small frequency selectivity over each subcarrier band and, hence, facilitates adoption of a large symbol constellations and this, in turn, increases the bandwidth efficiency of transmission. Secondly, it allows reducing the number of subcarriers which, in turn, reduces the system (i) complexity (possibly compensating for the added complexity resulting from the use of multitap equalizers), (ii) latency, (iii) sensitivity to carrier frequency offset, and (iv) peak-to-average power ratio (PAPR).

7. Synchronization and Tracking Methods

Synchronization methods are necessary in any receiver to compensate for any difference between the carrier frequency of the incoming signal and the local oscillator used to demodulate it as well as to compensate for any clock mismatch between the transmitter and receiver. Pilot aided and blind synchronization methods have been widely studied in the literature from the beginning of development of data modems [76].

In majority of standards, including those of OFDM and FBMC, the pilot aided approaches have been adopted. For instance, all the OFDM packet formats that have been proposed for WiFi and WiMAX as well as those suggested in LTE and LTE-Advanced start with a short-training periodic preamble, for coarse carrier acquisition. This follows by a long preamble, consisting of two or more similar OFDM symbols, that is used for fine tuning of the carrier, timing acquisition, and channel estimation; for example, see [72, 77]. In addition, to be able to track channel variations during the payload of each packet, it is often proposed to insert distributed pilots in the time-frequency space. Interpolation methods are then used to obtain the channel gain and accordingly the equalizers gain, at the data points [7881].

A few researchers, who have looked at the packet design for FBMC, have also suggested a similar packet format to those of the OFDM [8286]. In particular, a periodic training consisting of a few repetitions of a pilot FBMC symbol is suggested to be used along with a least squares or a maximum likelihood estimator to estimate CFO and symbol timing. This is equivalent to long preamble in OFDM. Moreover, in [82], it is noted that the use of a short preamble, similar to that of OFDM, may be useful for coarse estimation of carrier, so that fine tuning of carrier and timing phase based on the long preamble can be established more robustly. Distributed pilots to track channel variations in payload part of FBMC packets have also been proposed [87, 88]. In the rest of this section, we make comments/suggestions for future research in three directions related to synchronization and tracking methods in FBMC systems.

7.1. Preamble Design

Figure 24 presents the packet format that has been proposed in [8286]. The short preamble consists of a few tones with wider separation than the subcarrier spacing to allow detection of large CFO values. It is used to obtain a coarse estimate of CFO, so that the residual CFO (after removal of the estimated CFO) will be within the range of half symbol spacing. The long preamble, then, may be used for fine tuning of the carrier. The works presented in [8385] do not mention the short preamble, but they make assumption that CFO is within the range of half symbol spacing. Naturally, to satisfy this assumption a mechanism similar to short preamble may be necessary.

There is also some difference in the long preamble proposed in [8385] and the one proposed in [82, 86]. In the context of SMT waveform, the long preamble proposed in [8385] is constructed based on a set of complex-valued pilot symbols that occupy all subcarriers. The long preamble proposed in [82, 86], on the other hand, is real-valued and only occupies the alternate subcarriers; the other subcarriers are left empty. This modification/simplification of the long preamble as demonstrated through numerical results in [86] improves the accuracy of the CFO estimates significantly.

Here, to make suggestions to further improve the preamble design, we first note that the preamble is a packet component that constitutes a loss in the bandwidth efficiency of transmission. Clearly, an attempt to reduce the length of the preamble improves the bandwidth efficiency. The research so far in the above cited works has led to the following observations. (i) A preamble with a set of pilot symbols that are well separated in frequency is needed to assure detection and correction of large CFO values. (ii) A longer preamble that allows a more accurate estimate of CFO may be also necessary. On the other hand, one may realize that to obtain the CFO and symbol timing estimates, it is not necessary to send pilots on all the subcarriers. The number of parameters that characterize the channel response (determined by the length of the channel impulse response) is usually much smaller than the number of subcarriers. This means, a fraction of subcarriers, whose number is greater than or equal to the length of the channel impulse response, suffice for pilot subcarriers in the preamble. This, in turn, implies that one may propose to combine the short and long preambles into one preamble, and by taking such approach, obviously, a shorter preamble will result. We note that the preamble proposed in [82, 86], compared to those proposed in [8385], may be viewed as a step forward in this direction. While the works in [8385] use all subcarriers to transmit the preamble pilots, [82, 86] allocate pilots only to alternate (even or odd) subcarriers.

Another consideration that needs one’s attention in designing an effective preamble is the choice of the prototype filter that is used for preamble. Assuming that the preamble consists of only a long preamble, the preamble length is determined by the length of the prototype filter and the number of its repetitions. Accordingly, a shorter prototype filter leads to a shorter preamble and thus a more bandwidth efficient packet. The above suggestion to space out the pilot subcarriers in the preamble can be taken advantage of to shorten the length of the preamble. This is because wider subcarrier allows one to increase the bandwidth of each subcarrier and accordingly reduce the symbol period and thus the length of the corresponding prototype filter. Nevertheless, it should be also noted that a broader band of each subcarrier can make it prone to some level of frequency selectivity that should be taken into account while identifying the channel parameters.

7.2. Channel Tracking

Tracking of channel variation (including variations due to CFO and clock timing drift), during the payload transmission, is usually made through distributed pilots that are transmitted along with the data symbols. This topic, as noted above, has been well studied for OFDM. However, it has received very limited attention in the FBMC literature. Stitz et al. [88] have studied the problem of pilot-aided synchronization and tracking methods in CMT/SMT-type systems. They have shown that by applying specific pilots designed for CMT/SMT, the carrier frequency offset (CFO), fractional time delay (FTD), and channel response can be accurately estimated. The work presented in [89] has taken a different approach. It takes note of the fact that each signal component at the output of the analysis filter bank, before performing equalization and taking its real (or imaginary) part, has the form of (or ), where arises from ICI and ISI. It is thus argued that if the same (known) symbol is transmitted over two instants of times at the same subcarrier or allocated to the real and imaginary parts of an OQAM symbol in a SMT waveform, the analyzed signal samples give a pair of equations that can be solved to obtain an estimate of .

In [27] (also see [90]), it has been noted that the use of PAM symbols in CMT (and similarly in SMT, if the real and imaginary parts of each OQAM symbol are viewed as a pair of PAM symbols) allows development of a blind equalization algorithm. This algorithm may also be used for channel tracking. Hence, in CMT and SMT channel tracking is possible without any use of pilot symbols. To elaborate more on this possibility, we note that the convergence behavior of the blind equalizer that is developed in [27] was thoroughly analyzed in [91] (also see [92, 93]). It was found that the CMT/SMT blind equalizer has a well-behaved performance surface around its optimal setting. This implies that if the equalizer gain is set near its optimal value and channel subsequently varies slowly, the blind equalization algorithm of [27] can easily track the channel variation. This observation that yet to be thoroughly studied will add an additional advantage to CMT and SMT over OFDM in terms of bandwidth efficiency.

7.3. Timing Tracking

As noted above, any difference between the clock frequency at the transmitter and its counterpart at the receiver introduces a timing drift. The common method of compensating for this timing drift is to add a timing recovery loop to the system. In a single carrier receiver, the addition of a timing recovery loop as part of the receiver processing is a relatively straightforward task and is well understood [72, 94]. In an OFDM system, with a not very long payload, the timing drift may be absorbed by the CP, provided that not the full length of the CP has been consumed by the channel transient response. Drift of timing phase introduces a linear phase across frequency and that can be easily compensated by the single-tap equalizers at the FFT outputs. In FBMC, on the other hand, the absence of CP/guard interval makes an FBMC receiver more sensitive to timing drift. However, fortunately, this problem can be resolved trivially through the use of a set of multitap fractionally spaced equalizers, one per subcarrier.

To demonstrate the impact of per subcarrier fractionally spaced equalizers on resolving the problem of timing drift, we consider a CMT/SMT transceiver system which we simulate following the schematic diagram presented in Figure 25. The samples of the transmit signal are converted from digital to analog using a DAC that operated at a clock rate of . At the receiver, the received signal is sampled through an ADC with a clock that has a rate of . The clock frequencies and are different but typically are close. In practice, their difference may be as large as 50 ppm (part per million). This corresponds to the case where each clock has a precision of 25 ppm.

For the purpose of demonstration here, we consider the case where channel is absent. In that case, the effect of using a pair of DAC and ADC with different clock rates is like resampling the generated waveform with a resampling ratio . This concept can be easily applied in a MATLAB simulation environment that we use, using the function “resample.” The results of simulating a CMT system with 64 active subcarriers and a clock drift of 50 ppm are presented in Figure 26. The results show the signal-to-interference ratio (SIR) averaged over all subcarriers, as one proceed over successive CMT symbols. Timing phase is set to zero (i.e., perfect timing) at the beginning. The drift of timing phase introduces some ISI that cannot be resolved when single-tap equalizers are used but can be compensated through multitap equalizers. Perfect compensation may require a long equalizer. However, a 3- or 5-tap equalizer is sufficient to avoid a significant drop of SIR. We consider an SIR of above 30 dB as acceptable, given that, in typical practice, communication systems are often designed for output SINR values 20 dB or less. A margin of 10 dB should be sufficient to consider ISI/ICI due to timing drift as negligible. For the numerical example presented here, while a single-tap equalizer can preserve the SIR of above 30 dB for the first 400 CMT symbols, the use of a 3-tap and 5-tap equalizer increases this range to 1200 and 2500 CMT symbols, respectively.

Another possible solution that may be adopted to resolve the difference between the DAC clock at the transmitter and ADC clock at the receiver may be proposed by taking note of the following observation. An FBMC waveform (being CMT, SMT, or FMT) is a cyclostationary signal whose statistics are repeated every second, where is an integer multiple of the symbol period . In FMT and SMT, , and in CMT, . We conjecture that taking note of this property, one can design a digital resampler circuit that may be added to the output of ADC, at the receiver, for tuning the number of samples per each period of equal to a predefined value. Such solution is yet to be found through a clever study of the underlying signals.

8. Applications

In [19], OFDM and FBMC are contrasted with respect to their suitability for different applications. It is noted that while for many applications OFDM is a perfect choice, FBMC may be a superior choice for other applications. The applications in which FBMC may be preferred are (i) the uplink of multiuser multicarrier systems, (ii) cognitive radios, (iii) doubly dispersive channels, and (iv) digital subscriber lines and power line communications. Detailed discussion on why FBMC is a better choice in these applications is provided in [19]. The rest of this section emphasizes on an emerging class of communication systems, massive (multiuser) MIMO, whose significance as an enabling technology for 5G has recently been discussed in the literature and other forums.

Massive MIMO, in essence, is a multiuser technique, somewhat similar to code division multiple access (CDMA). In its simplest form, each mobile terminal (MT) has a single antenna, but the base station (BS) has a large number of antennas. The spreading code for each user is then the vector of channel gains between the respective MT antenna and the multiple antennas at the BS. Accordingly, by increasing the number of antennas at the BS, the processing gain of each user can be increased to an arbitrarily large value. In the pioneering work of Marzetta [95] it is noted that, in the limit, as the number of BS antennas tends to infinity, the processing gain of the system tends to infinity and, as a result, the effects of both noise and multiuser interference (MUI) are completely removed. Therefore, the network capacity (in theory) can be increased unboundedly by increasing the number of antennas at the BS [95].

Motivated by Marzetta’s observations [95] multiple research groups in recent past have studied a variety of implementation issues related to massive MIMO systems, for example, [96100]. Also, different groups have started the development of testbeds to confirm the theoretical observation of [95], in practice, for example. An assumption made by Marzetta [95] and followed by other researchers is that orthogonal frequency division multiplexing (OFDM) may be used to convert the frequency selective channels between each MT and the multiple antennas at the BS to a set of flat fading channels. Accordingly, the flat gains associated with the set of channels within each subcarrier constitute the spreading gain vector that is used for dispreading of the respective data in upstream as well as for precoding of the respective downstream data.

In a recent work [101], embarking on the above concepts, we have introduced the application of filter bank multicarrier (FBMC) to the area of massive MIMO communications. The interesting finding in this work is that unlike in conventional MIMO where applicability of FBMC (CMT and SMT, in particular) is found limited, FBMC offers a number of appealing properties in the application of massive MIMO. In particular, the linear combining of the signal components from different antennas at BS (equivalent to dispreading in CDMA) smooths channel distortion across each subcarrier band. This property, which we call self-equalization, allows one to relax on the requirement of having approximately flat gain for the subcarriers. This, in turn, allows one to reduce the number of subcarriers in an assigned bandwidth. Reducing the number of subcarriers has the following impacts, which all position FBMC as a strong candidate in the application of massive MIMO.(1)The complexity of both transmitter and receiver will be reduced, since the underlying polyphase structures will be based on smaller size IFFT/FFT blocks.(2)The system latency will be reduced. This is because the length and, thus, the corresponding group delay of the underlying prototype filter are reduced.(3)The system sensitivity to CFO will be reduced.(4)The PAPR of the transmit signal will be reduced.

The preliminary research results that are presented in [101], clearly, open up a new research direction for exploration of FBMC techniques in the area of massive MIMO.

9. Conclusions

This paper presented a tutorial review of the class of filter bank multicarrier (FBMC) communication systems, with emphasis on the recent results of the authors research laboratory. It was noted that by building FBMC systems based on well-designed prototype filters, the spectrum of each subcarrier can be contained within a limited bandwidth. This, in turn, allows transmission over noncontiguous bands, a property that makes FBMC an ideal choice for many applications, including the uplink of multiuser multicarrier networks and cognitive radios. It was also pointed out that FBMC systems can be adopted to doubly dispersive channels to achieve a performance far superior to that of OFDM. The polyphase structures that have been proposed for efficient implementation of FBMC systems were reviewed and their complexities were compared with that of a standard OFDM system. It was noted that the complexity of FBMC transmitters is close to (but less than) twice their OFDM counterpart, and the complexity of FBMC receivers is close to (but less than) three times that of their OFDM counterpart. However, it was also noted that, in some applications, for example, in the uplink of multiuser systems where synchronization cannot be made perfectly, hence, multiuser cancellation techniques should be applied, OFDM can be far more complex than FBMC. Moreover, it was noted that more research is needed to better understand the problems related to synchronization, equalization, and tracking of channel variations in FBMC systems. Possible application of FBMC in the emerging area of massive MIMO was also highlighted, and a number of advantages that FBMC offers in this application were identified.

Conflict of Interests

The author declares that there is no conflict of interests regarding the publication of this paper.