Abstract
This paper investigates the signal-to-interference ratio and the achievable rates of underwater acoustic (UA) OFDM systems over channels where time and frequency dispersion are high enough that (i) neither the transmitter nor the receiver can have a priori knowledge of the channel state information and (ii) intersymbol/intercarrier interference (ISI/ICI) cannot be neglected in the information-theoretic treatment. The goal of this study is to obtain a better understanding of the interplay between interference and the achievable transmission rates. Expressions for these rates take into account the “cross-channels” established by the ISI/ICI and are based on lower bounds on mutual information that assume independent and identically distributed input data symbols. In agreement with recent statistical analyses of experimental shallow-water data, the channel is modeled as a multivariate Rician fading process with a slowly time-varying mean and with potentially correlated scatterers, which is more general than the common wide-sense stationary uncorrelated scattering model. Numerical assessments on real UA channels with spread factors around 10−1 show that reliable OFDM transmissions at 2 to 4 bits/sec/Hz are achievable provided an average signal-to-noise ratio of 15 to 20 dB.
1. Introduction
The various collections of underwater acoustic (UA) channels surveyed worldwide highlight the diversity of UA propagation environments and confirm in many cases the bad reputation of these channels as communication media [1–4]. UA communication systems are usually prone to time and frequency dispersion due to multipath propagation and Doppler effects, and the absorption of acoustic waves at high frequencies strongly limits their bandwidth. Finding systems that are robust to the environment, while maintaining acceptable data rates, remains the major difficulty faced by UA communication system designers. For a given propagation channel, this interplay between robustness and data rate results in practice in choosing different modulation schemes according to the importance we give to one or the other characteristic, the optimal trade-off being unknown in most cases. Historically, single-carrier modulations with receivers relying on channel equalizers in the time domain have been widely studied and used in practice for high-speed underwater communications [5]. Recently, multi-carrier (MC) systems such as OFDM (orthogonal frequency division multiplexing) [6–9] have generated much interest due to the simplicity of receivers and the flexibility they offer.
For time-invariant channels, modulation-basis functions of common OFDM signals (e.g., cyclic prefix-based OFDM) can be seen as “eigen” functions of the channel operator and ensure the absence of interference at reception. This way of “diagonalizing” the channel allows the use of simple algorithms for recovering the information from the received signal. For double-dispersive channels, and particularly in the UA context, perfect channel diagonalization can rarely be achieved as the environment is generally random so that the channel eigenstructure differs from one channel realization to another. The channel diagonalization can then only be performed in some approximate sense [10–12] and interference due to time-frequency dispersion becomes inevitable. However, even if UA-OFDM systems can hardly avoid interference, compared to single-carrier approaches, multiplexing the information to be transmitted on a time-frequency grid offers the opportunity to optimize more degrees of freedom. OFDM system design is classically approached from the viewpoint of intersymbol/intercarrier interference (ISI/ICI) through maximization of the signal-to-interference ratio (SIR) [10, 11, 13–15]. For a given set of channels, finding the MC signaling scheme that maximizes the average SIR is a way of designing robust systems that do not require complex equalization algorithms at reception. While the SIR may be a good figure of merit to assess the robustness of communications in doubly dispersive environments, it does not reveal the effect of the chosen signaling scheme on the information rate. The robustness improvement is generally paid back by the loss of spectral efficiency induced by the use of time and/or frequency guard intervals required to limit the interference. For instance, typical UA-OFDM systems use a guard interval between symbols that is lower-bounded by the maximum delay spread of the channel, which often lasts several tens of milliseconds. Compared to the active symbol duration, this interval is usually not negligible and can significantly reduce the transmission efficiency. Therefore, it remains unclear whether or not OFDM systems should tolerate slightly increased interference but operate at higher data rate.
Motivated by recent results in information theory and UA channel modeling, an information-theoretic analysis of the trade-off between maximization of SIR and minimization of guard intervals is proposed in this paper. More precisely, our main goal is to obtain a better understanding of the interplay between interference and the achievable transmission rate of UA-OFDM systems. We pay special attention to UA channels, where time and frequency dispersions are high enough so that ISI/ICI cannot be neglected in the information-theoretic treatment. Our analysis addresses two questions. First, what are the achievable rates of UA-OFDM in highly dispersive channels? And as a corollary, what are the consequences of OFDM design choices on these rates?
The target of our analysis is the investigation of UA-OFDM information rate. To this end, we believe that the following aspects need to be accounted for: the UA channel is selective both in time and frequency; the UA channel cannot systematically be modeled as a wide-sense stationary uncorrelated scattering (WSSUS) process; no perfect channel state information (CSI) is available at the transmitter nor the receiver; interference is not negligible a priori.
These aspects are important as they may have a strong impact on the achievable rates of UA-OFDM systems. (A1) is particularly true in shallow-water environments where the spread factor (product between the delay and Doppler spread) is usually around and can even exceed in some cases [1]. The WSSUS assumption discussed in (A2) implies that the channel correlation function is time-invariant and that the scatterers with different path delays are uncorrelated so that the second-order statistics of the channel are reduced from four to two dimensions [16]. While this assumption may be valid for data transmission at low bandwidth with static communication endpoints, it is not the case with moving platforms and/or when the path-loss associated with each channel path cannot be assumed to be constant over the transmission bandwidth [2, 17]. (A3) corresponds to what is commonly referred to as the noncoherent setting where neither the transmitter nor the receiver knows the current realization of the channel perfectly [18–20]. This assumption has to be contrasted with the coherent setting where a genie provides the receiver with perfect CSI. (We warn the reader that the word coherent is here used in an information-theoretic context and its definition slightly differs from the one used in a demodulation context.) For most channels, the coherent model is not realistic since receivers are not genie-aided, and the effort to acquire the CSI usually induces some rate loss (pilots insertion, channel estimation errors, etc.). In addition, assuming perfect CSI at the transmitter is also optimistic for most practical cases since the low propagation speed of acoustic waves imposes strong constraints on the nature of the CSI provided by a feedback link. Since we focus on highly dispersive channels, we consider in (A4) that interference must be explicitly accounted for in the information theoretic-treatment.
To the best of our knowledge, the information rate of OFDM systems under such general assumptions has not yet been derived. However, recent works presented in [17] and more particularly in [21] give some useful ingredients to derive this rate. In [17], the authors derive bounds on the achievable rate of UA-OFDM systems and consider the aspects (A1) and (A3), and (A2) in part. Although correlated scattering is taken into account in their channel model, they assume wide-sense stationarity. As for interference, it is neglected in their analysis. In [21], Durisi et al. explicitly account for interference terms but present their results for WSSUS Rayleigh fading channels, which is not appropriate for the majority of UA channels [2, 22].
The main contributions of this paper are the following: (i) based on the UA channel characterization presented in [2], we present an exact analysis of ISI/ICI of UA-OFDM systems transmitting in non-WSSUS channels. The channel is modeled as a multivariate Rician fading process with a slowly time-varying mean and with potentially correlated scatterers;(ii) the information rate of UA-OFDM systems is analyzed under the general scenario described by the aspects (A1)–(A4);(iii) in order to extract guidelines useful for UA-OFDM system design, theoretical results are then numerically assessed on rectangular pulse shaping OFDM transmitting over experimental UA channels surveyed at sea.
This paper is organized as follows. Section 2 is devoted to the presentation of the system model and the main assumptions. Signal-to-interference ratio and achievable rates of OFDM systems transmitting over UA channels are derived in Section 3. In Section 4, we discuss the impact of channel and OFDM parameters on the information rate through various numerical experiments. Finally, conclusions are given in Section 5.
2. System Model
2.1. Notation
Throughout this paper, lowercase boldface letters denote vectors, for example, , and uppercase boldface letters denote matrices, for example, . The superscripts and denote transposition and Hermitian transposition, respectively. The Hadamard (elementwise) products of two matrices and is written as . The entries of a matrix are denoted by , where the indices and start at 0. designates a submatrix of a block matrix . The Kronecker symbol is denoted by . We let designate a diagonal square matrix whose main diagonal contains the elements of the vector . The inner product between two signals and is denoted by . designates the distribution of a jointly proper Gaussian random vector [23] with mean and covariance matrix . Finally, denotes expectation.
2.2. Channel Model
We consider a doubly selective baseband equivalent underwater acoustic channel, modeled as a random linear time-varying system that maps input signals into output signals according to the I/O relationship: where is the channel impulse response, and denotes the ambient noise.
According to recent results on the statistical characterization of UA channels [2], the impulse response is modeled as a trend stationary random process so that, for all , and with is called the trend and is a slowly time-varying deterministic component. is a zero-mean wide-sense stationary random process assumed to be Gaussian. This model describes the UA channel as a multivariate Rician fading process with a slowly time-varying mean. can be interpreted as the contribution of (pseudo) deterministic physical phenomena to channel fluctuations (wave undulation, range/depth dependence, bathymetry changes, etc.), and represents the channel fluctuations attributable to scatterers that result in fast fading. Note that since no particular assumption is made about the correlation of scatterers, the model is very general and includes the WSSUS model as a subset. (Under the WSS assumption, the channel impulse response would satisfy (4) as well as . Under the assumption that for , the channel would be said to exhibit delay uncorrelated scattering (US).) Without loss of generality, the channel is assumed to be normalized so that We define the channel Rice factor as the power ratio between the deterministic trend and the random component, that is, We recall that is deterministic and that is wide-sense stationary so that does not depend on .
The ambient noise is assumed to be Gaussian and to result from the mixture of four sources [24]: turbulence, shipping, waves, and thermal noise with nonflat power spectral densities (PSDs). We, therefore, modeled as a nonwhite zero-mean wide-sense stationary Gaussian random process with correlation function: and PSD Simple approximated models for and are given in Appendix A.
In addition to the channel impulse response, another channel function that will be important for our treatment is the delay-Doppler spreading function: The channel I/O relation (1) can now be written as The spreading function is assumed to be compactly supported on a rectangle and satisfies where and denote the maximum time delay spread and the maximum Doppler spread, respectively. This assumption leads to the following definition of the channel spread factor: Note that this assumption eases the analysis proposed in this paper but is only an approximation of real channels behavior. In practice, to set values to and , it is often required to resort to more empirical definitions (e.g., threshold-based definitions, as used in Section 4.2). Various definitions of delay and Doppler spreads for real channels are discussed in [1, Section 4.5].
2.3. OFDM Signal
OFDM signaling schemes can be described by two Weyl-Heisenberg (WH) sets [10, 11, 13]: the one used at transmission, expressed as and the one used at reception, defined as where are the time and frequency shifts of the prototype function and . The signaling scheme is here assumed to be (bi)orthogonal, so that To ease the readability of the results presented in the sequel, we shall restrict our analysis to orthogonal receive pulses (i.e., ). (Note that nonorthogonal receive pulses introduce noise correlation and noise-enhancement that can be harmful for advanced equalization techniques. As an example, cyclic prefix based OFDM receiver are orthogonal whereas zero-padding receivers (with ) are not.)
The transmitted signal is where is the number of subcarriers, and is the approximate duration of the transmitted signal. denotes the data symbols. Since little is known about the exact structure of optimal signaling under the general constraints listed in the introduction, we restrict our analysis to zero-mean, independent and identically distributed (i.i.d.) symbols. We assume that the average power of the input signals is limited so that where is the maximum average power available. The signal-to-noise ratio (SNR) is then defined as where denotes the system bandwidth.
At reception, the output signal is projected onto the set to obtain can be developed as where the second term on the right-hand side (RHS) of (20) represents the intersymbol and intercarrier interference.
The relation (20) can be compactly expressed as where the channel input and output vectors of size are respectively defined by and where is defined analogously. The channel matrix is given by where the matrix block of size satisfies Since we do not neglect interference, the matrix is not diagonal and can be decomposed as follows: where is the direct channel vector corresponding to the main diagonal of and is the ISI/ICI cross-channel matrix containing the offdiagonal terms of .
3. Information Theoretic Analysis of UA-OFDM Systems
Three fundamental characteristics of the sets and are generally involved in the optimization/performance of MC systems:(i)(bi)orthogonality: for an ideal channel where , perfect demodulation is obtained iff , and satisfies the condition (15); (ii)localization: localization of a prototype function involves the Heisenberg uncertainty principle and characterizes its time-frequency concentration so that it directly affects the power of interference observed at reception; (iii)density: spectral efficiency of MC systems is directly proportional to the density of the time-frequency grid that supports the transmission scheme. For instance, adding guard intervals between OFDM symbols reduces the density due to the fact that in this case, but as the product TF gets larger, the power of ISI/ICI diminishes. Ideally, we would like to construct a MC system that is (bi)orthogonal, with well-localized prototype functions (to limit the interference) and with a dense time-frequency grid (to maximize the spectral efficiency). However, these three conditions cannot be satisfied simultaneously due to the Balian-Low theorem [25, Th. 4.1.1.] More precisely, well-localized (bi)orthogonal pulses can only be found for (see [26, Ch. 2] for more details). This loss in spectral efficiency is usually the price to pay to mitigate ISI/ICI over double-dispersive channels. (Note that other approaches, e.g. [13] or [27], privilege localization and spectral efficiency over (bi)orthogonality.)
These elements highlight the difficulty of finding a compromise between a low interference at reception and a maximal use of the degrees of freedom offered by the channel. The optimal trade-off between low interference and high spectral efficiency is a key ingredient in OFDM system design that has yet to be found. To provide some guidelines that will help us to progress toward the optimal solution, we suggest to study the signal-to-interference ratio as well as the information rate of UA-OFDM systems. The information rate, defined as the amount of information that can be transmitted with arbitrarily small error probability, appears as a good figure of merit for system design as it jointly considers interference and spectral efficiency.
3.1. Signal-to-Interference Ratio
The signal-to-interference ratio at the symbol and the subcarrier is defined as where (a) follows from the assumption that the are i.i.d. with zero mean. The numerator represents the average power of the diagonal entries of , and the denominator the power of its offdiagonal entries. Let be the cross-ambiguity function of and defined as follows: The signal as well as the interference power can be expressed as a function of . More precisely, as shown in Appendix B: where is the channel scattering function defined as with . Note that in the case where the scatterers are assumed to be uncorrelated, the scattering function is simplified to .
The first term on the RHS of (28) represents the power carried by the fast fading random part of the channel, and the second term corresponds to the deterministic part of the channel. Similarly, the interference power satisfies where is the interference power corresponding to the random part of the channel and is expressed as Note that does not depend on since, according to (4), is wide-sense stationary.
is the interference power due to the deterministic part of the channel and is given by
Using the above quantities, we now define the average signal-to-interference ratio as
3.2. Achievable Rate
Let be the set of probability distributions on that satisfy the constraints given in (17). The maximum achievable rate for an OFDM system is then given by [28] where is the mutual information between and with , the differential entropy of . In the noncoherent setting, the maximum achievable rate is notoriously hard to characterize analytically. However, by evaluating the mutual information for a specific input distribution, and by relying on the following inequality on mutual information [29]: we can get a lower bound on that yields an information-theoretic criterion useful for the analysis of UA-OFDM systems. Note that the first term on the right-hand side of (35) corresponds to the coherent information rate under perfect channel knowledge at reception, and the second term can be interpreted as a penalty term that quantifies the rate loss due to the lack of channel knowledge.
Theorem 1. The maximum achievable rate of an OFDM system with i.i.d. input symbols satisfying the average-power constraint (17) and transmitting over the channel modeled by (2) is lower-bounded as , where Here, the entries of the noise power vector are defined as and denotes the covariance matrix of the direct channel vector , whose entries are expressed as
Proof. See Appendix C.
Note that the penalty term in (36) only depends on the random component of the channel so that acquiring CSI at reception gets more costly as the channel gets more fluctuating (e.g., estimating gets more difficult as the power of its off-diagonal entries increases).
To get a better insight into the achievable rate, the following corollary presents a simplified scenario of transmission that leads to a more tractable expression of the lower bound.
Corollary 2. In the case where the noise is assumed to be white and the scatterers uncorrelated, the maximum achievable rate is lower bounded as , where Here, denotes the SNR defined in (18), is the PSD of the zero-mean stationary channel process , and is expressed as and satisfies
Proof. See Appendix D.
Note that the scenario depicted in Corollary 2 may be acceptable for systems transmitting in small bandwidth (on the order of kHz), where the noise PSD can be assumed flat and where the propagation loss associated with each channel path is approximately constant over the transmit bandwidth, thus reducing the correlation between channel arrival paths.
4. Numerical Analysis
We next examine the signal-to-interference ratio and the bounds of the previous section in various scenarios. Using a synthetic channel model, impact of time-frequency dispersion on the information rate is first discussed in Section 4.1. Bounds on the information rate applied to experimental double-dispersive UA channels surveyed at sea are then analyzed in Section 4.2. Common OFDM systems with rectangular pulse shaping are used as a framework in our investigation. and are thus defined as where denotes the guard time between OFDM symbols.
4.1. Synthetic Channel Model
To illustrate the impact of channel dispersion on the performance of OFDM systems, we first consider a canonical channel model. It has no particular physical justification, but mimics a bad scenario from the viewpoint of a communication system [20] and will help us to provide general trends on OFDM system robustness against channel dispersion.
We assume the following environment: (i) Rayleigh fading, that is, ; (ii) uncorrelated scatterers with a brick-shaped scattering function ; (iii) white Gaussian noise.
Figure 1 shows the information rate as a function of the channel spread factor defined in (12). The grid parameters and are chosen according to the grid-matching rule [30]: , and is chosen so as to maximize (39). is set to 15 dB and kHz. is randomly chosen between 1 and 50 ms, and between 0.1 and 50 Hz. As expected, the achievable rate of OFDM systems is strongly affected by both the delay and the Doppler spread and decreases as the channel gets more dispersive. Note that as the rectangular prototype function is not equally localized in time and frequency, there is not a single performance point for a given spread factor. For and dB, OFDM systems should be able to communicate at roughly 4 bits/s/Hz, which represents 80% of the achievable rate in an AWGN channel at the same SNR. Such systems can be relatively efficient as long as . However, for , there is no guarantee that any data can be reliably transmitted.

4.2. Experimental UA Channels Recorded at Sea
Three different shallow water channels, recorded in the Atlantic ocean and the Mediterranean sea, are considered. Table 1 summarizes the main characteristics of these channels, and Figure 2 shows the evolution of their respective power delay profiles as a function of time. Channel (a) results from data collected by the DGA-TN (Direction Générale de l'Armement-Techniques Navales) in the Atlantic ocean off Brest (France) in October 2007, and channels (b) and (c) result from sea trials performed by Thales Underwater Systems in the Mediterranean sea off La Ciotat (France) in October 2004. From the raw data and for each channel, the trend is separated from the random component using the empirical mode decomposition method [2]. The maximum time delay spread is estimated as the difference between the longest and the shortest delay, where the average power delay profile exceeds 1% of its maximum value (i.e., taps that are 20 dB below the strongest tap are assumed to result from noise and are artificially set to 0). The scattering function is obtained from a correlogram estimate of PSD. The maximum Doppler spread is similarly defined from the Doppler power spectrum as the maximum delay spread from the average power delay profile. (The Doppler power spectrum is defined as .) To compute the various expectations required to evaluate the bound , a large number of channel realizations are generated using the channel stochastic replay approach presented in [2]. Throughout this analysis, and satisfy the grid-matching rule mentioned previously (i.e., ).

(a)

(b)

(c)
In Figure 3, the average SIR is plotted as a function of for the three channels. It can be noticed that as increases, the duration of the guard interval increases as well, which results in a lower interference at reception. The SIR increases significantly as long as is such that the guard interval duration is lower than the maximum time delay spread. A further increase of produces a slighter increase of the SIR, which indicates that ISI is more detrimental than ICI. The average SIR also depends on the channel properties. As expected, for a given , the larger the spread factor, the smaller the SIR.

Through the evolution of the achievable rate (36) as a function of and the SNR, Figure 4 shows possible trade-offs between interference minimization and loss of signal-space dimensions. It provides a measure of reassurance that current practice in designing OFDM systems for underwater channels is reasonable. That is, oversizing guard intervals duration (i.e., choosing large ) compared to the channel maximum delay spread is not much detrimental to the information rate, whereas a too small can significantly decreases this rate, especially in highly dispersive channels such as channel (c). The results of Figure 4 also suggest that significant rate improvements are possible compared to state-of-the-art UA-OFDM systems. For instance, in channels (b) and (c), reliable OFDM transmissions at 2 to 4 bits/sec/Hz are achievable provided an average signal-to-noise ratio of 15 to 20 dB, whereas in the same SNR range, single-input single-output UA-OFDM systems usually operate with a spectral efficiency around 1 bit/sec/Hz [6–8]. The lower bound (36) obtained for channel (a) corroborates the results of the previous subsection related to channels with small spread factors, that is, over such channels we should be able to communicate at 80% of the theoretical rate obtained over AWGN channels.

(a)

(b)

(c)
UA-OFDM systems are not genie-aided and have to spend some resources to acquire CSI at reception, with the consequence of decreasing the data rate. Insights on how CSI impact the information rate can be obtained through the numerical analysis of the ratio , where is defined as the achievable rate of UA-OFDM systems with perfect channel knowledge at reception. According to Appendix C, we have which corresponds to the first term on the RHS of (36). Note that is also an upperbound on . (This can easily be shown by noticing that .) As shown in Figure 5, the penalty induced by the absence of CSI is stronger for channels with larger Doppler spread (estimating gets more difficult as the channel starts to fluctuate more rapidly) and can lead up to a 30% rate loss for a SNR of 20 dB. In addition, decreases with the SNR, which indicates that CSI acquisition may become a rate-limiting factor at high SNR.

5. Conclusions and Perspectives
The information-theoretic analysis provided in this paper led to the following conclusions: (i) the information rate decreases with the channel spread factor but remains acceptable (i.e., greater than 1 bits/sec/Hz) as long as this factor is smaller than and the signal-to-noise ratio is greater than 15 dB;(ii) numerical assessments on real UA channels with spread factors around showed that reliable OFDM transmissions at 2 to 4 bits/sec/Hz are achievable provided an average signal-to-noise ratio of 15 to 20 dB;(iii) current practices in designing OFDM systems for underwater channels are reasonable. More precisely, slightly oversizing guard intervals duration compared to the channel maximum delay spread is not much detrimental to the information rate, whereas underestimating this duration can be devastating.
Although quite realistic, the system model used in this paper could be more constrained. In particular, to strengthen our results, it would be interesting to add to our model a peak-power limitation, as in [31]. It is well known that OFDM systems can be sensitive to this limitation when power amplifiers do not operate with a large backoff. One way to tackle the problem would be to consider, in the information theoretic-treatment, the nonlinear distortion due to possible clipping as additional noise. Another point that deserves further attention is to study the information rate bound as a figure of merit for pulse-shaping optimization. While experimental results showed that large rates can be achieved with rectangular pulses, the bound provided in this paper could be tightened by maximizing it over all Weyl-Heisenberg sets.
Appendices
A. Noise Model Approximation
To model the ambient noise in the sea, four sources are usually considered: turbulence, shipping, waves, and thermal noise. These four noise components can be modeled by a colored Gaussian noise with the following empirical power spectral density (PSD) given in dB re per Hz as a function of frequency in Hz [24]: where is the shipping activity, and is the wind speed in m/s. The baseband equivalent noise PSD, as defined in (8), is then given by where is the carrier frequency corresponding the th subcarrier.
As noticed in [24], in the frequency region where most OFDM communication systems operate (1 kHz to 100 kHz), the noise PSD decays almost linearly on the logarithmic scale. (Surface motion, caused by wind-driven waves is the main contributor to the noise in that frequency range.) This indicates that a simple expression may be found for the autocorrelation function . In fact, by expressing as where and , we get the following PSD [32]: that turns out to be a good approximation of the noise PSD in frequency range of interest. This approximation is shown in Figure 6 with , and is chosen such that the noise powers of models (A.2) and (A.4) perfectly match at 10 kHz.

B. Computation of the Signal and the Interference Power
For all , we have that where (a) and (b) follows from (9), (13), and (14), and (c) follows from the change of variables and from (27).
From this expression, we can now derive the signal power where (a) follows from (B.1) and (9). In (b), we use that , and we implicitly assume that the prototype functions and have a compact support and that the channel average component is approximately constant (in ) over that support. If we consider rectangular prototype functions, the duration of their support is upper-bounded by , which represents a few tens or hundreds of milliseconds. This has to be compared with the fluctuation period of , which is rather a few tens or hundreds of seconds [2]. Note that, theoretically, some prototype functions can have an infinite support. However, the contribution to the received power from the part of the pulse that has a support greater than few times the symbol period is rather negligible (e.g., for most infinite-length prototype functions such as Gaussian, Raised-Cosine, etc., we usually have that ). (d) holds because, according to (4), the zero-mean random part of the channel is wide-sense stationary so that , where is the channel scattering function defined in (29). Note that in the case where the scatterers are assumed to be uncorrelated , so that (B.2) simplifies to
Similarly to (B.2), the interference power can be derived from the following development: where (a) follows from (B.1) and (9). For (B.2) and (B.4) alike, in (b) we assume that is approximately constant over some period of time. We here consider that does not fluctuate much over the duration that corresponds to the maximal time difference between two interfering OFDM symbols, that is, . Once again, this assumption is not restricting since for most OFDM systems the duration only represents a few tens or hundreds of milliseconds in worst-case scenarios. In the case where the scatterers are assumed to be uncorrelated, (B.4) simplifies to
C. Proof of Theorem 1
A lower bound on can be obtained by evaluating the mutual information for a specific input distribution. Specifically, is chosen such that . The proof of Theorem 1 next relies on the following information theoretic inequality [29, 33]:
C.1. Computation of
The computation of is straightforward since (i) conditional on , is distributed according to a complex Gaussian distribution with a covariance matrix equal to ; (ii) conditional on and , is complex Gaussian with a covariance matrix equal to . The entries of the noise covariance matrix are given by Here, (a) is based on Appendix A where it is shown that can be well approximated by a function that decays very fast compared to common pulse durations (i.e., ). (b) follows from the orthogonality of the receive pulse and (c) from (8). Consequently, where the entries of the vector are defined as
C.2. Computation of
The offdiagonal elements of being generally nonnull in highly dispersive environments, the derivation of is not that easy. Influenced by [21], we next seek an upper bound on the penalty term by splitting into an interference-free part and an interference-only part, so that where are two independent random vectors such that and , with .
Let us note that In (a) and (b), we used the data processing inequality, (c) follows from the chain rule, (d) holds because and are conditionally independent given , in (e) we expressed mutual information as a function of entropy, (f) holds because and are conditionally independent given and , and also because and are conditionally independent given and . Finally, the fact that conditioning reduces entropy leads to (g).
Using that is Gaussian given and , and as a consequence of Jensen's inequality, can be upper-bounded as where denotes the covariance matrix of the direct channel vector . From (B.1), we can express the entries of as
We next seek an upper bound on . Let be the conditional covariance matrix of the vector given . being Gaussian given , using Hadamard's and Jensen's inequalities, is then upper-bounded as follows: where the last equality holds because the input symbols are i.i.d. with zero mean, so that with the interference power due to the random part of the channel as defined in (31). Equation (C.9) can be further simplified by noticing that and do not depend on ; therefore,
From (C.1), (C.3), (C.7), and (C.11), and for all , can be lower-bounded as follows:
The bound is then tightened by choosing that minimizes the penalty term, which concludes the proof.
D. Proof of Corollary 2
D.1. White Noise Assumption
Under the white noise assumption, the noise PSD is flat over the entire bandwidth so that , where is a constant. In that case, the entries of vector are all equal to , and the SNR satisfies . Given that , from Theorem 1 we have
D.2. Uncorrelated Scattering Assumption
In the case where the scatterers can be assumed as uncorrelated, the lower bound on the information rate can be further simplified.
First, the channel scattering function is reduced from three to two dimensions so that does not depend on anymore and is expressed as
Second, the covariance matrix becoming block-Toeplitz, the extension of Szegö's theorem to two-level Toeplitz matrices can be applied, that is [34, Th. 3]: where is the power spectral density of the zero-mean multivariate random process , with , that is, By noticing that the entries on the main diagonal of are all equal and by applying Hadamard's inequality, we have where is the PSD of the zero-mean stationary channel process and is expressed as Corollary 2 is then obtained by noticing that .
Acknowledgments
The authors are grateful to the DGA-TN for providing part of the experimental data used in this paper. This work was supported in part by the ONR MURI Grant N00014-07-1-0738, the ONR Grant N00014-10-1-0576, and the Grant NSF 0831728.