- About this Journal ·
- Abstracting and Indexing ·
- Advance Access ·
- Aims and Scope ·
- Annual Issues ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents

International Journal of Distributed Sensor Networks

Volume 2013 (2013), Article ID 471962, 16 pages

http://dx.doi.org/10.1155/2013/471962

## Enhanced Mobile Multiple-Input Multiple-Output Underwater Acoustic Communications

^{1}Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL 32611, USA^{2}MathWorks Inc., Natick, MA 01760, USA

Received 12 January 2013; Revised 22 April 2013; Accepted 15 May 2013

Academic Editor: Lei Shu

Copyright © 2013 Kexin Zhao et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

This paper focuses on mobile multiple-input multiple-output (MIMO) underwater acoustic communications (UAC) over double-selective channels subject to both intersymbol interference and Doppler scaling effects. Temporal resampling is implemented to effectively convert the Doppler scaling effects to Doppler frequency shifts. Under the assumption that the channels between all the transmitter and receiver pairs experience the same Doppler frequency, a variation of the recently proposed generalization of the sparse learning via iterative minimization (GoSLIM) algorithm, referred to as GoSLIM-V, is employed to estimate the frequency modulated acoustic channels. GoSLIM-V is user parameter free and is easy to use in practical applications. This paper also considers turbo equalization for retrieving the transmitted signal. In particular, this paper reviews the linear minimum mean-squared error (LMMSE) based soft-input soft-output equalizer involved in the turbo equalization scheme and adopts a fast implementation of the equalizer that achieves negligible detection performance degradation compared to its direct implementation counterpart. The effectiveness of the considered MIMO UAC scheme is demonstrated using both simulated data and measurements recently acquired during the MACE10 in-water experiment.

#### 1. Introduction

Achieving reliable underwater acoustic communications (UAC) with high data rate is difficult owing to the unique challenges imposed by the underwater acoustic environment [1, 2]. In typical UAC, the difference in the propagation time between the earliest and latest arrivals could span tens to hundreds of symbol periods [3], which translates into long channel impulse response (CIR) and severe intersymbol interference (ISI) at the receiver side. Moreover, the presence of Doppler effects, owing to the relative motions between the transmitter and receiver platforms and the dynamic underwater acoustic medium, induces temporal scaling (stretching or compression) to the transmitted signals [4]. Doppler-induced scaling effects impair the reliability of UAC, especially in the case of a phase-coherent detection scheme [3]. Furthermore, the scarcely available bandwidth permitted by the acoustic channel imposes an upper bound on the attainable symbol rate [2]. Therefore, the pursuit of high data rate in UAC leverages the multiple-input multiple-output (MIMO) scheme, which offers enhanced reliability and/or increased data rates compared to its single-input counterpart [5–7]. The focus of the present paper is on effective mobile MIMO UAC over double-selective acoustic channels suffering from both ISI and Doppler scaling effects.

Converting the double-selective channel into an ISI channel via temporal resampling is an effective way to tackle mobile UAC difficulties [4]. Although the Doppler scaling effects can be largely mitigated via such a temporal resampling process, the residual Doppler still causes frequency shift on the received measurements. Coherent UAC requires the receiver to acquire knowledge of the underlying channel after temporal resampling via channel estimation [7]. Channel estimation could be conducted either in the training-directed mode, using known training sequences, or in the decision-directed mode, using the detected payload symbols [5, 6]. A preferable tool to characterize a channel subject to both ISI and Doppler frequency shift is the scattering function (SF), which essentially decouples the acoustic channel into a bank of paths that experience different delays and Doppler frequencies [8]. The major concern in SF-based channel estimation is that the problem becomes over parameterized with too many degrees of freedom. It is practically more beneficial to look for a channel model with the smallest number of parameters, but one that still sufficiently reflects the defining characteristics of the acoustic channel of interest. Along this line of thought, it is assumed in [9, 10] that, at each receiver, the channel taps for all the transmitters experience the same Doppler frequency, but different receivers experience different Doppler shifts. The number of unknowns in the frequency dimension, as a consequence, is significantly reduced. Under this assumption, CIRs and the underlying Doppler frequency could be estimated in a separate manner [9] or in a joint manner by employing the generalization of the sparse learning via iterative minimization (GoSLIM) algorithm [10]. It is demonstrated in [10] that GoSLIM outperforms the separate estimation algorithm proposed in [9] in terms of estimation accuracy and robustness against suboptimal training sequences.

In [11], the aforementioned channel model is further simplified by assuming that the channel taps for all the transmitter and receiver pairs experience the same Doppler frequency. As a consequence, the impact of the Doppler frequency shift on the received measurements across all the receivers is taken into account through one unknown common frequency. Accordingly, a variation of GoSLIM, referred to as GoSLIM-V (V stands for variation), is proposed for channel estimation in [11]. Like GoSLIM, GoSLIM-V addresses sparsity through a hierarchical Bayesian model, and because GoSLIM-V is user parameter free, it is easy to use in practical applications. It is demonstrated in [11] that the employment of GoSLIM-V not only reduces the overall complexity in the channel estimation stage but also slightly improves the detection performance compared to its GoSLIM counterpart. Due to this reason, GoSLIM-V is used as the channel estimation algorithm in the present paper.

Following the channel estimation is the design of the detection scheme for extracting the transmitted signals. The channel-induced phase shift should be first compensated out using the Doppler frequency estimate [9]. Such phase compensation task, along with the aforementioned temporal resampling process, effectively converts a double-selective channel subject to both Doppler scaling effects and ISI to an ISI channel, which allows for the employment of various equalization techniques that can effectively combat ISI. We use a linear minimum mean-squared error (LMMSE) based filter for symbol detection. In a MIMO setup, on top of ISI, multiple simultaneously transmitted signals act as interferences to one another. Therefore, interference cancellation scheme also plays a critical role in the overall detection performance. A hard decision based interference cancellation scheme, including vertical BLAST (V-BLAST) [12] and RELAX-BLAST [5], subtracts out the hard decisions of detected signals from the received measurements to aid the detection of the remaining signals. By combining V-BLAST with the cyclic principle of the RELAX algorithm [13], RELAX-BLAST provides superior detection performance over V-BLAST at the cost of slightly increased complexities [5, 6].

The detection performance can be further enhanced by employing a soft interference cancellation scheme, including turbo equalization [14–16]. For a receiver employing turbo equalization, both the equalizer and decoder involved are configured as soft-input soft-output. The detection performance improves as the soft information cycles between the equalizer and decoder. The main drawback of the turbo equalization scheme is the increased computational complexity compared to its hard-decision-based counterparts. To address this problem, we consider a low complexity approximation of soft-input soft-output equalizer [14]. We will show via numerical and experimental examples that the employment of the proposed approximate equalizer enjoys a computational complexity comparable to RELAX-BLAST and provides only slightly degraded detection performance compared to an exactly implemented turbo equalizer.

The rest of the paper is organized as follows. Section 2 presents a system outline. Section 3 describes a model for the acoustic channel subject to both ISI and Doppler scaling effects and reviews the temporal resampling procedure. Section 4 formulates the channel estimation problem in both training-directed and decision-directed modes and then introduces GoSLIM-V as the channel estimation algorithm. Section 5 first formulates the symbol detection problem and then details the LMMSE based soft-input soft-output equalizer and its low complexity approximation. Section 6 presents the simulation results of the turbo equalization scheme, followed by the experimental results obtained from analyzing the MACE10 in-water measured data. The paper is concluded in Section 7.

*Notation*

Vectors and matrices are denoted by boldface lowercase and uppercase letters, respectively, denotes the Euclidean norm of a vector, is the modulus and is the complex conjugate of a scalar. and denote the transpose and conjugate transpose, respectively, of a matrix or vector, denotes an identity matrix of appropriate dimension, and denotes the estimate of . represents a diagonal matrix in which the elements of are on the diagonal. and represent the real and the imaginary components of a complex-valued scalar, respectively. denotes the Kronecker product of two matrices and . Other mathematical symbols are defined after their first appearance.

#### 2. System Outline

Consider an mobile MIMO UAC system equipped with transmit transducers and receive hydrophones. The transmitted payload sequences are divided into multiple blocks, each of which is encoded separately. Figure 1(a) demonstrates the construction of a single payload symbol block (the construction of other blocks follows the same procedure). Denote as the th source bit for . are first fed into a rate convolutional encoder with generator polynomials and . The encoded bits are then passed to a random interleaver, followed by a quadrature phase-shift keying (QPSK) modulation using Gray code mapping. In Figure 1, interleaver and deinterleaver modules are represented by and , respectively. Next, the so-obtained QPSK payload symbols are demultiplexed into the payload blocks, each consisting of symbols, across the transmitters in a round-robin fashion (this is where the “DEMUX” module in Figure 1(a) comes into play). More specifically, in our design, corresponds to the th symbol sent by the th transmitter, denoted as , for and . Accordingly, we denote and as the two consecutive interleaved bits in that map to according to the formula given below: Since , the support of is a 4-element alphabet set .

The structure of a receiver employing a turbo equalization scheme is shown in Figure 1(b). The measurements acquired by the receive hydrophones are first resampled, followed by channel estimation and phase compensation. After phase compensation, the double-selective channel is converted to an ISI channel, and the turbo equalization scheme is employed herein to retrieve the transmitted information. The superior detection performance promised by turbo equalization is mainly due to its mechanism of cycling soft information between the equalizer and the decoder [14]. Accordingly, turbo equalization consists of two key modules, namely, a soft-input soft-output equalizer and a soft-input soft-output decoder [17]. The soft information of a generic bit , commonly known as the log-likelihood ratio (LLR), is defined as
where represents the probability of being 0. As shown in Figure 1(b), the multiplexed and deinterleaved version of , the *a posteriori extrinsic* LLR generated by the equalizer, forms the *a priori* inputs to the decoder. Conversely, the interleaved and demultiplexed version of , the *a posteriori extrinsic* LLR generated by the decoder, serves as *a priori* information to the equalizer. The subscript or reminds us that the LLRs are generated by the equalizer or the decoder, respectively. The soft information is cycled between the equalizer and the decoder multiple times before making hard decisions on the source bits. Note that the interleaver and deinterleaver involved at the transmitter and receiver have the same structure, whereas the “DEMUX” module inside the dashed rectangle in Figure 1(b) is different from that in Figure 1(a) in the sense that the former and the latter demultiplex, respectively, the soft information and QPSK symbols . Once , the hard decisions on the source bits, are available, we follow the steps in the symbol generation process shown in Figure 1(a): are fed into the convolutional encoder, followed by random interleaving, QPSK mapping, and demultiplexing. This way, an error free decoding ensures a perfect recovery of the transmitted QPSK symbols . The recovered payload symbols will be used in the decision-directed channel estimation stage; see Figure 1(b).

#### 3. Double-Selective Channel with Doppler Scaling Effects

In this section, we start with the modeling of the double-selective channel suffering from both ISI and Doppler scaling effects. Then we describe the temporal resampling procedure to mitigate the Doppler scaling effects. After that, we provide a practical approach to estimate the Doppler scaling factor.

##### 3.1. Channel Model

By adopting a single-carrier communication scheme, at the th transmitter, the continuous baseband signal (generated by passing the discrete payload symbols to a pulse shaping filter) and its corresponding frequency modulated signal are related through where represents the carrier frequency. For simplicity, the pulse shaping filter, frequency modulation, and real component extraction operation are not shown in Figure 1(a).

Due to multipath effects, the actual transmitted signals can reach the receive hydrophones via different propagation paths with different delays. Herein, the underlying acoustic channel between each transmitter and receiver pair is characterized by resolved paths. The th path between the th transmitter and th receiver pair (, , and ) will affect the transmitted signal in three aspects, namely, amplitude attenuation, Doppler scaling, and delay, which are denoted, respectively, by three real-valued scalars , , and . The signal transmitted via the th path and acquired by the th receiver can be written as . By taking into account all of the transducers and resolved paths, the received signal at the th hydrophone can be expressed as (for simplicity, the noise term is omitted for the time being)

We assume that the propagation paths for all the transmitter and receiver pairs experience a common Doppler scaling factor and the resolved paths are synchronized among all the transmitter and receiver pairs, that is, and (interested readers are referred to [18] for a detailed treatment of synchronization procedure). By using these assumptions, (4) reduces to Substituting (3) into (5) yields

##### 3.2. Temporal Resampling

By resampling the received measurements using a factor , the resampled signal is given by [4, 19, 20] Then, the baseband received signal , which is related to via , can be expressed as where represents the th channel tap between the th transmitter and the th receiver pair, for , , and . It can be readily verified that as long as , we have . Accordingly, (8) can be approximated as One observes from (9) that effective temporal resampling (meaning ) converts the Doppler scaling effects to Doppler frequency shifts with the frequency given below:

Therefore, the determination of the resampling factor plays a crucial role in the effective mitigation of the Doppler scaling effects.

##### 3.3. Resampling Factor Estimation

We take advantage of the preamble and the postamble of a data packet to estimate [19] (the structure of a data packet will be discussed in Section 6). By cross-correlating the received signal with the known preamble and postamble, the receiver estimates the time duration of a packet [4]. By comparing with , the duration of the same packet at the transmitter side, the Doppler scaling factor can be estimated as Although this method is conceptually simple and easy to implement, its accuracy is sensitive to the signal-to-noise-ratio (SNR).

More accurate Doppler scaling factor estimate can be achieved via channel estimation instead of cross correlation. Based on the CIRs estimated from the two measurement segments in response to the preamble and postamble, the change in the time duration imposed on the packet can be inferred from the tap shift of the principal arrivals of these two CIRs. Then the Doppler scaling factor estimate can be computed as

The obtained using (12) is more robust against the noise contamination than the one from (11). We will show later on in Section 6.2.1 via the MACE10 in-water experimental data that the method in (12) works well in practice.

#### 4. Channel Estimation

Since the obtained using (12) can never be perfectly accurate, after temporal resampling, Doppler frequency shifts (see (10)) still exist, although Doppler scaling effects become negligible. We start below with the problem formulation of channel estimation in both training-directed and decision-directed modes [5, 6]. Then, we propose the GoSLIM-V algorithm for jointly estimating the underlying CIRs and Doppler frequency.

In what follows, and in (9) are represented in discrete-time form. Unless otherwise stated, it is assumed that the channel taps for all the transmitter-receiver pairs experience the same Doppler frequency .

##### 4.1. Training-Directed Mode

The initial task of the receiver is to acquire knowledge of the underlying channel between all transmitter and receiver pairs using the training sequences. By adopting the cyclic prefix scheme in [7], the training sequence at the th transmitter () is given by where is the core training sequence and the leading symbols form the cyclic prefix. In general, we have . From an amplifier efficiency point of view, it is practically desirable to use unit modulus (unimodular) training sequences, that is, for and .

For MIMO UAC over acoustic channels subject to both ISI and Doppler frequency shifts, the measurement vectors can be written as [8, 21] where contains the synchronized measured symbols (for instance, maps to ) at the th receiver. is given by where and represents additive noise at the th receiver. In addition, characterizes the channel of length between the th transmitter and the th receiver, where and ( is defined after (8)). Finally, the so-called Doppler shift matrix in (14) is constructed as for , where and represent the Doppler frequency and symbol period, respectively.

The ISI and Doppler shift effects can be viewed separately in (14). More specifically, the term indicates the net contribution of ISI channels, while the impact of the Doppler effects on the measurements comes through only, which corresponds to the assumption that all the CIR taps involved (recall that we have transmit transducers, receive hydrophones, and each transmitter-receiver pair corresponds to an -tap channel) experience the same Doppler frequency . The purpose of setting the first diagonal element of to 1 (see (18)) is to eliminate ambiguities. In our example, relative to , a generic measurement, say , experiences a phase shift of .

We express (14) in a more compact form as where and . By stacking up the measurements, (19) can be rewritten as where , , , , and . Then the training-directed channel estimation reduces to estimating and from the measurement vector and known . The subject of synthesizing unimodular training sequences, coupled with the employment of the cyclic prefix scheme, to facilitate ISI channel estimation is thoroughly treated in [6]. The shifted PeCAN waveforms [22] are used as the training sequences in the MACE10 in-water experimentations.

##### 4.2. Decision-Directed Mode

The decision-directed channel estimation problem is only a slight twist of its training-directed counterpart. For the former, we use the hard decisions of the previously estimated payload symbols, instead of the training symbols, to estimate the channels; see Figure 1(b). Accordingly, (14) can still be used, where contains the measurements at the th receiver belonging to the time index interval , and for , where and represent the hard decisions of the first and last previously estimated symbols (some of them could be the known training symbols), respectively, used for updating the channel. (For notational simplicity, is used in both (16) and (22) to represent two similar but different quantities. The use of , however, should be clear from the context.) The tracking length is represented as , that is, the number of rows of . To conform with the matrix dimensions, the Doppler shift matrix now is by , constructed as . Similar to the training-directed mode, the channel estimation problem in the decision-directed mode aims to estimate and from the measurement vector and known formed from the decision-directed in (22).

##### 4.3. Channel Estimation Algorithm: GoSLIM-V

The channel estimation algorithm, in either training- or decision-directed mode, has the generic form given by (20). We remark that the number of elements in , namely, , might vary in different modes. in (20) is assumed to contain circularly symmetric independent and identically distributed (i.i.d.) complex-valued Gaussian random variables with zero-mean and variance , denoted as . The problem is then to estimate (contained in ) and given and . In UAC systems, the channel is usually sparse; that is, although it contains unknowns, many of them can be approximated as zero [23]. We present the GoSLIM-V algorithm to solve this channel estimation problem. Note that since contains the CIRs of all transmitter-receiver pairs, the GoSLIM-V algorithm will estimate them simultaneously.

Consider the following hierarchical Bayesian model: where (23) follows directly from the assumption . Let be the variance of for , , and , and define , , and . Then the covariance matrix in (24) is constructed as .

Furthermore, by considering a flat prior on , , and , the channel vector , Doppler frequency , the covariance matrix (or more precisely, its diagonal elements ), and the noise power can be estimated based on the MAP criterion By combining (23), (24), and (25), and by taking the negative logarithm of the cost function, the optimization problem formulated in (25) becomes which can be solved using an alternating approach; at each iteration, one of the parameters , , , and is updated while keeping the other three fixed. In this way, the single difficult joint optimization problem is divided into 4 simpler separate subproblems. GoSLIM-V keeps iterating until a predefined iteration number is reached. Under mild conditions, the cyclic optimization scheme guarantees that the GoSLIM-V algorithm converges, at least to a local minimum of (26) [24].

The 5 steps of the GoSLIM-V algorithm at the *t*th iteration are outlined below. (1)Given , the optimal that minimizes the cost function in (26) is given by
for , , and . For better numerical stability, we set (or equivalently ) to zero if . (2)Once is available, the CIR is updated as
While inverting , its zero diagonal entries are removed, and the associated columns in are discarded.(3)Next, using the most recently obtained in (28), we estimate the Doppler frequency . For ease of exposition, we denote , where and represent, respectively, the th element of the measurement vector and with , . It is easy to verify that
Since the constant term in (29) is not a function of , minimizing the cost function in (26) is equivalent to solving
Since the summation term within the parenthesis above is nothing but the discrete-time Fourier transform (DTFT) of the sequence evaluated at frequency , is obtained as the location of the dominant peak of the real part of the DTFT. (4)Using the and most recently obtained via (28) and (30), respectively, we finally estimate the noise power as
(5)Set . Go back to Step 1 if is less than a predefined iteration number, or terminate otherwise.

In the training-directed mode, the channel characteristics in general are not available a priori. In our examples, is initialized using the standard matched filter, is initialized as 0, and the noise power is initialized with a small positive number, for instance, . Our empirical experience suggests that the GoSLIM-V algorithm does not provide significant performance improvements after 15 iterations or less.

#### 5. Symbol Detection

In this section, we proceed to study the detection of the payload symbols given the estimates of CIRs and Doppler frequency obtained by GoSLIM-V. The detection task is achieved via two steps: phase compensation followed by turbo equalization. As shown in Figure 1(b), turbo equalization consists of an equalizer and a decoder, both configurated as soft-input soft-output. The decoder is conventionally implemented by the Max-Log-MAP algorithm [17], and our focus herein is on the soft-input soft-output equalizer. We first formulate the symbol detection problem and then describe the phase compensation procedure. After that, we elaborate the LMMSE-based turbo equalization design and discuss its low complexity approximation.

##### 5.1. Problem Formulation

Treating the transmitted symbols as the unknowns and the CIRs and Doppler frequency as known, the measurement vector in (14) can be expressed as [8, 21] where the estimated CIR matrix is given by for and . The entry here represents the estimate of in (17) given by GoSLIM-V at the conclusion of the iteration. Also, The variable represents the time index corresponding to the payload symbols of current interest. Although represents different portions of the received signal in (15), (21), and (35), its use should be clear from the context. Per the discussions following (18), once is available, the estimated Doppler shift matrix in (32) can be constructed as

When detecting symbols, we use the estimates and obtained from the previous channel update and we treat and in (32) as known.

##### 5.2. Phase Compensation

Stacking up all the measurements, (32) can be written as or equivalently as where and , , , , and . The phase compensation task is simply achieved by multiplying to both sides of (38), yielding where and . Given , still has the distribution of since is unitary.

Phase compensation, along with the aforementioned temporal resampling process, effectively converts the original double-selective channel to an ISI channel. Given the phase-compensated measurement vector , the estimated CIR matrix , and , we consider using an LMMSE-based soft-input soft-output equalizer to compute the information of the transmitted signal.

##### 5.3. LMMSE-Based Soft-Input Soft-Output Equalizer

As shown in Figure 2, an LMMSE-based soft-input soft-output equalizer can be functionally divided into four modules. The LLR preprocessor calculates the mean and the variance of each QPSK payload symbol , denoted as and , respectively, from the information and generated by the decoder for and . Next, the transmitted symbol is estimated via LMMSE filtering given and in (39), along with and . Specifically, as demonstrated in Figure 2, the LMMSE filter is applied to the residual signal generated by subtracting out the so-called soft interferences from the phase-compensated measurements. The soft interferences characterize the contributions of all the payload symbols except , the one of the current interest, in terms of soft information. Based on the symbol estimates , the LLR generator provides the LLR outputs and (, ), which will be fed into the soft-input soft-output decoder as LLR; see Figure 1(b). In the following, these modules will be elaborated further.

###### 5.3.1. A *Priori* LLR Preprocessor

In this task, we calculate and from and . According to the definitions, the mean and variance of are given by [14] where denote the four QPSK constellation points of and for ; see the definition of after (1). One can see from (41) that depends on , and the evaluation of in (40) requires for . Since the interleaved bits can be reasonably assumed to be independent of each other due to the employment of the random interleaver (see Figure 1(a)), is determined as the product of the probabilities of the two interleaved bits that map to . For instance, and map to according to (1), and, therefore, , where for a generic interleaved bit we have for and . Equation (42) follows from (2) and .

Plugging (42) into (40) gives which, combined with (41), yields .

###### 5.3.2. LMMSE Filtering

Depending on whether the *a priori* LLR information is incorporated or not, two types of LMMSE filters are studied in the following.

*In the Absence of A Priori Knowledge*

The equalizer is performed in the absence of *a priori* knowledge at the very first iteration before using the decoder. This scenario amounts to setting for and , which implies that and according to (43) and (41) for and . In this case, the LMMSE filter coefficient vector, denoted as , is given by [5, 25]
Here, represents the noise power estimate given by GoSLIM-V at the conclusion of the iteration, denotes the steering vector corresponding to in (38) for , and is the estimate of defined in (17). An estimate of is obtained by applying to the phase-compensated measurement vector obtained in (39) as

*In the Presence of A Priori Knowledge*

In this case, the LMMSE estimate of is given by [14] where represents the LMMSE filter coefficient vector. In (46), each component of is the expected value of the corresponding component of calculated in (40). In (47), the covariance matrix , and Each component of is obtained according to (41).

Equation (46) suggests that the estimation of depends on its own LLR information and , whose impact on comes through and . From the belief propagation theory point of view, the generation of information of a payload symbol needs to avoid such dependency [26]. To achieve this goal, we modify (46) as Compared to (46), the presence of the two additional terms in (49), namely, and , resembles a scenario of and (or equivalently, ) in (46), as if is estimated without incorporating its own LLR information.

Define
Then, (49) can be rewritten as
In (51), the terms within the square brackets correspond to the output of the soft interference generator in Figure 2. To get , the LMMSE filter coefficient vector in (50) is applied to the residual measurement vector . Note that (52) includes (45) as a special case when no *a priori* knowledge is available, that is, for and .

###### 5.3.3. A *Posteriori* LLR Generator

This task calculates the LLR and from the symbol estimates obtained in (45) or (52) for and .

We assume that, given , is a circularly symmetric i.i.d. complex-valued Gaussian random process, that is, , where the mean and variance are calculated, respectively, as and [14]. Under this assumption, the output LLR of the two consecutive bits mapping to is calculated as [14] for and .

Let and . Then in (47) and in (50) can be rewritten as and , respectively. One observes that the derivation of requires the inversion of for each transmitter at each time index, whereas the computation of needs to invert at each time index. Consequently, by following (47) and (50) directly, the computational complexity of calculating is approximately times more expensive than obtaining .

Since , the use of the matrix inversion lemma gives Right multiplying on both sides of (54) yields which, combined with (53), follows Complexitywise, the LLR calculation formula in (56) is preferable over (53) since, as we just remarked, it is more efficient to calculate than . Due to this reason, LLR is calculated according to (56) in our numerical and experimental examples provided later on.

##### 5.4. Low-Complexity Approximate LMMSE Filtering

Although the calculation of *a posteriori* LLR according to (56) is more efficient than (53), it still constitutes the major computational bottleneck in turbo equalization mainly because needs to be calculated at each time index. To further reduce the computational complexity, we consider a low-complexity approximate LMMSE filter whose coefficient vector is given by [14]
where . Since is constant for each transmitter over one payload block (hence the time index is dropped in (57)), the overall complexity of calculating is approximately times faster than deriving according to (47). Substituting in (56) with yields
We hereafter refer to the turbo equalization scheme that calculates the *a posteriori extrinsic* information according to (56) and (58) as exact-LMMSE turbo and approximate-LMMSE turbo, respectively.

Note that matrix inversion is an indispensable stage in calculating the LMMSE filter coefficients in (44), (47), and (57). To expedite the calculation, we can make use of the conjugate gradient (CG) method and fast Fourier transform (FFT) operations, as elaborated in [27]. Although [27] focuses on the efficient calculation of the LMMSE filter coefficients in the form of (44), the extension to a more general scenario in (47) or (57) is straightforward. In the present paper, both exact-LMMSE turbo and approximate-LMMSE turbo are implemented using the FFT-based CG method.

#### 6. Numerical and Experimental Results

##### 6.1. Numerical Results

Consider transmitting four payload blocks simultaneously over time-invariant ISI channels using a MIMO UAC system equipped with transmitters and receivers. Block length is fixed at . The four payload blocks across the transmitters are constructed from a randomly generated binary source sequence of length according to the procedure detailed in Section 2. We simulate frequency-selective channels involved in the MIMO UAC system. To resemble practical UAC scenarios, these simulated CIRs are estimated from MACE10 in-water experimental data and each CIR has taps. CIRs have been normalized to 1, that is, for and . The received data samples are then constructed according to (14). Since Doppler effects are not considered in this example, . The noise vector is assumed to contain circularly symmetric i.i.d. complex-valued Gaussian random variables with zero-mean and variance . The simulation of ISI channels, combined with the assumption that each receiver has perfect knowledge on the channel characteristics , suggests that we can bypass the temporal resampling, channel estimation, and phase compensation modules in Figure 1(b) and apply exact-LMMSE turbo, approximate-LMMSE turbo, and RELAX-BLAST directly to the received measurements. Figures 3(a) and 3(b) show the average coded bit error rate (BER) given by exact-LMMSE turbo and approximate-LMMSE turbo, respectively, along with the RELAX-BLAST performance at different SNRs, where SNR is defined as . Each point is averaged over 500 Monte-Carlo trials. The binary source sequence and the noise pattern vary from one trial to another. The curve labeled as “No Iteration” is obtained by employing the equalizer and the decoder only once, that is, the feedback loop is yet to be formed. In addition, the average coded BER given by RELAX-BLAST is obtained after three iterations. We can see from Figure 3 that both types of turbo equalization schemes effectively reduce the coded BER as the iteration proceeds and significantly outperform RELAX-BLAST, and exact-LMMSE turbo provides only slightly better detection performance than approximate-LMMSE turbo. Complexity wise, the average time required to finish one trial is 18.64 s, 0.49 s, and 0.19 s on an ordinary workstation (Intel Xeon E5506 processor 2.13 GHz, 12 GB RAM, Windows 7 64-bit, and MATLAB R2010b) for exact-LMMSE turbo, approximate-LMMSE turbo, and RELAX-BLAST, respectively. Consequently, approximate-LMMSE turbo is preferred over its exact-LMMSE turbo counterpart since the former provides almost the same detection performance as the latter but with a computational complexity on the same order as RELAX-BLAST.

##### 6.2. MACE10 In-Water Experimental Results

###### 6.2.1. Experiment Specifics

The MACE10 in-water experiment was conducted by the Woods Hole Oceanographic Institution (WHOI) off the coast of Martha’s Vineyard, MA, USA, in June 2010. A source array consisting of 4 transducers was vertically deployed at a depth of 80 m and towed by a vessel. At the receiver side, a 12-element hydrophone array was mounted on a buoy. The vessel moved from the minimum range of 500 m away from the receiving array outbound to the maximum range of 4000 m and then inbound back to the minimum range. The carrier frequency, sampling frequency, and symbol rate employed in the MACE10 experiment were 13 kHz, 39.0625 kHz, and 3.90625 kHz, respectively. By transmitting sequences simultaneously and incorporating the measurements acquired from all of the receiver elements for analysis, we established a MIMO UAC system.

The structure of a transmitted data package is shown in Figure 4. Each package consists of 4 packets. The first packet conveys a grayscale Gator mascot and the subsequent 3 packets are combined from a colored mascot. The RGB components of the colored image were transmitted in the 2nd, 3rd, and 4th packets, respectively. Each pixel of the Gator grayscale image is represented by 5 bits, corresponding to 32 different intensities (e.g., pure white and pure dark pixels are represented by 11111 and 00000, resp.). The 64-pixel by 100-pixel grayscale mascot image, as a consequence, is represented by a total of 32 k source bits. Accordingly, a colored mascot image is represented by 96 k bits. The contrast of the grayscale image, as well as the hue of the colored image, has been carefully adjusted so that the image carries approximately equal numbers of 1s and 0s.

As shown in Figure 4, each packet is constructed as follows: time-marking sequences are placed at the beginning of each packet to facilitate the temporal resampling procedure; two guard intervals, each containing 500 silent symbols, are placed, respectively, before and after the segments containing the payload symbols and training sequences. The payload symbols contain the information of the Gator mascot image. We herein elaborate how to generate the 1st packet from the grayscale Gator mascot image (the packet generation for each of the RGB components of the colored image follows the same procedure). Specifically, the 32 k source bits are first interleaved so that the bits fed into the convolutional encoder module have an equal chance of being 0 or 1; see Figure 5. The so-obtained 32 k interleaved source bits are then divided into 32 groups, each containing 1 k bits. The bits in the group () will be used to construct the th payload symbol block across the 4 transmitted sequences, and the construction procedure follows Figure 1(a). Note that in Figure 5, the depth of the interleavers and is 32 k and 2 k, respectively. Figure 5 illustrates a scenario with . The shifted PeCAN training sequences with length , in conjunction with cyclic prefix symbols, form the training section, which is located between the 16th and 17th payload blocks. This MIMO UAC design leads to a net coded data rate of 11.7 kbps. The data package was transmitted periodically and recorded by the receiver array. A total of 120 epochs were available and they are referred to as “E001”−“E120,” respectively.

To estimate the Doppler scaling factor, we treat the time-marking sequences at the beginning of a packet as its preamble and those at the beginning of the subsequent packet as the postamble. Take the 2nd packet of epoch “E002,” for example. For the channel between the 1st transmitter and the 1st receiver, the superimposed modulus of the CIRs obtained by GoSLIM-V from the preamble and postamble is shown in Figure 6. The indexes of the principal arrivals for the preamble and postamble are 12 and 21, respectively. Hence, the time duration change imposed on the packet is , where is the symbol period defined after (18). Then the Doppler scaling factor can be estimated according to (12).

To assess the performance of the resampling process, the CIR and Doppler frequency evolutions obtained by GoSLIM-V before we resample the 2nd packet of epoch “E002” are shown in Figures 7(a) and 7(c), respectively. In comparison, Figures 7(b) and 7(d) demonstrate the corresponding CIR and Doppler frequency evolutions obtained after resampling the packet, respectively. We can see from Figure 7 that the temporal resampling procedure successfully reduces the Doppler scaling effects to Doppler frequency shifts. The relative speed between the transmitter and the receiver arrays can be estimated as , using a common underwater sound speed of m/s. It is interesting to look at Figure 8 where the vessel speed estimated during the resampling stage is plotted on top of the GPS reference information provided by WHOI (the GPS device was equipped on the moving vessel). The good agreement between these two curves verifies the effectiveness of the resampling procedure we employ. The analysis presented hereafter is based on the resampled measurements.

###### 6.2.2. Performance Evaluation

By fixing the tracking length at , the channel tracking starts with training-directed channel estimation using GoSLIM-V. Then we perform phase compensation on the received measurements, followed by the detection of the first payload symbols contained in the 17th payload block for each transmitted sequence using RELAX-BLAST, exact-LMMSE turbo, and approximate-LMMSE turbo; see Figure 5. Next, the channels are updated in the decision-directed mode using symbols (containing the previously detected payload symbols, as well as a portion of the training sequence as well). With the updated CIRs and Doppler frequency, after phase compensation, the subsequent payload symbols contained in the 18th block are estimated using the same symbol detection scheme. This process continues until all of the 16 payload blocks to the right-hand side of the training sequences are detected. This same tracking scheme can be applied in a reverse manner to the detection of the 16 payload blocks ahead of the training sequences.

We deem a packet to be successfully detected if the resulting coded BER is less than 0.1. After analyzing a total of 480 packets available, Table 1 summarizes the successfully detected packet percentage, the zero BER packet percentage, the coded BER averaged over the successful packets, and the time ratio of the time consumed to process a packet on the workstation specified in Section 6.1 to ( is defined in (12)) obtained using exact-LMMSE turbo, approximate-LMMSE turbo, and the RELAX-BLAST scheme, respectively. The results are obtained by applying 3 iterations for all of the three types of detection schemes considered. One observes from Table 1 that BER-wise, both exact-LMMSE turbo and approximate-LMMSE turbo outperform RELAX-BLAST significantly, compared to exact-LMMSE turbo, approximate-LMMSE turbo greatly reduces the computational time at the cost of slight BER performance degradation, and compared to RELAX-BLAST, approximate-LMMSE turbo improves the BER performance by two orders of magnitude without significantly increasing the computational complexities. These observations are in line with those made from the numerical examples in Section 6.1. Moreover, we analyze epoch “E054” that leads to perfect recovery of both the grayscale and colored mascots (see Figures 9(b) and 9(d)) using either exact-LMMSE Turbo or approximate-LMMSE turbo. In comparison, the grayscale and colored mascots recovered from epoch “E054” using RELAX-BLAST are shown in Figures 9(a) and 9(c), respectively, with the corresponding coded BERs being and . We note that the turbo equalization schemes are highly effective.

To further illustrate the detection performance of turbo equalization, Table 2 shows the coded BER averaged over all of the 480 packets at different iteration numbers obtained by exact-LMMSE turbo and approximate-LMMSE turbo. We can see from Table 2 that the coded BER improves with iteration. Empirical experience indicates that the detection performance for both types of turbo equalization converges after three iterations. Next, we choose one payload block and denote as the LLR soft information of the corresponding 1 k source bits generated by the Max-Log-MAP decoder. Figures 10(a)–10(d) and 10(e)–10(h) show obtained by exact-LMMSE turbo and approximate-LMMSE turbo, respectively, at different iteration numbers. in Figure 1(b) are the hard decisions determined from . Specifically, if then ; otherwise, (see (2)). In Figure 10, the circles indicate bit errors. We can see from Figure 10 that the LLRs of source bits are moving away from zero as the iteration proceeds (the first iteration has the most significant impact), which suggests that with the help of cycling soft information, the decoder is more and more confident about the corresponding source bits being 0 or 1.

#### 7. Conclusions

For double-selective channels encountered in mobile MIMO UAC, we have demonstrated via the MACE10 in-water experimental data analysis that it is reasonable to assume a common Doppler scaling factor imposed on the propagation paths among all the transmitter and receiver pairs when the Doppler effects are mainly induced by the relative motions between the transmitter and receiver arrays. Temporal resampling has been used to effectively convert the Doppler scaling effects to Doppler frequency shifts. A data-adaptive sparse channel estimation algorithm, referred to as the GoSLIM-V algorithm, is used to estimate the underlying CIRs and Doppler frequency in a joint manner. For symbol detection, we have investigated the turbo equalization schemes implemented by the LMMSE-based soft-input soft-output equalizer as well as its low complexity approximation. The latter provides only slightly degraded detection performance but at a significantly lower computational complexity compared to the former and is thus preferred. The effectiveness of the considered approaches has been verified using both numerical and the MACE10 in-water experimental results.

#### Acknowledgments

This work was supported in part by the Office of Naval Research (ONR) under Grant no. N00014-10-1-0054. The authors gratefully acknowledge WHOI for the fruitful collaborations with them to conduct the in-water experimentations and for sharing data with them.

#### References

- M. Chitre, S. Shahabudeen, and M. Stojanovic, “Underwater acoustic communications and networking: recent advances and future challenges,”
*Marine Technology Society Journal*, vol. 42, no. 1, pp. 103–116, 2008. View at Publisher · View at Google Scholar · View at Scopus - D. B. Kilfoyle and A. B. Baggeroer, “State of the art in underwater acoustic telemetry,”
*IEEE Journal of Oceanic Engineering*, vol. 25, no. 1, pp. 4–27, 2000. View at Publisher · View at Google Scholar · View at Scopus - M. Stojanovic, J. A. Catipovic, and J. G. Proakis, “Phase-coherent digital communications for underwater acoustic channels,”
*IEEE Journal of Oceanic Engineering*, vol. 19, no. 1, pp. 100–111, 1994. View at Publisher · View at Google Scholar · View at Scopus - B. Li, S. Zhou, M. Stojanovic, L. L. Freitag, and P. Willett, “Multicarrier communication over underwater acoustic channels with nonuniform Doppler shifts,”
*IEEE Journal of Oceanic Engineering*, vol. 33, no. 2, pp. 198–209, 2008. View at Publisher · View at Google Scholar · View at Scopus - J. Ling, T. Yardibi, X. Su, H. He, and J. Li, “Enhanced channel estimation and symbol detection for high speed multi-input multi-output underwater acoustic communications,”
*Journal of the Acoustical Society of America*, vol. 125, no. 5, pp. 3067–3078, 2009. View at Publisher · View at Google Scholar · View at Scopus - J. Ling, X. Tan, T. Yardibi, J. Li, H. He, and M. L. Nordenvaad, “Enhanced channel estimation and efficient symbol detection in MIMO underwater acoustic communications,” in
*Proceedings of the 43rd Asilomar Conference on Signals, Systems and Computers*, pp. 600–604, Pacific Grove, Calif, USA, November 2009. View at Publisher · View at Google Scholar · View at Scopus - D. Tse and P. Viswanath,
*Fundamentals of Wireless Communication*, Cambridge University Press, New York, NY, USA, 2005. - W. Li and J. C. Preisig, “Estimation of rapidly time-varying sparse channels,”
*IEEE Journal of Oceanic Engineering*, vol. 32, no. 4, pp. 927–939, 2007. View at Publisher · View at Google Scholar · View at Scopus - A. Song, M. Badiey, and V. K. McDonald, “Multichannel combining and equalization for underwater acoustic MIMO channels,” in
*Proceedings of the MTS/IEEE Oceans Conference (OCEANS '08)*, pp. 15–21, Quebec, Canada, September 2008. View at Publisher · View at Google Scholar · View at Scopus - J. Ling, K. Zhao, J. Li, and M. Lundberg Nordenvaad, “Multi-input multi-output underwater communications over sparse and frequency modulated acoustic channels,”
*Journal of the Acoustical Society of America*, vol. 130, no. 1, pp. 249–262, 2011. View at Publisher · View at Google Scholar · View at Scopus - K. Zhao, J. Ling, and J. Li, “On estimating sparse and frequency modulated channels for MIMO underwater acoustic communications,” in
*Proceedings of the 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton '11)*, pp. 453–460, Monticello, Ill, USA, September 2011. View at Publisher · View at Google Scholar · View at Scopus - P. W. Wolniansky, G. J. Foschini, G. D. Golden, and R. A. Valenzuela, “V-BLAST: an architecture for realizing very high data rates over the rich-scattering wireless channel,” in
*Proceedings of the URSI International Symposium on Signals, Systems, and Electronics (ISSSE '98)*, pp. 295–300, October 1998. View at Scopus - J. Li and P. Stoica, “Efficient mixed-spectrum estimation with applications to target feature extraction,”
*IEEE Transactions on Signal Processing*, vol. 44, no. 2, pp. 281–295, 1996. View at Publisher · View at Google Scholar · View at Scopus - M. Tüchler, A. C. Singer, and R. Koetter, “Minimum mean squared error equalization using a priori information,”
*IEEE Transactions on Signal Processing*, vol. 50, no. 3, pp. 673–683, 2002. View at Publisher · View at Google Scholar · View at Scopus - M. Tüchler, R. Koetter, and A. C. Singer, “Turbo equalization: principles and new results,”
*IEEE Transactions on Communications*, vol. 50, no. 5, pp. 754–767, 2002. View at Publisher · View at Google Scholar · View at Scopus - R. Koetter, A. C. Singer, and M. Tüchler, “Turbo equalization,”
*IEEE Signal Processing Magazine*, vol. 21, no. 1, pp. 67–80, 2004. View at Publisher · View at Google Scholar · View at Scopus - P. Robertson, E. Villebrun, and P. Hoeher, “Comparison of optimal and sub-optimal MAP decoding algorithms operating in the log domain,” in
*Proceedings of the IEEE International Conference on Communications*, pp. 1009–1013, June 1995. View at Scopus - J. Ling, H. He, J. Li, W. Roberts, and P. Stoica, “Covert underwater acoustic communications,”
*Journal of the Acoustical Society of America*, vol. 128, no. 5, pp. 2898–2909, 2010. View at Publisher · View at Google Scholar · View at Scopus - B. S. Sharif, J. Neasham, O. R. Hinton, and A. E. Adams, “Computationally efficient Doppler compensation system for underwater acoustic communications,”
*IEEE Journal of Oceanic Engineering*, vol. 25, no. 1, pp. 52–61, 2000. View at Publisher · View at Google Scholar · View at Scopus - P.-P. J. Beaujean and L. R. LeBlanc, “Adaptive array processing for high-speed acoustic communication in shallow water,”
*IEEE Journal of Oceanic Engineering*, vol. 29, no. 3, pp. 807–823, 2004. View at Publisher · View at Google Scholar · View at Scopus - J. C. Preisig, “Performance analysis of adaptive equalization for coherent acoustic communications in the time-varying ocean environment,”
*Journal of the Acoustical Society of America*, vol. 118, no. 1, pp. 263–278, 2005. View at Publisher · View at Google Scholar · View at Scopus - H. He, D. Vu, P. Stoica, and J. Li, “Construction of unimodular sequence sets for periodic correlations,” in
*Proceedings of the 43rd Asilomar Conference on Signals, Systems and Computers*, pp. 136–140, Pacific Grove, Calif, USA, November 2009. View at Publisher · View at Google Scholar · View at Scopus - T. Yardibi, J. Li, P. Stoica, M. Xue, and A. B. Baggeroer, “Source localization and sensing: a nonparametric iterative adaptive approach based on weighted least squares,”
*IEEE Transactions on Aerospace and Electronic Systems*, vol. 46, no. 1, pp. 425–443, 2010. View at Publisher · View at Google Scholar · View at Scopus - W. I. Zangwill and B. Mond,
*Nonlinear Programming: A Unified Approach*, Prentice-Hall, Englewood Cliffs, NJ, USA, 1969. View at Zentralblatt MATH · View at MathSciNet - N. Wiener,
*Extrapolation, Interpolation, and Smoothing of Stationary Time Series*, Wiley, New York, NY, USA, 1949. View at Zentralblatt MATH · View at MathSciNet - R. J. McEliece, D. J. C. MacKay, and J.-F. Cheng, “Turbo decoding as an instance of Pearl's “belief propagation” algorithm,”
*IEEE Journal on Selected Areas in Communications*, vol. 16, no. 2, pp. 140–152, 1998. View at Google Scholar · View at Scopus - J. Ling, X. Tan, J. Li, and M. L. Nordenvaad, “Efficient channel equalization for MIMO underwater acoustic communications,” in
*Proceedings of the IEEE Sensor Array and Multichannel Signal Processing Workshop (SAM '10)*, pp. 73–76, Ma’ale Hahamisha, Israel, October 2010. View at Publisher · View at Google Scholar · View at Scopus