Abstract

We introduce an improved image transmission scheme over wireless channels with flat Rayleigh fading. The proposed scheme jointly optimizes bit power and modulation level to maximize the peak signal-to-noise ratio (PSNR) of the reconstructed image and hence improves the perceptual quality of the received image. In this optimization process, the significance of bits with regard to the overall quality of the image is exploited. The optimality of the proposed algorithm is demonstrated using the Lagrange method and verified through an iterative offline exhaustive search algorithm. For practical implementation, a look-up table is used at the transmitter for assigning the bit power and modulation level to each bit stream according to the received signal-to-noise ratio (SNR) observed at the receiver. The proposed scheme has low complexity since the look-up table is computed offline, only once, and used for any image which makes it suitable for devices with limited processing capability. Analytical and simulation results show that the proposed scheme with jointly optimized bit power and variable modulation level provides an improvement in PSNR of about 10 to 20 dB over fixed power fixed modulation (16-QAM). A further reduction in complexity is achieved by using the average signal-to-noise ratio rather than the instantaneous SNR in selecting the system parameters.

1. Introduction

Recent advances in wireless communications have made it more possible to transmit multimedia traffic (voice, image, video, and data) over high-speed wireless links [1, 2]. This is expected to attract more users of wireless networks with diverse applications such as video on demand, mobile TV, and medical services. Such applications, however, pose many challenges to system design since they are both power- and bandwidth-hungry applications with strict time requirements. It is therefore imperative to develop new transmission schemes that use radio resources as efficient as possible, especially for mobile users. A key point in the design of multimedia systems is how to tailor the transmission and reception schemes according to the type of traffic being sent and the dynamics of the wireless medium. Several alternatives, which can be employed either separately or jointly, have been investigated in the past, such as channel coding, source coding, power and rate control, and adaptive modulation.

Channel coding is an essential constituent of any digital communication system since it allows the system to use less signal power to achieve a target bit error rate (BER). However, channel coding results in an increase in the required bandwidth which consequently results in a reduction in the effective transmission rate. One way to reduce the loss in bandwidth efficiency due to channel coding is by using data partitioning and layered coding in association with unequal error protection (UEP) [3]. In UEP, the coded image information is usually divided into two or more partitions. For instance, the first partition typically contains the DC components and protected with a low code rate followed by one or more partitions containing the AC components that are coded at higher code rates [47]. A variation of the channel coding approach is joint source/channel coding (JSCC) [813] to achieve significant performance improvement, in contrast to Shannon’s information separation theorem [14].

The quality of wireless channels varies due to time-varying path-loss and multipath fading [15]. However, significant improvements can be achieved through opportunistic methods that adapt to varying channel conditions such as power control [16] and rate control [17]. For instance, less transmit power is used when the channel is good while more power is used if the channel is in deep fade. Another algorithm that can be used is to use higher data rates for good channels and lower or even cease transmission when the channel is bad. In [18], a quasioptimal power allocation scheme for multimedia communications over CDMA systems was suggested. Recently, joint power allocation and source coding were proposed in [19] for image transmission over flat Rayleigh fading channels, and a robust channel-based source coder was developed. A joint power allocation and error control scheme for video transmission was also proposed in [20]. Finally, some work was done to jointly optimize the power allocation and allocated bandwidth among users in [21]. These systems require a reliable and fast feedback link from the receiver side about the channel status to allow the transmitter to adjust its parameters (power level, data rate, etc.) according to the channel conditions.

Another approach for improving the quality and data rate of wireless links is to jointly optimize the modulation scheme with some of the aforementioned techniques. For instance, a combined source coding and modulation scheme for image and video transmission was presented in [22]. Besides considering only additive white Gaussian noise (AWGN), the focus was mainly on source rate and modulation level, and no details were presented about bit power allocation. An integrated cross-layer design that jointly optimizes scheduling, modulation, and power control was proposed in [23]. The bit error rate was used as the basis for the modulation level optimization, and the importance of bits to link quality was not considered in that work. Furthermore, no restriction was set on the peak-to-average ratio of the transmitted signal. Another work that optimized the modulation level and transmitted power was presented in [24] with focus on diversity combining. Again, bit error rate was used as the optimization criterion, and the significance of bit importance to the message quality was not considered. Other related work where the different resources are jointly optimized under different systems and channel conditions have been suggested in [2528].

In a recent work presented in [29], the authors proposed a power allocation algorithm based on the importance of different bits for improving image quality. It was shown that the proposed algorithm provides a gain of 3 to 4 dB in PSNR for image transmission over Rayleigh fading channels compared to the fixed power scenario. In this paper, we demonstrate the efficacy of the joint optimization of variable modulation and power allocation for transmission of sensitive information over wireless channels. Thus, we consider the case of uncompressed images. While this might look a simplified assumption, it is worth noting that certain applications require that transmitted and received data should be entirely identical. For instance, in telemedicine, digitized information such as CT images should not be changed through the transmission process, making diagnosis based on transmitted images as truthful as that based on original images. Any losses of image information during transmission might cause degradation in the quality of medical images. This in turn will affect the accuracy of the diagnosis. It is also worth noting that physicians involved in telemedicine are liable for their diagnosis based on transmitted images. This is another reason why differences in image quality before and after transmission should be minimal. Future work could consider the impact of joint optimization of variable modulation and adaptive power allocation on transmission of compressed images.

The advantage of the proposed algorithm is that it did not require any increase in the bandwidth. However, it suffered from an undesirable increase in the peak-to-average power ratio (PAPR) in the transmitted signal. In this paper, we extend the work presented in [29] to higher modulation levels to jointly optimize the modulation level along with power allocation. Specifically, we develop the PSNR analytical relationships and the optimum solution under AWGN and fading conditions. Then, we present an exhaustive search algorithm to iteratively find the optimum solution. The paper then discusses practical implementation of the proposed system based on a look-up table approach. We also demonstrate the applicability of the optimization algorithm regardless of the transmitted image as well as the ability of the system to work based on average rather than instantaneous SNR feedback. Finally, simulations and analytical results to validate the developed models as well as illustrate achievable performance gains are presented and discussed.

The rest of the paper is organized as follow. Section 2 presents the details of the proposed system as well as the signal model. Section 3 describes the iterative optimization algorithm. Simulations and numerical results are presented in Section 4. Finally, the paper is concluded in Section 5.

2. Proposed System Description

We consider the baseband transmission system shown in Figure 1. In this system, raw images are encoded and delivered to mobile clients via a base station or an access point. Initially, a raw image is converted to its digital representation. Then, the signal is sampled, quantized, and coded into binary format and made available to transmission by the 𝑀-QAM system. Each sample is coded into 𝑁 bits. After source coding, the coded bit stream will be presented to the power allocation algorithm. The data is first demultiplexed into 𝑁 parallel streams, with each stream having the bits of similar importance from different samples; that is, most significant bits (MSB) are grouped together, then next MSB, and so forth. As we discuss later, a look-up table is then used to find the optimum energy per bit to be allocated to each stream. All bits in the same stream but from different samples (pixels), that is, having the same significance, will be allocated the same amount of bit energy. After that, each stream will be modulated using an 𝑀-QAM scheme which could either use fixed or variable modulation level. For a fixed 𝑀-QAM system, all streams are transmitted using the same modulation level. The only difference of the proposed scheme from a conventional 𝑀-QAM system is that the symbols will have different energy levels according to the significance of the bits being transmitted. In case of variable 𝑀-QAM, not only the symbol energy is different among the streams but also the modulation level. For instance, some streams may have low modulation level (e.g., 4-QAM) while other streams may have a higher modulation level (e.g., 16-QAM) and so on. Finally, the 𝑁 streams are multiplexed back and sent through the channel. The baseband transmitted signal can be written as𝑠(𝑡)=𝑘=0𝑁1𝑖=0𝑤𝑖𝑔𝑘𝑖[]𝑇𝑡𝑘𝑁+𝑖𝑏,(1) where 𝑤𝑖 is the transmitted power, 𝑔𝑘𝑖(𝑡) contains information about the 𝑖th symbol in the 𝑘th block, and 𝑇𝑏 is the symbol duration. Both 𝑔𝑘𝑖(𝑡) and the bits constituting the 𝑖th symbol depend on the modulation level of the 𝑖th stream. For example, when 4-QAM is used the symbol consists of 2 bits while it consists of 6 bits when 64-QAM is used and so on. The wireless channel is modeled as a flat Rayleigh fading channel with received signal given by𝑟(𝑡)=𝛽𝑠(𝑡)+𝑛(𝑡),(2) where 𝛽 represents the complex channel coefficient with amplitude following the Rayleigh distribution and uniform phase over [0,2𝜋). The additive white Gaussian noise (AWGN) is represented by 𝑛(𝑡) with zero-mean and two-sided power spectral density of 𝑁0/2.

The received signal 𝑟(𝑡) is processed using a maximum likelihood detector (ML) before which the received signal is first demultiplexed into 𝑁 parallel streams so that bits of the same significance, for example, MSBs, and using the same modulation level are grouped together. Each stream is then detected using a ML detector to decide on the received bits. It is assumed that the detector knows the modulation level of each stream in order to detect it. To further illustrate the detection process, consider block 0, that is, 𝑘=0. The sampled output of the matched filter after processing the received signal 𝑟(𝑡) at 𝑡=(𝑖+1)𝑇𝑏 is 𝑟0𝑖=𝛽𝑠0𝑖+𝑛0𝑖. The ML detector estimates the transmitted 𝑀𝑖-QAM symbol as ̂𝑠0𝑖=𝑥𝑖̂𝑗 where ̂𝑗=argmin𝑗=1,,𝑀𝑙{𝑟𝑖𝛽𝑥𝑖𝑗2} and {𝑥𝑖0,,𝑥𝑖(𝑀𝑙1)} is the set of 𝑀𝑖-QAM complex constellation points. Then, the received bits are Ψ𝑀𝑖(̂𝑠0𝑖), where Ψ𝑀𝑖() is a mapping function that maps symbol to bits in the 𝑀𝑖-QAM constellation. Note that the mapping function performs symbol to bits mapping such that the symbols are Gray coded (adjacent symbols differ in only one bit).

In what follows, we give the details of the proposed scheme. Define the power vector 𝑤=[𝑤0𝑤1𝑤𝑁1] and the modulation vector 𝑀=[𝑀0𝑀1𝑀𝑁1]. In a conventional 𝑀-QAM scheme, all bits carry the same amount of energy and use the same modulation level, that is, 𝑤𝑖=𝑤𝑘 and 𝑀𝑖=𝑀𝑘 for 𝑖,𝑘=0,1,2,,𝑁1 and 𝑖𝑘. As will be shown later, this allocation is suboptimal, and we propose to optimize the allocated amount of energy and modulation level to each symbol; that is, we optimize 𝑤 and 𝑀 such that more protection is provided to the most important bits for the reconstructed image quality. In doing so, we keep in mind the following constraints.(1)The average energy per bit is kept the same as that used by the conventional equal energy scheme.(2)The bandwidth efficiency is kept greater than or equal to that of the conventional fixed modulation level scheme.

We remark that the gains achieved by optimizing power and modulation levels will be achieved at the expense of an increase in the peak-to-average power ratio (PAPR) of the transmitted signal which would result in reduction in the power efficiency of the transmitter. However, we will present an iterative offline algorithm that computes the optimum combination of power and modulation while maintaining the PAPR below a certain level. Thus, we can limit the increase in PAPR as desired. This will be discussed in the following section.

3. Proposed Optimization Algorithm

3.1. Optimization Criterion

Minimizing the average bit error rate (BER) during transmission is commonly a main objective when designing a communication system. The rationale behind such a choice is that minimizing the BER will result in better quality of signal transmission. This is not typically the case for image and video communications. From a perceptual quality perspective, the bits used to represent an image or a video frame do not carry the same level of importance. Hence, when errors occur in more important bits the quality of the reconstructed image will be severely degraded when compared to the case of errors happening in less important bits. Thus, the BER is a good quality measure only when all bits have the same importance. A better performance measure in such scenarios would be the peak signal-to-noise ratio (PSNR) of the reconstructed image rather than the BER. The PSNR is an objective quality metric that is most commonly used to assess the quality of reconstructed images and video sequences when transmitted over lossy channels and/or when lossy compression algorithms are involved. The PSNR is defined using the mean square error (MSE) between the original and reconstructed images. We argue that the MSE or alternatively the PSNR is better quality measure than the BER. To assert this argument, consider Figure 2 from a previous work [29]. In [29], we proposed a power allocation algorithm for image transmission over wireless channels. While the reconstructed image in Figure 2(a) has lower BER than the reconstructed image in Figure 2(b), obviously, Figure 2(a) has a worse perceptual quality when compared to Figure 2(b). This is actually a result of the fact that the MSE of the image in Figure 2(b) is lower than the MSE value of which for Figure 2(a). This also means that the PSNR of the image in Figure 2(b) is higher than the PSNR for the image in Figure 2(a).

It is important to note that the MSE is related to the BER through the mean of the received SNR which in turn affects the achieved PSNR. Also note that from the receiver perspective acceptable perceptual quality of the reconstructed image is the most essential factor. Since the PSNR is the commonly accepted objective quality metric in the literature, it is more natural to optimize the PSNR and use it as the quality metric. Therefore, we propose to minimize the MSE (maximize PSNR) rather than the BER in order to achieve better quality of received images. In this process, we need to establish a relationship between the MSE and BER for those applications. For a system with 𝑁 bits per sample, there are 2𝑁 possible sequences to be transmitted. The binary representation of sample 𝑥𝑗 is given by the 𝑗th row of the following 2𝑁×𝑁 matrix:𝐻=000000111101111(3) with elements 𝑗𝑘. The MSE in the received signal is given by1MSE=2𝑁2𝑁1𝑗=0𝑥𝑗̂𝑥𝑗2𝑃𝑥𝑗,(4) where ̂𝑥𝑗 is the estimate of the 𝑗th sample reconstructed after detecting the 𝑁 bits and is used to represent that sample. 𝑃(𝑥𝑗) is the a priori probability that the 𝑗th sample is transmitted. Without loss of generality, consider the transmission of a zero-valued sample (i.e., 00000, which is the first row in (3)). Assume that this sample was received in error. Then, the possibly received 𝑁-bit sequence is one of the other 2𝑁1 sequences in (3) other than 00000. The probability that the 𝑖th sample with a decimal value 𝑖 is reconstructed is given by𝑃𝑆𝑖=𝑁1𝑘=0𝑃𝑘𝛾𝑖0(𝑘)+1𝑃𝑘𝛾𝑖0,(𝑘)𝑖=0,1,2,,2𝑁1,(5) where 𝑃𝑘 is the probability that the 𝑘th bit is in error and 𝛾𝑖0(𝑘) is𝛾𝑖0(𝑘)=0𝑖𝑓0𝑘=𝑖𝑘,1𝑖𝑓0𝑘𝑖𝑘,(6) where 𝛾𝑖0(𝑘) represents the complement of 𝛾𝑖0(𝑘). The MSE for the above case is calculated asMSE0=12𝑁12𝑁1𝑖=1𝑖2𝑁1𝑘=0𝑃𝑘𝛾𝑖0(𝑘)+1𝑃𝑘𝛾𝑖0(𝑘).(7) The MSE for other samples can be obtained following a similar procedure, and the average MSE can be calculated by averaging over all possible transmitted samples. It is possible to show that, on average, all MSE values are approximately the same, and hence (7) will be the average MSE. We note that the PSNR is related to the MSE as follows:PSNR=10log10𝐼2MSE,(8) where 𝐼 is the maximum intensity value. In what follows, we will use the terms MSE and PSNR interchangeably.

Note that the probability that the 𝑘th bit is in error for the 𝑀-QAM system in AWGN is upper bounded by𝑃𝑘𝑎𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0,(9) where “erfc’’ is the error complementary function, 𝐸𝑏/𝑁0 is the average energy per bit-to-noise power spectral density ratio, and 𝑎𝑘 and 𝑏𝑘 are parameters that depend on the 𝑘th bit modulation level 𝑀𝑘 as follows:𝑎𝑘=411/𝑀𝑘2log2𝑀𝑘,𝑏𝑘=3log2𝑀𝑘2𝑀𝑘1.(10)

For the 𝑘th bit within the 𝑁 bits representing a sample, the objective is to find the combination of modulation level 𝑀𝑘 and the transmitted power 𝑤𝑘 that will result in the minimum MSE (i.e., maximum PSNR). To reduce the complexity of the optimization process, we will limit our interest to variable M-ary QAM modulation that can be 4-QAM, 16-QAM, or 64-QAM. Hence, the total number of possible combinations to be considered for the modulations levels is 𝐿=3𝑁. For example all 𝑁 bits can use 4-QAM; 𝑁1 bits with 4-QAM and one bit with 16-QAM; 𝑁1 bits with 4-QAM and one bit with 64-QAM, and so forth. Even with this restriction, we notice that the number of combinations is still high. For instance, for a system with 8 bits per sample, that is, 𝑁=8, we need to consider 6561 possibilities. Although, the search for the optimum level is done off line (as will be explained later) it is possible to significantly reduce the number of possible combinations by imposing two conditions that ensure the maximization of bandwidth efficiency relative to fixed 16-QAM system (used as a bench mark) and adequate error protection of most significant bits as follows.(1)The bandwidth efficiency of the considered combinations is equal to or greater than that of the conventional 16-QAM. This means that combinations with low bandwidth efficiency are excluded from the search. For example, the cases for using 4-QAM for all 𝑁 bits (𝑀=4); 4-QAM for 𝑁1 bits and 16-QAM for one bit; 4-QAM for 𝑁2 bits and 16-QAM for the two bits, and so forth will have lower bandwidth efficiencies compared with the case of 16-QAM for all bits and hence will be discarded. However, combinations with modulation levels such as 64-QAM for all 𝑁 bits; 64-QAM for 𝑁1 bits and 4-QAM for one bit, and so forth will have higher bandwidth efficiencies compared to 16-QAM for all bits and hence will be considered in the optimization process.(2)The modulation level assigned to most significant bits is less than or equal to that of lower significance bits, that is, 𝑀𝑖𝑀𝑗 where 𝑖<𝑗;𝑖,𝑗=0,1,,𝑁1. This condition ensures that most significant bits (most important for image quality) have more protection against channel errors since they use a lower modulation level compared to lower significant bits.

The above two conditions will significantly reduce the number of combinations to be considered to 𝐿𝑠. For example, with 𝑁=8 only 𝐿𝑠=25 out of the total 𝐿=6561 combinations satisfy the two conditions mentioned above and hence will result in less complexity in the optimization process.

The problem now is reduced to finding the optimum power vector 𝑤opt,𝑠 where 𝑠=0,1,,𝐿𝑠1 for each of these modulation level combinations as will be discussed in next section.

3.2. Optimum Solution

The problem at hand can be stated as follows. For every combination of modulation levels in the subset 𝐿𝑠 defined in the previous section, we would like to find the optimum power vector 𝑤opt,𝑠 where 𝑠=0,1,,𝐿𝑠1 such that the MSE is minimized subject to the constraint that the average energy per bit is kept constant. We use the Lagrange method to find a global minimum for the multivariable MSE function 𝑓(𝑤) obtained by substituting (9) in (7) subject to a constraint 𝑦(𝑤)=𝑁1𝑘=0𝑤𝑘=𝑁. Note that this constraint guarantees that the average energy per 𝑁 bits is 𝐸𝑏; that is, the system has the same average energy as the conventional equal power scheme. As demonstrated in the Appendix, the MSE function 𝑓(𝑤) as a function of 𝑤𝑘,𝑘=1,,𝑁 is well approximated by a convex function for high 𝐸𝑏/𝑁0values.

We start by finding all values of 𝑤 and 𝜆 such that𝑓𝑤=𝜆𝑦𝑤,𝜕𝑓𝑤𝜕𝑤𝑗=𝜆𝜕𝑦𝑤𝜕𝑤𝑗=𝜆(11) for 𝑗[0,𝑁1] where 𝜆 is the Lagrange multiplier. The MSE function for 𝑀-QAM in AWGN is given by𝑓𝑤=12𝑁12𝑁1𝑖=1𝑖2×𝑁1𝑘=0𝑎𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0𝛾𝑖0(+𝑘)1𝑎𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0𝛾𝑖0.(𝑘)(12) The partial derivative of the MSE function can be written as𝜕𝑓𝑤𝜕𝑤𝑗=12𝑁12𝑁1𝑖=1𝑖2𝜕𝜕𝑤𝑗×𝑁1𝑘=0𝑎𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0𝛾𝑖0(+𝑘)1𝑎𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0𝛾𝑖0.(𝑘)(13)

Using the fact that 𝜕/𝜕𝑥[𝑢erfc(𝑣𝑥)]=(𝑢𝑣/𝜋𝑣𝑥)𝑒𝑣𝑥, we can further reduce (13) into𝜕𝑓𝑤𝜕𝑤𝑗=12𝑁12𝑁1𝑖=1𝑖2𝑁1𝑘=0𝜒𝑘𝑗𝛾𝑖0(𝑘)+𝜒𝑘𝑗𝛾𝑖0(𝑘),(14) where,𝜒𝑘𝑗=1𝜋𝑏𝑘𝑤𝑘𝐸𝑏/𝑁0𝑎𝑘𝑏𝑘𝐸𝑏𝑁0𝑒𝑏𝑘𝑤𝑘𝐸𝑏/𝑁0𝑎if𝑘=𝑗,𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0if𝑘𝑗,𝜒𝑘𝑗=1𝜋𝑏𝑘𝑤𝑘𝐸𝑏/𝑁0𝑎𝑘𝑏𝑘𝐸𝑏𝑁0𝑒𝑏𝑘𝑤𝑘𝐸𝑏/𝑁0if𝑘=𝑗,1𝑎𝑘erfc𝑏𝑘𝑤𝑘𝐸𝑏𝑁0if𝑘𝑗.(15) Substituting (14) in (11), we end up with 𝑁 vector equation, in addition to the constraint equation, to be solved. This vector equation is explicitly expressed as12𝑁12𝑁1𝑖=1𝑖2𝑁1𝑘=0𝜒𝑘𝑗𝛾𝑖0(𝑘)+𝜒𝑘𝑗𝛾𝑖0([].𝑘)=𝜆𝑗0,𝑁1(16) The solution of the above problem results in an optimum power vector 𝑤opt for a specific combination of modulation levels in the 𝐿𝑠 set of possible combinations. To find the global optimum power vector, the above problem is solved for each combination. Finally, a total of 𝐿𝑠 solutions for the optimum power vector are obtained, and only solutions that result in a PAPR less than a present threshold are used to find the global optimum solution as described below.

It is desirable to choose the optimum power vector and modulation levels that will result in the maximum PSNR and maximum bandwidth efficiency. However, some of the solutions will maximize the PSNR but with small gain in bandwidth efficiency while others may result in small gain in PSNR with higher improvement in bandwidth efficiency. This is illustrated in Table 1. Thus, there are two practical ways that can be used in making the choice:(1)set a target fixed PSNR to be achieved and select the solution with highest bandwidth efficiency;(2)set target fixed bandwidth efficiency and choose the solution that provides the highest PSNR.

Note that the obtained solution is valid for a particular value of 𝐸𝑏/𝑁0 and optimization is needed for all expected operating range of 𝐸𝑏/𝑁0. In other words, the optimum power vector and modulation level combination for an 𝐸𝑏/𝑁0=0dB are different from those for 𝐸𝑏/𝑁0=5dB, and so forth. Finally, a look-up table that has the different values of 𝐸𝑏/𝑁0 and corresponding optimum vector is calculated and used for setting the transmission parameters of the bit streams. We stress that the above operation is done off line without any changes made to the look-up table during normal system operation regardless of the image to be transmitted. This can be explained by the observation that the MSE and PSNR will be the same regardless of the value of the pixel transmitted, and hence the optimum solution will be the same for any image. However, this does not imply that the quality is the same for different images. All the sender needs to do is to get the operating 𝐸𝑏/𝑁0 from the receiver (through a feedback channel) and select the solution from the already stored look-up table. Since a copy of the look-up table can be prestored at the receiver, the receiver does not require any further information from the transmitter about the modulation levels. While the used modulation level can be communicated to the receiver as a preamble, recall that the receiver feeds back the average SNR to the transmitter which in turn uses it to decide the power and modulation levels. Thus, it is more natural for the receiver to maintain a look-up table similar to the one used by the transmitter. Using this look-up table, the receiver can deduce the modulation level used by the transmitter. This is expected to add a very little complexity to the receiver when compared to a conventional receiver with fixed modulation.

In the following section, we describe an iterative procedure for solving the above optimization process through an exhaustive search through all combinations of power assignment and modulation levels. We will compare the results obtained from this exhaustive search procedure with that obtained from the analytical derivation to validate their accuracy.

3.3. Exhaustive Search Algorithm

To verify the optimality of the method described in Section 3.2, a numerical iterative exhaustive search algorithm was implemented as follows.

Initialization:(1)Initialize the power weight vector to all ones (assume the energy is the same for all bits, that is, 𝑤𝑖=1 for 𝑖=0,1,,𝑁1).(2)Calculate the MSE using (7) and (9) for a given 𝐸𝑏/𝑁0.(3)Define two bits, 𝐵 is borrowing power bit and 𝐷 is donating power bit.(4)Set the maximum limit of the PAPR to PAPRmax and set the energy step size to Δ𝐸𝑏.

Updating:(1)Set 𝐵=𝑁1 (MSB as borrower) and 𝐷=0 (LSB as donor).(2)Reduce the energy for the 𝐷 bit by Δ𝐸𝑏 and increase the energy of the 𝐵 bit by the same amount such that during the 𝑛th iteration we have 𝐸𝑏𝐷(𝑛)=𝐸𝑏𝐷(𝑛1)Δ𝐸𝑏,𝐸𝑏𝐵(𝑛)=𝐸𝑏𝐵(𝑛1)+Δ𝐸𝑏.(17) Note that within a block of 𝑁 bits, the minimum energy per bit is zero, and the maximum energy per bit is 𝑁𝐸𝑏, where 𝐸𝑏 is the average energy per bit.(3)Recalculate the MSE using (7) and (9) and keep changing the energy of the two bits until you find the minimum value of MSE while the PAPR is kept less than PAPRmax.(4)Repeat the same procedure in (2) and (3) above with the borrower bit kept at 𝐵=𝑁1 while the donor bit 𝐷 is incremented by one until all other bits are used.(5)Next, reduce the borrower bit by one to optimize the second most significant bit (𝐵=𝑁2) and repeat steps (2) till (4).(6)The above steps are repeated until all bits are optimized, that is, 𝐵=0.(7)During every iteration, the minimum MSE is searched for, and PAPR is ensured to be within the limit of PAPRmax. The optimum power vector is obtained after all bits have been utilized.

The above process is repeated for all modulation level combinations in the set 𝐿𝑠 and for every value of 𝐸𝑏/𝑁0 to form the look-up table as explained before. Finally, the image signal is transmitted with this optimized power and modulations levels.

It is important to make the following observations about the proposed algorithm. (a) The algorithm is implemented only once and thus the optimum power allocation can be precomputed in advance for each value of 𝐸𝑏/𝑁0. (b) The proposed scheme is guaranteed to converge to the combination that gives the minimum MSE because it performs an exhaustive search. Furthermore, there is no issue with convergence speed since this operation is done off line and no need to recalculate the power and modulation levels after building the look-up table.

3.4. Flat Rayleigh Fading Case

In mobile radio systems, the transmitted signal undergoes multipath propagation due to reflection, refraction, and scattering of radio waves in the wireless channel. This leads to the fluctuation of the received signal amplitude in a random manner; usually following a Rayleigh distribution for non-line-of-sight cases. The rapidly varying amplitudes result in variation of the instantaneous received signal-to-noise ratio as represented by the 𝐸𝑏/𝑁0. Hence, the proposed joint power and modulation level optimization needs to be modified to take into consideration the probability of error of the modulation scheme in Rayleigh fading in order to get correct estimates of the MSE and build the proper look-up table.

The best case would be obtained if the instantaneous value of the received 𝐸𝑏/𝑁0 is fed back to the transmitter to select the appropriate parameters (power and modulation levels) from the already constructed look-up table. However, such scenario requires not only a reliable but also a fast feedback link that is much faster than the variation in the received signal caused by the radio channel. In what follows we give an example of how fast the feedback channel should be. Consider a mobile terminal with a speed of 100km/h and a carrier frequency of 2GHz. As a result, the maximum Doppler frequency 𝑓𝐷 is given by 𝑓𝐷=𝑣𝑓𝑐/𝑐=((100×103)/3600)×((2×109)/(3×108))=185 HZ. Thus, the coherence time of the channel is roughly 1/𝑓𝐷=5.5ms. Therefore, assuming that the 𝐸𝑏/𝑁0 values require 3 to 4 bits to be encoded, a feedback link with a data rate of 600 to 800 bps is needed. We also would like to highlight the fact that the reliability of the feedback link could always be improved using a strong forward error correction (FEC) scheme. Obviously, the expected overhead of the needed FEC scheme will be minimal due the small size (number of bits) of the fed back information. Furthermore, to reduce the complexity and computational overhead, we construct the look-up table based on the average 𝐸𝑏/𝑁0 and require the receiver to send back the average 𝐸𝑏/𝑁0 rather than the instantaneous 𝐸𝑏/𝑁0. In Section 4, the simulation results show that the performance of the proposed scheme when based on the average 𝐸𝑏/𝑁0 is very close to the performance of which when based on the instantaneous 𝐸𝑏/𝑁0. In Section 4, we have also studied the sensitivity of the proposed system to possible errors in the feedback link.

To generate the look-up table for the average 𝐸𝑏/𝑁0-based scheme, we start by noting that the bit error probability for 𝑀-QAM in Rayleigh fading is given by𝑃𝑘1log2𝑀𝑘1𝛼𝑘𝑤𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁0,𝛼𝑘=3log2𝑀𝑘4𝑀𝑘1.(18)

Using (18) in (7) we obtain the MSE function for 𝑀-QAM modulation in Rayleigh fading as𝑓𝑤=12𝑁×12𝑁1𝑖=1𝑖2𝑁1𝑘=01log2𝑀𝑘1𝛼𝑘𝑤𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁0𝛾𝑖0+1(𝑘)1log2𝑀𝑘1𝛼𝑘𝑤𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁0×𝛾𝑖0(𝑘)(19)

Starting from the previous expression and following the same procedure explained in Section 3.2, we end up with the same vector equation in (16) to be solved but with the following coefficients given below:𝜒𝑘𝑗=12log2𝑀𝑘𝛼𝑘𝛼𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁031if𝑘=𝑗,log2𝑀𝑘1𝛼𝑘𝑤𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁0if𝑘𝑗,𝜒𝑘𝑗=12log2𝑀𝑘𝛼𝑘𝛼𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁031if𝑘=𝑗,1log2𝑀𝑘1𝛼𝑘𝑤𝑘𝐸𝑏/𝑁01+𝛼𝑘𝑤𝑘𝐸𝑏/𝑁0if𝑘𝑗.(20) In addition, the same exhaustive search algorithm presented in Section 3.3 can be used by employing (18) instead of (9) in step 3 in the updating part, and the input to the look-up table is the average 𝐸𝑏/𝑁0 rather than the instantaneous 𝐸𝑏/𝑁0.

4. Analytical and Simulation Results

In this section, we present analytical and simulation results for the PSNR obtained using the proposed scheme for both AWGN and Rayleigh fading channels. The conventional equal-power 16-QAM is used as a reference to demonstrate both the improvement in PSNR and bandwidth efficiency. The number of bits per sample (pixel) is 𝑁=8, and the number of modulation levels can be either 4, 16, or 64. We also investigate the degradation in PAPR compared to the conventional 16-QAM system as we optimize the power level of different bits. Initially, we consider an image transmission over AWGN channel, with 𝐸𝑏/𝑁0=3dB, as shown in Figure 3. It is obvious that the proposed jointly optimized power and modulation levels result in a superior performance in terms of image quality when compared to the conventional equal power 16-QAM and the power optimized 16-QAM schemes.

The figure also shows different images transmitted using the proposed scheme without altering the look-up table. The improvement is maintained and hence supports our initial claim that the proposed optimization scheme is image independent and thus has low complexity. This low complexity results from the fact that the optimization is done in terms of the PSNR and hence is independent of the transmitted image. This means that the proposed algorithm can be used for different images with no changes. It is worth noting that the three images in Figure 3 have the same PSNRs; however, the quality of the images looks different due to the inherent limitations of PSNR as a quality measure [30]. This limitation is explained by the fact that PSNR is a full-reference metric that requires an a priori knowledge of the original image which is typically not available at the receiver side. In addition, it is known that PSNR values are not necessarily correlated with acceptable perceptual quality; that is, a reconstructed image with perceptually acceptable quality could be associated with a relatively low PSNR. Therefore, we argue that PSNR besides being full-reference metric is not enough to assess the received image quality in the presence of transmission errors.

To quantify the improvement of the proposed scheme over fixed power conventional systems, we calculate a gain parameter that indicates how much less 𝐸𝑏/𝑁0 the proposed scheme needs compared to the conventional system in order to achieve the same PSNR. The gain in 𝐸𝑏/𝑁0 achieved by the proposed power optimized 16-QAM system over the conventional equal power 16-QAM system is shown in Figure 4. Different limits on the PAPR were set to see the maximum gain achievable by the optimization process and at what expense in terms of the increase in PAPR. We see that a gain of about 2 dB in 𝐸𝑏/𝑁0 is obtained as we relax the limit in PAPR from 0dB (no increase) to no limit on the PAPR where the highest gain from the optimization can be achieved. We also note that most of the gain is achieved with an increase of about 6dB in PAPR. It is observed that the gain is reduced as the PSNR increases (high 𝐸𝑏/𝑁0) and the equal power allocation scheme results in optimum performance; that is, at high 𝐸𝑏/𝑁0 all bits would be important and need to be equally protected.

Similar observations for the case of joint power and modulation level optimized system can be seen from Figure 5, however with much higher gain (about 6.5dB). The combination is used in this figure with modulation levels of (64,64,64,64,16,16,4,4). Note that with the limit on PAPR set to 0dB, that is, no power optimization, the gain (about 3dB) is mainly due to variable modulation level. More gain is achieved as we relax the PAPR constraint and optimize the power as well. One other advantage of the variable 𝑀-QAM is the improvement in bandwidth efficiency compared to the 16-QAM system. This can be quantified by noticing that the average modulation level with optimum performances is 22.5 (from Table 1) which results in a gain of about 12.5% in bandwidth efficiency. Thus, Figures 4 and 5 indicate that most of the gain (about 6.5 dB in PSNR) is achieved when variable modulation is used with an increase of about 5 dB in PAPR. However, if the system has more conservative limit on the PAPR then a smaller gain (about 3 dB) is obtained with the limit of 0 dB on PAPR. It is worth noting that the PAPR increases because we are changing the amplitude of the transmitted bits to optimize the quality of the reconstructed image.

Figure 6 shows the PSNR for image transmission over an AWGN channel with 8 bits per sample, 𝑁=8, as a function of the 𝐸𝑏/𝑁0. The results demonstrate the validity of the proposed optimization scheme using the Lagrange method since it has excellent agreement with the exhaustive search procedure. Both 16-QAM fixed modulation with optimized power only and joint power and modulation level optimization cases are presented. The PSNR for a nonoptimized 16-QAM system is also shown to demonstrate the improvement in PSNR of the proposed scheme. For instance, at 𝐸𝑏/𝑁0=6dB, gains in PSNR of 10 and 20dB are achieved by the power optimized 16-QAM system and the joint power and modulation level optimized system, respectively.

As we explained earlier, the proposed scheme uses a look-up table to select the best combination of power and modulation level for each bit. The only input needed for the look-up table is 𝐸𝑏/𝑁0. This is expected to be provided through a feedback channel from the receiver to the sender. In any real implementation, there will be some error in providing such feedback. For example, suppose the actual 𝐸𝑏/𝑁0=3dB but the feedback has an error of +2 dB. Then, the sender will use the entries of the look-up table for 𝐸𝑏/𝑁0 corresponding to 5 dB and there will be some loss in performance. Figure 7 shows the sensitivity of the PSNR performance to such errors. As shown in the figure, for 𝐸𝑏/𝑁0=3dB, both the power optimized and joint power and modulation level optimized schemes can tolerate an error of ±5 dB without noticing any significant degradation in the PSNR. However, for 𝐸𝑏/𝑁0=10dB, they can tolerate an error of ±3 dB. Yet, both schemes are better than the conventional scheme even with an error of ±10 dB. Figure 7 also shows that the PSNR performance of the proposed system is not very sensitive to estimation errors. It can actually tolerate errors up to ±5 dB. Note that the proposed scheme always outperforms conventional systems even in the presence of large estimation errors. This shows that the system is robust to errors in estimating 𝐸𝑏/𝑁0.

The PSNR performance in a flat Rayleigh fading channel is presented in Figure 8. Both Lagrange method and the exhaustive search iterative algorithm gave the same performance in fading, which proves the accuracy of the analytical optimization in the fading case. The power optimized 16-QAM and the joint power and modulation level optimized schemes provide a gain in 𝐸𝑏/𝑁0 of about 8dB and 20dB over the conventional equal power 16-QAM, respectively. The figure also shows that the scheme based on feedback of average 𝐸𝑏/𝑁0 has a relatively small degradation in performance (about 1dB in 𝐸𝑏/𝑁0) compared to the case of using instantaneous 𝐸𝑏/𝑁0 and the degradation vanishes at high 𝐸𝑏/𝑁0. This shows that we can use the average 𝐸𝑏/𝑁0-based scheme which has a significant reduction in complexity without paying much price in terms of PSNR performance.

Finally, we would like to clarify the fact that the online operation of the proposed scheme is distributed between the receiver and the transmitter. At one side, the scheme only requires the receiver to feedback the average SNR such that the transmitter decides the optimum combination of power allocation and modulation level for this SNR using the already built look-up table. Clearly, at the receiver side this is a minor operation with very low complexity. On the other side, the transmitter uses the value of the SNR as fed back by the receiver to look-up the optimum power allocation and modulation level which is almost of negligible complexity. The main source of complexity is the offline processing in which we build the look-up table that is used in the selection process of power and modulation levels. In this work we assumed that the transmitter is responsible for building this table. However, this operation is done only once and is independent of the image being transmitted as demonstrated in Figure 3. We also argue that the algorithm is guaranteed to converge. This is actually true because the proposed algorithm performs an exhaustive search among all possible combinations (there are 6561 possible combinations for 3 modulation levels and 8 bits per sample). Then it selects the combination that gives the highest PSNR. It is worth noting that the complexity associated with the exhaustive search could be further reduced by only searching a feasible subset among the possible combinations. As we have already shown in Section 3.1, this approach was adopted in the proposed scheme where we had to search a subset of 25 possibilities. This subset was decided to ensure that both the bandwidth efficiency and PSNR are maximized.

5. Conclusions

In this paper, an algorithm for joint optimization of transmitted bit power and modulation level according to the importance of the bit to the image quality was presented. The optimization was based on minimizing the mean-square error (MSE) or equivalently maximizing the peak signal-to-noise ratio (PSNR) of the reconstructed image. The optimality of the proposed scheme was derived using the Lagrange method and verified through exhaustive numerical search algorithm. Improvements in PSNR of 10 to 20dB were demonstrated for a modulation scheme with 4,16, or 64 levels QAM over a conventional fixed power 16-QAM system. The proposed scheme also improved the bandwidth efficiency and hence the rate of data transmission. Finally, practical issues in terms of increase in peak to average power ratio and sensitivity of system operation to errors in 𝐸𝑏/𝑁0 feedback were addressed.

Appendix

On the Convexity of the MSE

In what follows, we prove that the multivariable MSE function 𝑓(𝑤), as a function of 𝑤𝑘,𝑘=1,,𝑁, is well approximated by a convex function so that we can use the Lagrange method to find the global minimum. This is actually true for high 𝐸𝑏/𝑁0 values where the probability that the 𝑘th bit is in error denoted by 𝑃𝑘, is very small. As a result, the quantity (1𝑃𝑘) is assumed to be very close to 1. Thus using (1𝑃𝑘)1, (7) reduces toMSE012𝑁12𝑁1𝑖=1𝑖2𝑁1𝑘=0𝑃𝑘𝛾𝑖0(𝑘)+𝛾𝑖0(𝑘).(A.1) To establish the proof that the cost function is convex, we use the following properties of log-convex functions (pages: 104–106 [31]).(i)Log convexity is preserved under multiplication, addition, and integration.(ii)A log-convex function is also convex.

Based on the abovementioned properties, to prove that the MSE0 is convex in 𝑤𝑘,𝑘=1,,𝑁, it is sufficient to prove that the probability 𝑃𝑘 is log convex in 𝑤𝑘. This is demonstrated for the two cases of instantaneous 𝐸𝑏/𝑁0-based scheme and the average 𝐸𝑏/𝑁0-based scheme.

Instantaneous 𝐸𝑏/𝑁0-based scheme.

In this scheme, using Craig’s formula for the error function as given in [32], (9) which is the probability that the 𝑘th bit is in error can now be written as𝑃𝑘𝑎𝑘2𝜋02𝜋𝑒(𝑏𝑘𝑤𝑘(𝐸𝑏/𝑁0))/sin2𝜃𝑑𝜃.(A.2)

Using the fact that log convexity is preserved under integration and since 𝑒(𝑏𝑘𝑤𝑘(𝐸𝑏/𝑁0))/sin2𝜃 is log convex in 𝑤𝑘 for all sin2𝜃 (page 104 [31]), as a result the quantity 𝑎𝑘erfc(𝑏𝑘𝑤𝑘𝐸𝑏/𝑁0) is also log convex in 𝑤𝑘.

Average 𝐸𝑏/𝑁0-based scheme:

In this scheme, the probability that the 𝑘th bit is in error is approximated by𝑃𝑘𝑎𝑘𝐸𝑏/𝑁00erfc𝑏𝑘𝑤𝑘𝑥𝑒𝑥/(𝐸𝑏/𝑁0)𝑑𝑥.(A.3)

Since erfc(𝑏𝑘𝑤𝑘𝑥)𝑒𝑥/(𝐸𝑏/𝑁0) is log convex in 𝑤𝑘 for all 𝑥0 as shown above for the Instantaneous 𝐸𝑏/𝑁0-based scheme, 𝑃𝑘 is also log convex in 𝑤𝑘 (integration of log-convex functions). This proves the convexity of MSE in 𝑤𝑘 which motivated us to use the Lagrange method to find the optimal solution.