Abstract

This paper aims to improve energy efficiency of IEEE 802.11 wireless local area networks (WLANs) by effectively dealing with idle listening (IL), which is required for channel sensing and is unavoidable in a contention-based channel access mechanism. Firstly, we show that IL is a dominant source of energy drain in WLANs and it cannot be effectively alleviated by the power saving mechanism proposed in the IEEE 802.11 standard. To solve this problem, we propose an energy-efficient mechanism that combines three schemes in a systematic way: downclocking, frame aggregation, and contention window adjustment. The downclocking scheme lets a station remain in a semisleep state when overhearing frames destined to neighbor stations, whereby the station consumes the minimal energy without impairing channel access capability. As well as decreasing the channel access overhead, the frame aggregation scheme prolongs the period of semisleep time. Moreover, by controlling the size of contention window based on the number of stations, the proposed mechanism decreases unnecessary IL time due to collision and retransmission. By deriving an analysis model and performing extensive simulations, we confirm that the proposed mechanism significantly improves the energy efficiency and throughput, by up to 2.8 and 1.8 times, respectively, compared to the conventional power saving mechanisms.

1. Introduction

Nowadays, wireless local area networks (WLANs), also referred to as Wi-Fi, are widely deployed almost everywhere due to low installation cost and easy accessibility to the Internet. At the same time, most mobile devices (e.g., smart phones and tablets) and wearable devices (e.g., smart watches and smart glasses) are equipped with Wi-Fi interface, and it is expected that Wi-Fi will be increasingly installed in various emerging consumer electronics and embedded systems by keeping pace with the development of the Internet of Things. However, one of the most crucial problems faced in these devices is the limited capacity of energy storage and the significant energy drain by the WLAN interface. The radio interface is a source of weighty energy drain in mobile devices [1]; the WLAN interface card consumes almost half of the total energy [2]. Thus, it is imperative to manage the energy consumption of WLAN interfaces for extending the battery lifetime of mobile devices.

In order to limit the energy consumption at the radio interface, the power saving mode (PSM) was introduced in the IEEE 802.11 standard [3]. In PSM, a mobile station (STA) periodically switches between sleep and active states, where the radio interface is turned off during the sleep state. The energy consumption of an STA deploying PSM can be reduced by controlling the period or frequency of active and sleep states. However, the PSM is effective only in the case where STAs occasionally have frames to receive, and it does not make any significant contribution to energy conservation in the active state. It was shown in [4] that if STAs have a moderate amount of traffic to send/receive, more than 80 percent of the time is spent in idle listening (IL), even though the PSM is deployed. According to the channel access mechanism of WLANs, that is, carrier sense multiple access (CSMA), there are three states of STAs: frame transmission, frame reception, and IL. In the IL state, an STA performs channel sensing to determine whether the channel is idle or busy. Before transmitting a frame, the STA should defer the transmission during a random backoff time. This procedure of channel sensing and backoff is inevitable in the contention-based channel access mechanism. Moreover, the STA spends its time and energy when it receives frames that are not destined to itself, which comprises another part of IL. Let us denote an intended (nonintended) frame as a frame whose destination address is equal to (different from) the address of the STA listening to the channel. Due to the nature of carrier sensing and broadcasting in WLANs, the energy consumption in overhearing a nonintended frame is comparable to that in receiving an intended frame. Hence, the amount of energy consumption in the state of IL can be rather higher than those in the states of transmission and/or reception, especially when the number of STAs is large or their traffic loads are heavy. Meanwhile, a failure of frame transmission due to collision leads to the increase of IL time (ILT) in the STA suffering from the failure, as well as in those of the other STAs sensing the channel. In addition to the energy wastage due to retransmission, a binary exponential backoff (BEB) algorithm invoked by the transmission failure doubles the size of the contention window and accordingly increases the ILT due to backoff. As a result, the ILT is unavoidable due to the nature of the channel access mechanism of WLAN, and it is essential to effectively deal with IL for improving device’s energy efficiency.

This paper aims to conserve the IL-related energy in WLANs. For this purpose, we consider two approaches: reducing the unnecessary energy consumption during ILT and decreasing ILT. To realize these two approaches simultaneously, we propose a combination of three schemes: (i) downclocking, (ii) frame aggregation, and (iii) contention window control. With the downclocking scheme, the clock frequency of the WLAN interface card is lowered from the full operation clock frequency when receiving nonintended frames (we refer to this state as semisleep), similar to [4, 5]. Once the receiving frame is identified as a nonintended frame, the STA switches to the semisleep state until the end of transmission time of the nonintended frame. The gain of downclocking is amplified by introducing the frame aggregation scheme, which is the second component of the proposed mechanism. Since the frame aggregation scheme allows the STA to transmit multiple frames per channel access, it prolongs the semisleep time in the neighbor STAs to conserve more energy by means of downclocking. In addition to improving the energy efficiency, the frame aggregation is effective to improve the transmission efficiency by decreasing channel access overhead per frame, for example, backoff time and header overhead. The third component, contention window control, is designed to deal with the ILT resulting from collision, which becomes severe as the number of STAs competing for the channel access increases. By scaling the size of contention window depending on the number of STAs, the proposed mechanism can mitigate collision, and thus, it can prevent the unnecessary energy consumption due to collision and retransmission. We adopt the basic idea of frame aggregation and contention window control in our previous work [6].

Via rigorous analysis and extensive simulations, it is shown that, compared to the conventional PSM, the proposed mechanism reduces the energy per bit by up to about 2.8 times, while maintaining up to about 80% higher throughput. The following points summarize the notable features of the proposed mechanism and the contributions of this study.(i)The proposed mechanism combines three schemes (downclocking, frame aggregation, and contention window control) in a systematic way without requiring critical changes in the current IEEE 802.11 standard; therefore, it can be a feasible and practical solution for diminishing the unnecessary energy consumption due to IL in WLANs.(ii)As well as improving energy efficiency, the proposed mechanism increases transmission efficiency by employing frame aggregation and contention window control, which decrease the channel access overhead and collision, respectively.(iii)An effective analysis model is derived to evaluate the effectiveness of each component of the proposed mechanism in terms of energy per bit and throughput. In addition, the analysis result confirms that the downclocking scheme alone cannot significantly decrease the energy consumption due to IL and that the effect of downclocking can be quite magnified when it is integrated with the schemes of frame aggregation and contention window control.The rest of this paper is organized as follows. Section 2 gives the background and motivation of this study and addresses the related work. Section 3 provides details of the proposed mechanism, and its analytical model is derived in Section 4. Section 5 presents the simulation results of the proposed mechanism under various conditions. The conclusion follows in Section 6.

2.1. Background of PSM

We consider the PSM in an infrastructure WLAN where STAs transmit/receive frames to/from an access point (AP). According to the PSM, STAs alternate active (or awake) state and sleep (or doze) state, and the AP buffers frames for STAs in the sleep state. AP broadcasts a traffic indication map (TIM) periodically to inform the STAs of the existence of pending frames destined to them, while STAs wake up periodically to listen for the TIM. The TIM is embedded in a beacon frame, and the time difference between two consecutive beacon frame transmissions is called as beacon interval (BI). In the active state, the STA can transmit/receive its frames or sense the channel during backoff and reception of nonintended frames, which accounts for the ILT. The energy consumption of STA in the active state depends upon several factors like traffic load, frame size, transmission rate, and the length of ILT. On the contrary, in the sleep state, the PSM-enabled STA turns its radio interface off and is not capable of transmitting/receiving any frames; thus, the energy consumption in the sleep state can be significantly reduced. In addition to the BI, there is another interval, listen interval (LI), which affects the period and length of active and sleep states. The LI indicates the period at which a PSM-enabled STA wakes up to listen for the TIM. According to the IEEE 802.11 PSM, the LI should be set to multiple of BI. For example, if the LI is set to 3 BIs, the STA listens to the TIM only at every third beacon frame and ignores the other two TIMs. The value of LI is determined by the negotiation between the STA and AP in the procedure of association. If an STA hears the TIM indicating pending frames, it should be in the active state until receiving the pending frames. In order to retrieve the pending frames, the STA initiates a PS-Poll message to the AP after a random backoff time. Once the AP receives the PS-Poll message from the STA, it transmits the data frame to the corresponding STA after a short interframe space (SIFS). One PS-Poll message needs to be transmitted to retrieve one data frame from the AP, followed by an acknowledgment (ACK) from the STA. If there are multiple pending frames that need to be delivered to the STA, the AP sets “More data bit” field in the data frame. Then, the STA initiates another PS-Poll message to receive its additional frame from the AP. After receiving all the pending frames, the STA switches to the sleep state to conserve energy. If the STA hears the TIM indicating no pending frame, it can sleep immediately after receiving the TIM. It is important to note that the PS-Poll message is transmitted in a contention-basis manner; that is, the PS-Poll message cannot be transmitted as soon as the STA recognizes the existence of pending frames by receiving the TIM, but it can be transmitted after competing with other STAs (or APs) transmitting data frames or PS-Poll messages. Therefore, this contention incurs an unavoidable delay in transmitting the PS-Poll message, which increases the ILT and degrades the energy efficiency in the active state.

2.2. Motivation

The PSM can save energy by allowing STAs to remain in the sleep state, as explained in Section 2.1, and there have been lots of studies in the literature to optimize the performance of PSM by tuning parameters like BI and LI and by enhancing buffer management in the AP [7, 8]. However, the PSM cannot make any contribution to energy efficiency when STAs in the active state retrieve the pending frames and it cannot effectively deal with the ILT due to the nature of contention-based channel access of IEEE 802.11 WLANs, which becomes severer as the number STAs increases.

In order to quantify the amount of energy spent due to the ILT, we perform a simple simulation. We consider that one PSM-enabled STA coexists with several background (non-PSM-enabled) STAs, whose number (denoted as ) varies from 2 to 20. We set BI and LI to 100 ms and 2 BIs, respectively. In the simulation, we assume that the background STAs always have data frames to transmit, while the PSM-enabled STA always has data frames to retrieve in the first BI within the LI, but it has no frame to retrieve in the other BI within the LI (with this artificial configuration, we can make a balance between active time and sleep time in the PSM-enabled STA. In Section 5, we will investigate the effect of traffic load via various simulations). Details about simulation configuration can be found in Section 5. Note that the PSM-enabled STA has three states: IL, sleep, and active (transmission and reception) and that the power consumed due to transmission is comparable to that due to reception [2, 4, 9]. Figure 1 compares the total energies consumed in these three states, denoted as , , and , respectively. From Figure 1, we can observe that the STA, even with the PSM, consumes most of its energy due to IL; for example, when = 10, exceeds 7.1 J, which is higher than by more than 6 times and corresponds to about 80% of total energy consumption. In addition, as increases, increases but decreases; that is, the ILT increases and energy efficiency degrades with respect to the increase in the number of competing STAs. On the other hand, is almost constant, regardless of , because hardly affects the sleep time under this simulation configuration.

2.3. Related Work

Firstly, we address features of several power saving mechanisms adopted in the IEEE 802.11 standards. Then, we focus on studies dealing with the IL energy.

To improve energy efficiency and quality of service (QoS), an enhanced power saving mechanism, Unscheduled Automatic Power Save Delivery (U-APSD), was introduced in the standard of IEEE 802.11e [10]. Unlike PSM, an STA under U-APSD does not have to wake up periodically to listen for the TIM, but it transmits a trigger frame (QoS data frame or null frame) at any time to retrieve the pending frames in the AP buffer. On receiving the trigger frame from the STA, the AP firstly acknowledges it, and then if the AP has pending frames, the AP performs the channel access procedure and transmits several pending frames to the STA successively within the transmission opportunity (TXOP) limit. After receiving all the frames from the AP, the STA can go back to sleep. Since several frames can be retrieved by a single trigger in U-APSD, the energy efficiency can be improved compared to the PSM. In determining the frequency of trigger frame transmission, there occurs a trade-off; the frequent triggering can reduce the retrieval delay of pending data frame, but it may increase collision and waste airtime or energy due to unnecessarily frequent trigger frames [9, 11].

Recently, IEEE 802.11n introduced a scheduling-based power saving mechanism, power save multipoll (PSMP) [12]. This mechanism allows an AP to schedule downlink and uplink transmissions for multiple STAs by broadcasting a single PSMP frame. On receiving the PSMP frame, each STA becomes aware of when to wake up to receive/transmit data frames. Thanks to the scheduling-based operation, the PSMP mechanism can improve the energy efficiency by avoiding the ILT spent in sensing the channel and receiving nonintended frames, and at the same time, it relieves the overhead of individual polling or triggering by utilizing the PSMP frame. However, the PSMP mechanism has a few drawbacks. The overhead of PSMP management frames and the implementation complexity increase as the number of STAs increases. Moreover, if the PSMP frame is corrupted or misinterpreted due to any reason (e.g., wireless channel error or interference), the scheduled airtime may be wasted or some STAs may remain awake throughout the whole PSMP duration [13, 14].

Moreover, many mechanisms have been proposed to reduce the energy dissipation in the ILT. Considering that the energy is the product of power and time, they can be classified into two categories: (i) decreasing power in the ILT [4, 1517] and (ii) decreasing the ILT [1822]. The first approach makes STAs switch to a low-power state (e.g., semisleep) in the ILT, without attempting to decrease the ILT, whereas the second approach aims to minimize ILT resulting from channel access, collision, or reception of nonintended frames.

The mechanism proposed in [15] employs the ready-to-send/clear-to-send (RTS/CTS) control messages to limit the energy expenditure due to overhearing nonintended frames. If an STA overhears an RTS or CTS frame that is not destined to it, the STA forces its radio interface to a low-power idling state for the duration specified in the RTS or CTS frame. In addition, the mechanisms proposed in [4, 16] adopt a similar approach to that of [15]. The mechanism in [4] proposes a new preamble where the destination address is embedded and proposes a novel detection technique so that the STAs can accurately detect nonintended frames even with the reduced clock frequency. Similarly, the mechanism in [16] detects nonintended frames based on the information of MAC header and proposes adding a checksum field in the MAC header to decrease the probability of false detection. The energy-efficient distributed access mechanism proposed in [17] aims to reduce energy consumption during the backoff procedure, as while overhearing nonintended frames by forcing the STA to remain in a low-power state during the backoff procedure.

The energy efficiency can also be improved by decreasing ILT. The micropower management mechanism proposed in [18] lets STAs sleep for a very short idle time (in the order of microseconds) by leveraging a prediction mechanism of frame arrival, so that the short ILT is transformed to sleep time. Similar to PSMP, the studies in [1922] propose a scheduling-based or reservation-based approach to avoid unnecessary ILT. The mechanism in [19] proposes dividing the frame into a control subframe for channel contention and data subframe for data transmission, so that it can be free from unnecessary energy dissipation for accessing the channel and for receiving nonintended or collided frames. In the approaches of [20, 21], the time is slotted and reserved after exchanging control frames between transmitter and receiver [20], or the AP schedules the transmission order/duration and assigns the slotted time to a specific STA [21]. In this way, only STAs involved in communication wake up during the designated time, and other STAs can go to sleep. The mechanism in [22] proposes a prioritized reservation scheme, with which the contention among PS-Poll messages can be avoided and STAs are allowed to retrieve data frames at the reserved time.

Compared to these previous approaches, our mechanism has several advantages and desirable properties as follows.(i)The approach of decreasing power in the ILT [4, 1517] has a limited effect on energy efficiency, especially when the transmission rate is high or frame size is small. We validate the weakness of this approach via analysis and simulation and propose an effective and simple solution to magnify the benefit of this approach.(ii)The approach based on scheduling or reservation [1922] requires significant modifications of the channel access mechanism in the current IEEE 802.11 WLANs, such that it cannot maintain a backward compatibility. Our mechanism can be practically implemented without violating the compliance with the IEEE 802.11 standard.(iii)None of the previously discussed approaches [4, 10, 12, 1522] deal with collision and retransmission, which is another primary source of energy dissipation and performance degradation. Our mechanism incorporates an effective solution to collision for energy efficiency and performance enhancement.

3. Proposed Mechanism

In order to conserve energy consumption due to IL, the proposed mechanism systematically combines three schemes: (i) downclocking, (ii) frame aggregation, and (iii) contention window control. The first scheme, downclocking, lowers the clock frequency of radio interface during the ILT. When detecting nonintended frames, the STA switches to the semisleep state by decreasing the clock frequency without impairing the function of carrier sensing. The second scheme, frame aggregation, is designed to boost the gain of downclocking by prolonging the time of semisleep. The frame aggregation mechanism lets STAs transmit multiple frames per channel access; thus, it extends the semisleep time of neighbor STAs overhearing nonintended frames and decreases the ILT required for channel sensing in the transmitting STAs. Lastly, the third scheme, contention window control, contributes to the decrease of collision among STAs. Therefore, it can avoid unnecessary energy waste due to collision and retransmission. In this way, the proposed mechanism not only reduces the power in the ILT but also decreases the length of ILT. Moreover, the proposed mechanism does not require any significant changes in the current standard of IEEE 802.11. The following subsections explain the details of each scheme in turn.

3.1. Downclocking in Idle Listening Time

The objective of downclocking is to curtail unnecessary expenditure of energy during the ILT. To explain the basic concept of downclocking, the circuitry of the radio interface needs to be considered. The primary source of energy consumption in the wireless interface card is the clock used for sampling of the incoming signal. The power consumption of the wireless interface card, , follows [23], where is the supplied voltage and is the frequency of clock. Based on this equation, there is a linear relationship between and . However, in a practical system, the actual power is not exactly proportional to the frequency, because of the power consumed by other parts of the device. In [4, 5], the idea of reducing the clock frequency of processor in the wireless interface card was introduced to decrease the energy consumption during the ILT. The authors in [4] validated this idea via real experiments; the energy consumption during ILT is reduced to almost half when the clock frequency is reduced to 1/4 of the original clock frequency. In addition, it was shown in [4] that the switching time between full clock frequency and 1/4 clock frequency is about 130–150 microseconds. These results support the technical ground for energy conservation in the ILT by means of downclocking.

In this work, we adopt the downclocking technique to reduce energy consumption in the STA overhearing nonintended frames. To perform the operation of downclocking, the STA has two requirements: (i) distinguishing nonintended frames from intended frames and (ii) obtaining the information on the transmission duration of nonintended frames. Whenever receiving (or overhearing) an incoming frame, whether intended or not, an STA reads the PHY header of the frame. The transmission duration can be calculated from “LENGTH” and “RATE” fields in the PHY header, which represent the frame size and transmission rate, respectively. After determining the duration of the incoming frame, the STA checks whether the frame is an intended frame or nonintended frame by reading the “Address1” field in the MAC header, which indicates the MAC address of the receiver. Then, the STA makes a decision on downclocking. Let us denote and as the duration of incoming frame and the transition time required for switching receiving state and semisleep state, respectively. The STA switches its state to semisleep state by lowering the clock frequency by 1/4 if the following two conditions are satisfied: (i) the receiver address is neither its MAC address nor a broadcast/multicast address and (ii) is larger than . Otherwise, the STA does not change its clock frequency. The semisleep state is terminated (i.e., the clock frequency is returned to the original clock frequency) earlier than the expiration time of by the time amount of . As a result, the duration of semisleep state, denoted as , becomeswhere is the time spent in reading the address field of MAC header. If the PHY or MAC header of an incoming frame is corrupted, the STA cannot correctly obtain the information required for semisleep, and it discards the incoming frame and does not switch to the semisleep state. However, the probability of PHY/MAC header corruption is insignificant because its size is usually smaller than that of payload by three orders of magnitude. Therefore, the corruption of PHY/MAC header does not make any critical adverse effect on the operation of power saving and network performance. Figure 2 shows the operation of downclocking with frame aggregation, which will be explained in the following subsection.

3.2. Frame Aggregation

We devise the frame aggregation scheme to amplify the gain of downclocking by prolonging the semisleep time. As shown in (1), the semisleep time, , is determined by the receiving time of the nonintended frame, , which depends on the frame size and transmission rate. Therefore, if the frame size is small or the transmission rate is high, the gain of downclocking becomes insignificant. This deficiency can be overcome by the frame aggregation. Frame aggregation in WLANs was originally introduced to enhance the channel efficiency by reducing several MAC layer overheads including backoff time, header overhead, and ACK transmission time. According to the IEEE 802.11n standard [12], there are two types of frame aggregation: aggregation of MAC service data unit (A-MSDU) and aggregation of MAC protocol data unit (A-MPDU). When aggregating multiple frames, A-MSDU causes less header overhead (i.e., a single PHY/MAC header for all the frames) than A-MPDU (i.e., a common PHY header for all the frames and individual MAC header for each frame). However, A-MPDU outperforms A-MSDU in an error-prone wireless channel because the former selectively retransmits only the corrupted frames, while the latter has to unnecessarily retransmit all the frames even though only a frame is corrupted. Both schemes, as well as the combination of A-MSDU and A-MPDU or any other frame aggregation scheme like [24], can be easily integrated with the downclocking mechanism, but we consider A-MPDU in this paper due to its robustness to channel error and simple implementation.

We propose that the number of frames to be aggregated and transmitted per channel access (called as aggregation factor (AF)) should be proportional to the transmission rate of each STA. Defining the transmission rate as , AF can be simply calculated asHere, is a positive scaling factor, represents the round-off value of , is the minimum value of transmission rate defined in the IEEE 802.11 standard (e.g., 6 Mb/s for IEEE 802.11a/g), and is the number of frames backlogged in the transmission queue when it gets the chance of channel access. The underlying rationale behind (2) is as follows: (1)To increase the semisleep time for downclocking, the frame aggregation scheme aims to increase the network throughput and to maintain per-STA temporal fairness. By setting in proportion to , each STA occupies a comparable amount of airtime, regardless of its transmission rate, and the network throughput can be increased in multirate WLANs [6].(2)If the STA does not have a sufficient number of frames to aggregate, that is, , the STA just constructs the aggregated frame with frames without waiting for additional frame arrivals. This approach avoids an unnecessary delay due to frame aggregation.We need to discuss the effect of on the performance of the proposed mechanism. As the value of increases, the downclocking scheme conserves more energy due to IL, and at the same time, the frame aggregation scheme further improves the channel efficiency. However, the longer transmission time due to larger value of makes the transmission of the aggregated frame liable to be failed due to time-varying channel quality, and it degrades the short-term fairness among STAs. In addition, there exists a limit on the maximum size of aggregated frame as per IEEE 802.11 standard (e.g., 64 KB). Taking this trade-off into account, the recommended value of ranges between one and three.

3.3. Contention Window Control

In addition to the downclocking and frame aggregation schemes, we propose controlling the contention window size of STAs, in order to reduce the ILT unnecessarily spent due to the collision. The ILT in the IEEE 802.11 WLANs is comprised of three parts: (i) time spent in receiving nonintended frame, (ii) time spent in sensing the channel for frame transmission, and (iii) time spent in receiving any collided frame. The first part of ILT can be dealt with the downclocking scheme proposed in Section 3.1, while the second part of ILT can be decreased by the frame aggregation scheme proposed in Section 3.2. If a collision happens between nonintended frames, an overhearing STA cannot switch its state to semisleep because the information required for downclocking cannot be correctly decoded. Otherwise, if an intended frame is involved in the collision, the frame cannot be successfully delivered to the receiving STA and the STA has to sense the channel until the end of collision. Furthermore, the collided frame should be retransmitted, which results in a decrease of throughput and energy efficiency. Therefore, we propose a control scheme of contention window to decrease the ILT due to collision.

The key idea of contention window control is that the collision probability mostly depends on the number of competing STAs and that it can be controlled by the size of the contention window. Note that the IEEE 802.11 standard proposes a minimum value of contention window () that is not scalable to the number of competing STAs. The IEEE 802.11 standard also proposes the BEB mechanism to compensate for the weakness of static and fixed value of ; with the BEB mechanism, the value of is doubled on detecting the transmission failure so as to mitigate the possible collision in the retransmitted frame. However, the BEB mechanism is reactive to the transmission failure, so it is not effective to prevent collisions as the number of STAs increases. In addition, it was shown in [25, 26] that the optimal value of minimizing the collision probability is proportional to the number of STAs. Based on these insights, we propose a proactive approach for controlling the contention window as follows. The AP keeps track of the number of associated STAs, defined as , and advertises this information by periodically broadcasting the beacon frames containing it in the “station count” subfield of “BSS Load” element. After being informed of , each STA sets its contention window, , asIn (3), () is a scaling factor that can be used to control the trade-off between collision probability and channel access delay. The larger value of leads to the decrease in collision probability at the cost of increase in channel access delay. Note that the STA maintains the BEB mechanism; that is, is doubled on detecting a transmission failure, which plays as a subsidiary function to deal with collisions.

4. Analysis Model

We derive an analysis model to evaluate and compare the performance of the proposed mechanism with those of two conventional approaches, PSM and downclocking. We consider that PSM-enabled STAs (hereafter, the term of “PSM-enabled STA” indicates the STA that implements any type of power saving schemes, for example, IEEE 802.11 PSM, downclocking, or the proposed mechanism) coexist with non-PSM-enabled STAs (background STAs) and that the PSM-enabled STAs receive frames from the AP while background STAs transmit frames to the AP. We consider two performance indices for PSM-enabled STAs, energy per bit and throughput. The former is introduced to measure the energy efficiency and is defined as the amount of energy consumed to successfully retrieve a single bit of data frame, while the latter evaluates the transmission efficiency and is defined as the number of bits successfully delivered during a given time. For the purpose of clarity and tractability, we make the following reasonable assumptions: (i) each STA can sense the transmission of other STAs, such that there is no hidden STA, (ii) the probability of transmission failure resulting from channel error is negligible thanks to an appropriate link adaptation mechanism, (iii) the LI is properly set such that the PSM-enabled STA has data frames to retrieve in the first active BI but goes to sleep during the remaining BIs within the LI, (iv) the background STAs always have data frames to transmit, and (v) the transmission rate of each STA does not change but may be different from that of other STAs. For the tractability of analysis, we consider that the PSM-enabled STA always has data frames to retrieve in the active BIs, but no data frame to retrieve in the sleeping BIs. Also, we consider that STAs that have the same transmission rate of belong to the th group (, where is the number of total groups) and that the th group consists of STAs.

In order to derive the analysis model, we employ the two-dimensional Markov chain process similarly to the traditional Bianchi’s model [25] and its variants [2729]. Firstly, we calculate the probabilities of channel access, collision, and transmission success. From our previous work in [6], the probability of transmission attempt () by an STA in the th group is given byHere, and are the probabilities of busy channel and collisions, respectively, is the contention window size at backoff stage of group- STA, and is the maximum retry limit of group- STA. Since the group- STA finds the channel busy if there is at least one ongoing transmission except for itself, can be represented asIn a similar way, the transmission of the group- STA collides if more than one STA is transmitting at the same time. Thus, is given asIt is noteworthy that both and have the same formula, as shown in (5) and (6). However, they occur in different conditions; is the probability under the condition that the group- STA is not involved in transmission but senses the channel, whereas is the probability under the condition that the group- STA is transmitting. Let us define as the probability of successful transmission by the group- STA. If there is exactly one transmission by the group- STA, its transmission is successful, and thus, becomesFinally, , , , and can be numerically obtained from (4)–(7).

Now, we focus on the performance of power saving mechanisms. A PSM-enabled STA can be in any of the following six states:(S1)When the channel is idle, all the STAs decrease their backoff counters. This state takes up a small portion of ILT.(S2)The PSM-enabled STA successfully transmits a PS-Poll message and retrieves its data frame from the AP, and then it sends the corresponding ACK frame.(S3)The PSM-enabled STA overhears nonintended data frames transmitted by background STAs. This state accounts for the major part of ILT.(S4)There occurs a collision in which the PSM-enabled STA is involved (i.e., the PS-Poll message collides with data frame or other PS-Poll messages).(S5)There occurs a collision among background STAs. During this state, the PSM-enabled STA unnecessarily consumes its energy in sensing the channel.(S6)The PSM-enabled STA remains in the sleep state.By considering these six states, we can obtain the performances of several power saving mechanisms. In the following subsections, we derive the formula of energy per bit and throughput in the cases of the conventional IEEE 802.11 PSM, the downclocking scheme combined with the PSM, and the proposed mechanism.

4.1. Analysis of IEEE 802.11 PSM

Here, we calculate the probability, time, and energy of each state addressed above when the IEEE 802.11 PSM is implemented. Let us denote and as the numbers of PSM-enabled STAs and background STAs, respectively, and we denote the total number of STAs as .

(S1) The probability of backoff state, , is given byWe define as the slot time defined in the IEEE 802.11 standard and as the power spent during the ILT. Then, energy consumed during the backoff state, , becomes(S2) The probability of successful data retrieval by the group- PSM-enabled STA, , is calculated asIn this state, the PSM-enabled STA transmits a PS-Poll message and ACK frame, as well as receiving data frame from the AP. Thus, the transmission time, , becomesHere, and are the time spent in transmitting the PS-Poll message and ACK frame, respectively, and they can be represented aswhere is the constant time to transmit PHY header, , , and are the sizes of MAC header, PS-Poll message, and ACK frame in bits, respectively, and is the transmission rate of ACK frame and PS-Poll message. (The transmission rate of ACK frame and PS-Poll message () is usually smaller than the transmission rate of data frame () for reliable transmission, and it depends on the implementation. In this work, we assume that is equal to the minimum transmission rate, .) In the same way, the time spent in receiving the data frame from the AP, , is represented aswhere is the size of frame transmitted by the AP. Defining and as the receiving and transmitting power, respectively, the total energy consumed in this state, , becomes(S3) This state is identical to the case where a background STA successfully transmits its frame without any collision. The probability of this state, , can be represented asThe duration of this state, denoted as , iswhere is the transmission rate of the background STA. The energy consumption of the PSM-enabled STA in overhearing the nonintended frame, , is(S4) There are two substates of collision where the PSM-enabled STA is involved. The first one is the collision between two or more PS-Poll messages; the second one is the collision between PS-Poll message and data frame. The corresponding probabilities of these two substates, denoted as and , respectively, can be obtained asIn the first case, the PSM-enabled STA consumes energy in transmitting the PS-Poll message; in the second case, it consumes additional energy in overhearing the collided data frame. Therefore, the effective energy consumed by the PSM-enabled STA during this state can be represented as(S5) The probability of collision between background STAs, , isand the energy consumption of PSM-enabled STA in overhearing the collided data frame, , becomesIt is important to note that all the probabilities of active states in (8), (10), (15), (18), and (20) are calculated for those BIs where all the PSM-enabled STAs are active and that these probabilities of group- STAs are characterized by , which can be obtained numerically from (4) to (7) by considering the probabilities of all the other groups.

(S6) We define as the amount of power consumed in the sleep state and as a positive integer value such that . We define a virtual cycle as the time interval between the time at which a certain PSM-enabled STA starts a backoff procedure to retrieve a frame from the AP and the time at which it starts the next backoff procedure within an active BI. Thus, the virtual cycle consists of several times spent in each active state (S1)–(S5) except for the sleep state (S6), and its average value for the group- STA, , can be represented asNote that the PSM-enabled STA can retrieve frames from the AP several times within one active BI and the ratio of time required for retrieving one frame to the whole active BI is . Then, the effective sleep time spent in retrieving one frame by the group- PSM-enabled STA, , and the corresponding energy consumption in the sleep state, , becomeNow, we define the energy per bit () as the effective amount of energy consumed to successfully retrieve a single bit of data frame; that is,where is the total energy consumed for the PSM-enabled STA to retrieve one frame from the AP; that is,We denote as the throughput achieved by the group- PSM-enabled STA, which is the number of bits successfully transmitted during a virtual cycle including sleep state, that is, . Then, can be represented as

4.2. Analysis of Downclocking Scheme

From the view point of energy efficiency, the downclocking scheme is identical to the IEEE 802.11 PSM, except for the reduced power in overhearing nonintended frames destined to the background STAs. The probabilities of each of the six states do not change from those of IEEE 802.11 PSM; the only difference happens in the energy consumption at the state (S3). Similar to (1), the semisleep time at the state (S3) is given as . Here, we assume that is large enough such that has a positive value. Let us denote as the power during the semisleep time, which is less than due to downclocking. Then, the energy consumption at state (S3), in (17), is changed asIt is important to note that the throughput of downclocking scheme does not change from that of IEEE 802.11 PSM.

4.3. Analysis of the Proposed Mechanism

Compared to the downclocking scheme, the proposed mechanism maintains two additional features of frame aggregation and contention window control. Note that the frame aggregation and contention window control are applied to all the STAs, regardless of the capability of PSM. In order to get the analysis results for the proposed mechanism, several probabilities represented in (4)–(7) need to be obtained numerically by considering the change in the minimum contention value () as given in (3). This leads to the different probabilities of all the six states from those of IEEE 802.11 PSM and downclocking. Moreover, we should consider the frame aggregation as given in (2) when calculating the transmission time of aggregated frame. Let us define as the number of aggregated frames that are delivered to a group- PSM-enabled STA after it successfully transmits a PS-Poll message to the AP. With the traffic model considered in this analysis model, at state (S2) becomes and needs to be changed from (13) toIn the same way, the transmission time of background STAs at states (S3), (S4), and (S5) is also changed due to frame aggregation asHere, is the number of aggregated frames for a background STA to transmit; that is, ; is the time required for transmitting the block ACK frame; that is,where is the size of block ACK frame. In addition to and , in (11), (19), (21), and (22) needs to be replaced with . Also, in (24) and (26) is replaced with to account for the frame aggregation. The effect of downclocking can also be considered from (27) with the extended semisleep time, , due to the change of in (29). The remaining equations can be calculated straightforwardly, as in Section 4.1.

5. Simulation Results and Discussion

5.1. Simulation Configuration

In this section, we validate the analysis model and compare the performance of the proposed mechanism with those of the conventional power saving mechanisms. We developed our simulator using MATLAB, which implements the IEEE 802.11g MAC/PHY layers. Table 1 lists the parameters and their values used in the simulations. The values of power consumption in Table 1 were set based on the real measurements of the WLAN interface card [30] (we performed the simulations with a different set of power consumption parameters given in a recent paper [31] and found that the relative performance is hardly affected by these parameters). In the proposed mechanism, we set two scaling factors, and , as 1. In order to focus on the efficiency of power saving mechanisms, we consider only downlink traffic for the PSM-enabled STAs and uplink traffic for the background STAs. If we consider uplink traffic in the PSM-enabled STAs, as well as downlink traffic, we expect that the energy efficiency will decrease with respect to the load of uplink traffic; however, the relative performance among all the comparative mechanisms will be mostly maintained because the PSM-enabled STAs work in the same way to transmit the uplink traffic, regardless of power saving schemes. Also, the effect of uplink traffic in the PSM-enabled STAs can be indirectly inferred from the effect of increasing in the number of background STAs. We set the beacon interval to 100 ms, which is the default value recommended by the IEEE 802.11 standard, and set the simulation time to 1 million time slots. We consider the following four scenarios in terms of the number of STAs, transmission rate, and traffic load, to evaluate the performance of the proposed mechanism under diverse configurations.

Scenario 1. We fix as 1 while varying from 2 to 20. All the STAs have the same transmission rate of 48 Mb/s. In addition, we set LI of PSM-enabled STA to two beacon intervals and consider that the PSM-enabled STA attempts to retrieve as many frames as possible during the first beacon interval within the LI and goes to sleep during the next beacon interval, while background STAs always have data frame to transmit.

Scenario 2. We set both and to 4 and set the transmission rate of each STA differently, to 6, 12, 24, and 48 Mb/s. We consider the same traffic model as Scenario 1. The LI is set to two beacon intervals for all four PSM-enabled STAs, but the active beacon intervals for frame retrieval are set differently among them.

Scenario 3. We set the interarrival time of frames for all the STAs randomly according to an exponential distribution with the mean value of 50 ms. Here, we set LI as three beacon intervals and set as 1 while increasing from 2 to 20.

Scenario 4. We consider a configuration of light traffic load where the frame arrivals per beacon interval follow a Poisson distribution with the mean value ranging from 1 to 10. We fix as 1 and consider several values of , such as 4, 6, and 10.

As the performance indices, we consider energy per bit and throughput, which represent the energy efficiency and transmission efficiency, respectively. All the values of performance indices given at tables and figures in this section represent the averaged value with 30 instances of simulations.

5.2. Validation of Analysis Model

In the first simulation, we validate the analysis model derived in Section 4, by comparing the analysis results with simulation results. We consider three power saving mechanisms in Section 4; hereafter, we refer to them as BASE (IEEE 802.11 PSM), DC (downclocking), and TRIPL (proposed mechanism). The simulations were conducted for Scenario 1. Figure 3(a) confirms the accuracy of analysis models; that is, the energy per bit () attained from simulations closely coincides with that obtained from analysis. As shown in Figure 3(a), increases with respect to the increase of due to the increase of ILT and collision. As increases from 2 to 20, values of BASE, DC, and TRIPL, increases from 0.13, 0.12, and 0.05, to 0.93, 0.85, and 0.33 in J/bit, respectively. It is worthwhile to note that the effect of downclocking on energy efficiency is not drastic. Compared to BASE, DC lowers by about 8%. However, TRIPL significantly decreases 2.6- and 2.4-fold on average, compared to BASE and DC, respectively.

Moreover, we validate the analysis results in terms of per-STA throughput of PSM-enabled STA () in Figure 3(b). We observe from Figure 3(b) that there is no significant deviation between simulation results and analysis results. Recall that the analysis model gives absolutely no difference in of BASE and DC. Figure 3(b) shows that decreases as increases for all the mechanisms. This is because all the STAs share the channel, and the probability of collision increases as increases. TRIPL maintains a higher value of than BASE and DC by about 65–85%, which results from the decrease of collision and channel access overhead.

5.3. Performance Validation of the Proposed Mechanism

The goal of this simulation is to compare the performance of TRIPL with BASE when several PSM-enabled STAs coexist with background STAs and STAs have different transmission rates. Table 2 lists energy per bit () and the fraction of energy consumption due to IL () for four PSM-enabled STAs () under Scenario 2. The transmission rate given in Table 2 indicates the transmission rate of data frame from the AP to each PSM-enabled STA. Note that and retrieve data frames at the odd beacon intervals and sleep at the even beacon intervals, while and alternate data retrieval and sleep in different beacon intervals as compared to and .

First, we investigate the performance of BASE. In terms of , all four STAs have similar performance in spite of the difference in the transmission rate as long as they share the same beacon intervals for the active state: of and is about 0.82 μJ/bit while of and is about 0.59 μJ/bit. These results can be explained as follows. According to the operation of CSMA in multirate WLANs, the PSM-enabled STAs with the same active beacon interval receive a comparable amount of data frames, regardless of the transmission rates. Meanwhile, they have the same power in receiving intended frames or in overhearing nonintended frames. On the other hand, of and is decreased compared to that of and ; that is, the energy efficiency is improved in high-rate STAs because they can retrieve more data in a given time. However, it is important to note in Table 2 that even though of BASE decreases in high-rate STAs, rather increases with respect to the increase of transmission rate. This means that high-rate STAs dissipate more energy due to the ILT. When high-rate STAs coexist with low-rate STAs, high-rate STAs spend more ILT in overhearing nonintended frames destined to low-rate STAs that occupy more airtime to receive a data frame than those destined to high-rate STAs.

Next, we focus on the performance of TRIPL. The results of TRIPL are quite different from those of BASE and are summarized as follows:(1)While of BASE is independent of transmission rate for STAs sharing the same active beacon intervals, of TRIPL is inversely proportional to the transmission rate, regardless of STA’s active beacon interval.(2)The increase of transmission rate increases of BASE, but it hardly affects of TRIPL.(3)Compared to BASE, TRIPL decreases by up to almost 6 times and decreases by up to 16%.These results mainly stem from the frame aggregation implemented in TRIPL, which makes all the STAs occupy the comparable amount of airtime, regardless of their transmission rates. Each STA consumes a comparable amount of energy in overhearing nonintended frames, regardless of transmission rate, but the amount of bits delivered per channel access is proportional to the transmission rate. Therefore, becomes inversely proportional to transmission rate but becomes immune to transmission rate. As well as the frame aggregation, the downclocking and contention window control in TRIPL contribute to decreasing and .

5.4. Performance Comparison

In this section, we compare the performance of TRIPL with several mechanisms via simulations. As well as BASE and DC, we consider another two mechanisms, FA and DUAL. The former implements only the frame aggregation scheme as described in Section 3.2, whereas the latter integrates two schemes of downclocking and frame aggregation as proposed in Sections 3.1 and 3.2, respectively. By comparing the performances of DC, FA, and DUAL, we can quantitatively evaluate how large a contribution the downclocking or frame aggregation makes to improving energy/transmission efficiency. In the same way, we can observe the effect of contention window control by comparing the performances of DUAL and TRIPL. In this section, we consider two types of traffic models. With the first traffic model, all the PSM-enabled STAs always have data frames to retrieve in the active beacon intervals and all the background STAs always have data frames to transmit. Meanwhile, with the second traffic model, the PSM-enabled STAs sometimes do not have data frames to retrieve, even in the active beacon intervals, and switch to the sleep state, and the background STAs do not always have data frames to transmit. We refer to these two types of traffic as heavy traffic and light traffic and consider them in the subsequent Sections 5.4.1 and 5.4.2, respectively.

5.4.1. Performance Comparison under Heavy Traffic

Figure 4(a) compares energy per bit () of several mechanisms under Scenario 1. As shown in Figure 4(a), of all the mechanisms linearly increases as increases. The increase of is primarily due to the aggravation of contention, collision, and IL with respect to the increase of . We observe the following results from Figure 4(a):(1)When is small, of DC is not much decreased from that of BASE; that is, the downclocking cannot effectively improve energy efficiency. This is because the gain of downclocking is realized when the PSM-enabled STA overhears nonintended frames.(2)For the entire range of , of FA is lower than that of BASE and DC by about 33% and 28%, respectively. This means that the frame aggregation without downclocking saves more energy than the downclocking without frame aggregation, reconfirming the limited gain of the downclocking.(3)The combination of downclocking and frame aggregation amplifies the synergy in improving energy efficiency; on average, of DC and FA is lower than that of BASE by about 8% and 33%, respectively, but of DUAL is lower than that of BASE by at least 61%.(4)As increases, the gain of contention window control increases. As long as < 10, of TRIPL is smaller than that of DUAL by about 16%; however, it becomes smaller by up to 33% when = 20. This confirms that the contention window control is effective in mitigating collision and avoiding unnecessary retransmission, which become severe when is large.(5)TRIPL significantly outperforms the other mechanisms; compared to BASE, DC, and FA, it reduces by up to 2.9, 2.6, and 2.0 times, respectively. TRIPL systematically combines three individual mechanisms of downclocking, frame aggregation, and contention window control, each of which effectively handles energy wastage due to nonintended frame, channel access, and collision, respectively.

Next, we observe and compare per-STA throughput of PSM-enabled STA () of several mechanisms in Figure 4(b). As expected, DC achieves almost the same throughput as BASE because the downclocking hardly affects throughput. For the same reason, there is little difference between of FA and of DUAL. Figure 4(b) shows that of FA (or DUAL) is higher than that of BASE (or DC) by about 31%; this increase results from the decrease of channel access overhead (i.e., backoff, header overhead, and ACK transmission) due to frame aggregation. In Figure 4(b), it is interesting to observe that (i) when is small, of DUAL is quite larger than that of BASE but slightly smaller than that of TRIPL; (ii) as increases, TRIPL further outperforms DUAL in terms of ; that is, the ratio of of TRIPL to that of DUAL increases from 1.03 to 1.33 when increases from 2 to 20. These results imply that the frame aggregation plays an important role in increasing throughput when there are few STAs, whereas the contention window control does so when there are many STAs. Since TRIPL adopts both schemes, it maintains the outstanding performance of , regardless of ; TRIPL attains higher than BASE (DC) and FA (DUAL) by up to 85% (81%) and 30% (33%), respectively.

We can observe an interesting result in Figure 4(c) that the total throughput () of TRIPL is almost immune to the increase of , but other mechanisms suffer from the decrease of total throughput with respect to the increase of . The robust performance of TRIPL in response to the change of stems from the fact that TRIPL effectively mitigates collision by adjusting the minimum contention window with respect to the number of STAs.

5.4.2. Performance Comparison under Light Traffic

Up to now, we have investigated the performance of several mechanisms with heavy traffic under Scenarios 1 and 2. Now, we will consider light traffic under Scenarios 3 and 4.

Figure 5 shows the results of for several mechanisms under Scenario 3. Here, the average interarrival time of frames is set to 50 ms according to an exponential distribution; that is, two frames arrive during a beacon interval on average. As shown in Figure 5, the increase of mostly increases for all the mechanisms. Compared to the values of in Figure 4(a), the values of in Figure 5 are reduced as a whole, because the PSM-enabled STA can switch to the sleep state when there is no frame to retrieve from the AP and the energy consumption due to IL is decreased. Among all the mechanisms, TRIPL maintains the lowest value of , less than that of BASE, DC, FA, and DUAL by up to 31%, 28%, 17%, and 13%, respectively. Although TRIPL still outperforms the other mechanisms notably, the relative gain of TRIPL over other mechanisms is slightly reduced, compared to the case of heavy traffic (see Figure 4(a)). This is because the advantage of frame aggregation cannot be fully obtained when the traffic load is light; that is, the semisleep time decreases but the ILT due to channel access increases accordingly.

We further investigate the effect of traffic load on the performance by changing the average number of frame arrivals per beacon frame, denoted as . We introduce a new performance index, , to evaluate the relative gain of TRIPL over BASE in terms of energy efficiency; that is, = ((BASE) − (TRIPL))/(BASE). Figure 6 shows obtained under Scenario 4 where the value of ranges between 1 and 10 according to a Poisson distribution, and has different values of 4, 6, and 10. As increases, increases due to sufficient number of frames to aggregate and the increase of semisleep time. Recall that the proposed mechanism aims to improve energy efficiency during the active states by effectively dealing with the ILT, whereas there is no way to improve it during the sleep states. In addition, TRIPL attains higher when there are many background STAs, because its gain is boosted by effectively decreasing collision. When = 10 and = 10, achieves the maximum value of 0.48. When = 2, is about 0.13, 0.15, and 0.21 in the cases of = 4, 6, and 10, respectively. The results in Figure 6 indicate that TRIPL further improves energy efficiency when the traffic load is high and the network is dense with many STAs, which is a clearly increasing trend in the current and future WLANs.

6. Conclusion

In this paper, we focused on the problem of energy loss due to IL in IEEE 802.11 WLANs and verified that the IL is the primary obstacle to improving energy efficiency of WLANs. We also identified three sources of IL: overhearing nonintended frames, channel sensing and accessing based on backoff, and collision among STAs. As a solution to this problem, we proposed the energy-efficient mechanism consisting of three schemes: downclocking, frame aggregation, and contention window control. Each of these schemes is designed to alleviate a cause of IL. Firstly, the downclocking scheme allows the STA overhearing nonintended frames to remain in the semisleep state. Secondly, the frame aggregation scheme extends the semisleep time, as well as decreasing the channel sensing and accessing overhead per frame. Lastly, the contention window control scheme decreases collisions and avoids unnecessary retransmissions. Therefore, the proposed mechanism not only significantly improves energy efficiency but also increases total throughput. By deriving the analysis model and comparing the analysis results with simulation results, we validated the outstanding performance of the proposed mechanism in terms of energy per bit and throughput.

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIP) (no. NRF-2013R1A2A2A01009883).