#### Abstract

In cognitive radio (CR) cooperative sensing schemes, wireless sensor nodes deployed in the network
sense the licensed spectrum and send their local sensing decisions to a fusion center (FC) that makes
a global decision on whether to allow the unlicensed user transmit on the licensed spectrum, based
on a decision (fusion) rule. *k*-out-of-*N* is widely used in the literature owing to its practical simplicity.
Regrettably, it exhibits a tradeoff between the achievable probabilities of false alarm and miss detection,
which could have consequent effects on the performance of CR. In this paper, based on the notion
of typical sequences, we propose a novel fusion rule in which the false alarm and miss detection
probabilities can be *simultaneously* made as small as desired (asymptotically zero as the number of
sensors goes to infinity).

#### 1. Introduction

Cognitive Radio (CR) has recently emerged as a topic of interest in wireless research, following the findings of the United States' Federal Communications Commission's (FCC) Spectrum Policy Task Force Report [1] that most radio spectra go unused most of the time. CR has therefore been proposed as a tool to increase the spectrum usage efficiency. Unlicensed users (called secondary users or SU) are allowed to temporarily access licensed bands that are not utilized by their corresponding licensed users (called primary users or PU), provided that the formers do not cause significant interference to them.

Hence, an important task in CR is for an SU to efficiently monitor spectrum and sense any transmission from a PU on its licensed band. Traditional spectrum sensing techniques are energy detection [2, 3], matched filter detection [4, 5], and cyclostationary feature detection [6, 7]. Energy detection is the simplest method to detect the presence of a PU signal based on the amount of energy. As such it does not require any knowledge about the PU's or channel features, yet achieves less reliable detection than the other schemes. Matched filter is the optimal sensing method, though it requires a priori knowledge of the PU's channel parameters and signal. Finally, cyclostationary feature detection outperforms energy detection by being able to distinguish white noise from signals, but fails to distinguish the PU's signal from other noisy signals such as very far signals or unintended transmissions from neighboring cells [8]. As each approach is limited, spectrum sensing appears to be one of the most challenging tasks in CR, particularly in the very low SNR case (where it is hard to distinguish signals from noise) or in a hidden terminal scenario (where primary users are shadowed or in a deep fade).

Very recently, cooperative spectrum sensing has been suggested to overcome the aforementioned issues. A network operator deploys a (large) centralized wireless network of sensor nodes. These periodically sense the spectrum in search of any PU transmission, then feed their sensing findings (local decisions) back to a fusion center (FC). Based on these reports, the FC makes an educated guess on whether to allow the secondary user(s) to transmit over the PU's channel. As it is unlikely that all sensors suffer from very low SNR or incur a hidden terminal problem, it is expected that cooperative sensing can overcome these issues that are usually difficult to solve in traditional noncooperative sensing. A crucial task, however, is the global decision rule (by the FC) also known as the *fusion rule*, as it will impact on the CR global performance. This issue shall be the focus of this paper.

Assume that all sensors' decisions are independent (uncorrelated). Of particular interest is the case where there is disagreement among the sensors (i.e., some sensors claim the PU is active while others claim the PU is silent). In such a case, the decision making process becomes less obvious. The aim of the FC is to *minimize* two probabilities of error: (i)*a probability of false alarm (PFA)*: probability that the FC wrongfully decides that PU is *active* (while it is, in fact, *silent*),(ii)*a probability of miss detection (PMD)*: probability that the FC wrongfully decides that PU is *silent* (while it is, in fact, *active*).

There has been a significant number of research works on deriving efficient fusion rules, see, for example, [9] and references therein for a survey on the main fusion rules in literature. In particular, two rules are more commonly used in CR. A first fusion rule, called *-out-of-N* [9, 10], requires the FC to allow the SU to transmit if at least sensors out of assume PU is silent. Another rule is the Bayesian rule [10, 11], in which a likelihood (or log-likelihood) ratio is computed at the FC and compared to a given threshold in order to determine which hypothesis (PU is silent versus PU is active) is more likely. In [10], the authors showed that *both fusion rules are equivalent*.

Although such rules enjoy practical simplicity, unfortunately they cannot minimize both probabilities (PFA and PMD) at the same time: (i)if is too small (close to 0), the FC will, most of the time, assume that PU is silent. Then, clearly and (this also can be seen in (14)-(15), by tending to 0). Subsequently, the FC will allow the SU to transmit, most of the time. In such a case, the SU will enjoy a good data rate but the PU will suffer from substantial interference from the SU, (ii)contrarily, if is too large (close to ), the FC will, most of the time, assume that PU is active. Then, clearly but (this also can be seen in (14)-(15), by tending to ). Subsequently, the FC will forbid the SU from transmitting, most of the time. In such a case, the PU will enjoy an interference-free transmission but the SU will have a disrupted transmission (which makes CR less interesting).

In practice, usually CR systems are designed such that one probability is minimized while the other has a tolerable (but nonnegligible) value [10].

Contrarily, we show in this work that it may be possible to minimize both the PFA and a PMD, using the *asymptotic equipartition property* (AEP), a law first stated by Claude Shannon in his seminal work [12] and one that derives from the law of large numbers. Though it may have a rather evolved definition, the AEP has a simpler interpretation: (i)of all possible events, some events are more likely to occur than others (in other words, not all events are equally likely), (ii)most-likely events are fewer than unlikely events (in other words, of all possible events, only a fraction of them occur most of the time).

Precisely, let us assume that the sensors report their binary local sensing decisions (1: PU is active, 0: PU is silent) to the FC. Then, the AEP simply states that, of all the combinations (sequences) of possible reports, only a *fraction* of them are *almost surely*, (that is, with probability tending to 1 as goes to infinity,) likely to be received, should PU be indeed, say, active. Such sequences are called *typical sequences* [13].

Therefore, we suggest in this work that the FC simply ignores *nontypical* sensor reports, and only believe *typical* ones, as they occur most of the time. By disregarding unlikely (nontypical) report sequences, we demonstrate in this work that the probabilities of false alarm and miss detection at the FC using can be *simultaneously* made as small as desired, and tend to zero as the number of sensors is increased.

#### 2. Preliminaries

##### 2.1. Notations

The following notations will be considered in this work. When is a variable, denotes the probability of occurrence of a realization of while denotes the distribution (probability density function, pdf) of the random variable . denotes the Shannon entropy [13]: where denotes the mathematical expectation and denotes the base-2 logarithm. When is a finite set of possible realizations of a random variable , let denote its cardinality, that is, the number of elements in . Finally, denotes the probability that a realization of the random variable lies in the set , that is, .

##### 2.2. System Model and Assumptions

###### 2.2.1. Network Model

We consider a centralized wireless sensor network made of an FC and sensors. We assume that sensors are deployed to periodically sense the spectrum, detect the activity of the PU, and make a local decision on whether the PU, is active or silent. Two hypotheses (sensing outcomes) are possible. (i)Hypothesis : a sensor decides that PU is active. (ii)Hypothesis : a sensor decides that PU is silent.

After making a local sensing decision on the activity of the PU, the sensors report their findings to the FC.

##### 2.3. Channel Model

Each sensor is assumed to be equipped with a single antenna. For , sensor senses at time , a signal sample corrupted by an additive white Gaussian noise with zero mean and variance denoted as , through a Rayleigh flat-fading channel with equivalent baseband channel , given by [14]: where (i) denotes the distance between the PU and sensor , (ii) denotes a reference distance, (iii) denotes a path loss exponent, (iv) denotes a random variable that accounts for Rayleigh fading, (v) denotes a random variable that accounts for a phase shift uniformly distributed over , and (vi) denotes a random variable that accounts for Log-normal shadowing.

##### 2.4. Cooperative Spectrum Sensing Model

###### 2.4.1. Locally Sensing the PU's Signal

Each sensor filters its received signal in the sensed spectrum then performs sampling. Let denote the number of samples obtained from the received signals, which we assume common to all sensors and in agreement with Nyquist's rule (at least twice as large as the product of the sensed bandwidth and sensing time). Subsequently, energy detection is used for spectrum sensing [8, 9]:
Therefore, the hypotheses and (mentioned earlier in Section 2.2.1) can be defined as follows:
In other words, represents the hypothesis that the sensed signal is only the additive noise (PU is silent) whereas represents the hypothesis that the sensed signal is the sum of a noise and a signal transmitted by PU (PU is active) and received through the channel . Using the energy detector, the local decision test is given by [8, 9]:
where is a local decision threshold that we assume common to all sensors. The sensors' decisions are assumed *independent*. (In practice, independence can be achieved by sufficiently (a few times the wavelength of their antennas) spacing the wireless nodes. For instance, for GHz transmissions, it suffices that sensors be spaced a few centimeters apart from each other.) and *identically distributed*. (Such assumption may hold in many settings, particularly when sensors have similar sensing capabilities, use the same sensing method/threshold, and are uniformly distributed in the area of concern.) Under hypothesis , such distribution is known to be chi-squared with degrees of freedom, that is [8, 9],
while the test distribution under hypothesis is noncentral chi-squared with degrees of freedom and noncentrality parameter , that is, [8, 9],
where denotes the Bessel function of the first kind.

###### 2.4.2. Sending the Local Reports to the FC

Let denote the local test decision of sensor , . Then, local decisions can be modeled as binary i.i.d. random variables. Precisely, for all , let:
then, the set of all reports received by the FC can be modeled as a *sequence * of binary i.i.d. random variables, each taking 0 with a probability and 1 with probability . Subsequently, the sequence of sensing reports has the following probability of occurrence:
where denotes the number of zeros within the sequence . We also shall assume (for simplicity) that the sensing reports , are transmitted error-freely to the FC (an assumption that can be achieved through the use of a powerful (low-rate) error-correcting code and a transmit rate that is no larger than the capacity of the channel between the sensors and the FC). Finally, based on the received sequence of sensing reports, the FC attempts to make a decision on whether the PU is indeed silent or active (according to a *fusion rule*), while trying to *minimize* the PFA and the PMD.

#### 3. The PFA-PMD Tradeoff

As explained in the introduction, so far in the literature it has been very difficult to simultaneously minimize both probabilities (PFA and PMD). We explain here why. In all that follows, , respectively denote the (local) PMD and PFA at each sensor. (According to our model, all sensors have the same local probabilities of false alarm and miss-detection, see Section 2.4.1 for further details.), while , , respectivley, denote the (global) PMD and PFA at the FC.

##### 3.1. The Single-Sensor Case

Let us start by considering the simplest scenario where only one sensor node makes up the sensor network. According to our local sensing model in Section 2.4.1, the probability of miss detection is given by Likewise, the probability of false alarm reads

As we only have 1 sensor in the network, it is evident that the FC is wrong *iff* the sensor is wrong. Thus

Now, we shall explain the tradeoff between the PFA and the PMD at the FC. (i)If we are to minimize , we have to minimize . As , this integral is minimized *iff **is minimized*. (ii)If we are to minimize , we have to minimize . As , this integral is minimized *iff **is maximized*.

Hence, it is not possible to *simultaneously* minimize both and if the network is made by merely one sensor. Figure 1 illustrates the tradeoff between minimizing the local PFA and minimizing the local PMD.

##### 3.2. The Multiple-Sensor Case

Now, let us assume that the wireless sensor network is made up by sensors. The PMD and PFA probabilities ( and ) at the FC depend upon the fusion rule. -out-of- [9, 10, 14] is a widely used fusion rule owing to its simplicity. Therefore, we illustrate the tradeoff between PFA and PMD when using this fusion rule at the FC. Basically, a hypothesis, say , is favored if at least sensors out of confirm it [15]: Subsequently, the probability of false alarm at the FC is given by [15] whereas the probability of miss detection at the FC is given by [15]: As we have, then, we can see that Likewise, we can infer from (15) that Therefore, it is not possible to simultaneously minimize both PFA and PMD in the multiple-sensor case when using -out-of- as a fusion rule.

#### 4. Proposed Fusion Rule

##### 4.1. Problem Statement

As explained earlier, even in the multiple-sensor case, it is not possible to simultaneously minimize the probabilities of false alarm and miss detection when using -out-of-, the conventional fusion rule. Therefore, the aim of this work is to propose a fusion rule at the FC that achieves probabilities of miss detection and false alarm that can be made simultaneously asymptotically zero.

##### 4.2. How to Read This Section

For convenience in exposition, we summarize the main content of this section. (i)First, we provide a paragraph where we familiarize the reader with the notion of typical sequences, through a summary of the main results as well as an intuitive interpretation. (ii)Then, we intuitively explain the motivation behind applying such notion to our problem (minimizing PFA and PMD). (iii)Then, we provide a detailed description of the proposed fusion rule (based on the notion of typical sequences). (iv)Finally, we state Theorem 3, the main contribution of this work.

##### 4.3. Typical Sequences: A Bird-Eye's View

*Definition 1 (Typical Set [13]). *Let denote i.i.d. random variables, their (any) probability density function, and a strictly positive real number. A sequence of realizations of the aforedescribed random variables is said to be -*typical* if
The set of -typical sequences is called *typical* set.

Put into simpler terms, the typical set is the set of sequences whose probability of occurrence is roughly .

Theorem 2 (properties of a typical set [13]). *The typical set has the following properties when is sufficiently large: *(i)*,
*(ii)*. *

In other words, (i)the typical set has a probability of occurrence that nears 1,(ii)the number of sequences in the typical set is nearly .

These properties of typical sequences follow from the law of large numbers. Simply put, Theorem 2 states that if we draw a sequence of i.i.d. variables, then this sequence will *almost surely* (i.e., with probability , with tending to zero when goes to infinity) lie in the typical set . In what follows, based on the notion of typical sequences, we provide a hard fusion rule at the FC that achieves arbitrarily small probabilities of (global) miss detection and false alarm. First we outline the proposed scheme, then we justify why it achieves such unprecedented performance.

##### 4.4. Motivation behind Applying the Notion of Typical Sequences to Cooperative Spectrum Sensing in CR

In a network of -sensors, FC receives a report made up by binary random variables. Therefore, there are at most possible reports that can be received by the FC. However, according to Theorem 2, these do not necessarily have the same probability of occurrence. As a matter of fact, Theorem 2 tells us that of all these possible sequences, *only* a fraction of them (roughly ) *occur most of the time when a certain hypothesis holds.* Without loss of generality, let us consider the set of likely (typical) events given hypothesis. (This choice is arbitrary. We could have defined the typicality with respect to hypothesis . Regardless of this choice, we shall show later that we can simultaneously minimize both the PFA and the PMD) . Motivated by the result in Theorem 2, we raise the following question: rather than believing all report sequences that are received (as suggested by conventional fusion rules), why not trust only report sequences that are *likely* to be received should hypothesis be true? Indeed, according to Theorem 2, if we ignore unlikely (nontypical) sequences, we will be right most (precisely, ) of the time, and wrong on much rarer occasions ( of the time). Therefore, even if we disregard such unlikely reports and they turn out to be true, they will have very little impact on the average probability of error.

###### 4.4.1. Outline of the Proposed Sequence-Typicality-Based Fusion Rule

Motivated by the previous reflections, we propose the following fusion rule. (1)Set initial parameters: the local sensing threshold , the local test statistics with respect to hypotheses , . The probabilities and can therefore be computed.(2)Make a list of all typical report sequences with respect to the hypothesis , given a sufficiently large number of nodes and an arbitrary real strictly positive number : *what are the sequences that are likely to be received when hypothesis ** is indeed true? *(3)Whenever a sequence of reports is received, the FC performs the following computationally simple test.(a)If the sequence *is in* the list , FC goes in favor of hypothesis (i.e., assumes that PU *is active*). (b)If the sequence *is not in* the list , FC goes in favor of hypothesis (i.e., assume that PU *is silent*).

###### 4.4.2. Generation of the Typical Set

According to (1), a sequence of sensor nodes' decisions is -typical if Therefore, in order to determine such sequences, we need to determine the following (i)The probability of occurrence of any possible sequence , according to (9). (ii)The Shannon entropy of the i.i.d. random variables . The reports are i.i.d. discrete (binary) random variables that follow a Bernoulli distribution. (A one is received with probability and a zero is received with probability .) From the definition (1), the entropy reads

###### 4.4.3. Determining Which Sequences Are Typical

Now that FC has computed both the probability of occurrence of any sequence and the Shannon entropy , it can determine which sequences are in the typical set, as follows. (i)For every sequence (out of the possible sequences) (a)if , then the sequence *is in* the typical set. If such sequence is received, FC will decide in favor of hypothesis ;(b)otherwise (if or ), then the sequence *is not in* the typical set. If such sequence is received, FC will decide in favor of hypothesis . If such sequence is received, FC will decide in favor of hypothesis .

For clarity, a flowchart in Figure 2 illustrates the various steps in the implementation of the proposed fusion rule, as discussed in this paragraph.

##### 4.5. Analysis of the Probability of Error (False Alarm or Miss Detection)

###### 4.5.1. Main Result

Theorem 3 constitutes the main contribution in this work.

Theorem 3. *Let and respectivly denote the probabilities of (global) false alarm and miss detection of the FC's sensing decision. Then, irrespective of the local decision threshold , and are asymptotically zero when using the proposed fusion rule, that is,
*

*Proof. *The proof that we provide is inspired by the proof of Theorem 15.3.1. in [13, pages 530–532] on a different topic (the achievability of the capacity region for the multiple-access channel).

Using the proposed fusion rule, only 4 situations are possible, as summarized in Table 1. Of these 4 possible scenarios, only two represent a global decision error at the FC. Precisely, using the proposed fusion rule, FC is wrong if either of the following situations occur. (i)Situation : when the PU is active but is *not in* the typical set: this is a *miss detection*, *because under the proposed rule the FC would assume the PU silent*. (ii)Situation : when the PU is silent but is *in* the typical set: this is a *false alarm*, *because under the proposed rule the FC would assume the PU active*.

Let denote the probability that situation occurs,
Owing to Theorem 2, we have
Therefore, it follows that
Hence, using the proposed fusion rule, the probability of miss detection goes to zero when goes to infinity. We have by now proved the first part of Theorem 3.

Now, let us consider the second part (i.e., the probability of false alarm). The probability of occurrence of event is
Owing to Theorem 3, we know that the typical set has at most elements. Let us denote these elements as . Subsequently, we get
On the other hand, the law of total probabilities tells us that, for all :
Thus, we get:
Hence, also converges to zero when goes to infinity when using the proposed fusion rule.

###### 4.5.2. Commentary

We explain in this paragraph the consequences of Theorem 3. By applying the formal definition of the limit of a function in mathematics, (22) can be rewritten as follows:
By taking , we get
Therefore, it appears that for any real number , there exists a certain number of sensor nodes, , such that the proposed scheme can achieve and probabilities that are *both below *.

###### 4.5.3. On the Tightest Bound for a Given Sequence

Let us fix an arbitrary bound . Let be an -typical sequence. Then, by definition, the sequence has a probability of occurrence, , such that
Further, if the FC assumes such sequence of reports correct (as suggested in our fusion rule), then PFA and QFA will be lower than . However, is *not necessarily* the tightest bound on .

Indeed, for some sequences, there may be an such that
Therefore, one conclusion we may draw is that *even within the typical set, some sequences are more reliable than others.* Precisely, the closer the probability of occurrence, , of a sequence to the true () typical probability , the more reliable the sequence is (in terms of PFA and PMD).

Quantitatively, let denote the gap between and Let denote the smallest upper bound on PFA and QMD of , that is, Then, one may write: It follows that

#### 5. Numerical Examples

In this section, we report numerical examples obtained through computer simulations. The goals behind such examples are twofold.(1)First, we provide an illustrative example to give insight on how the proposed fusion rule can be applied in practice. Particularly, we explain how the list can be generated, given a number of active sensors in the network and an *arbitrary* threshold *on both probabilities* (PFA and PMD). (2)Second, we provide a numerical evaluation of the bounds on PFA and PMD for the proposed fusion rule compared with the conventional rule (-out-of-).

Toward this end, we consider a wireless sensor network of sensor nodes that sense the spectrum using an energy detector by taking samples, where has been varied from 5 to 20. A node's spectrum sensing is subject to an additive white Gaussian noise (AWGN) with zero mean and variance . Finally, local decision tests are with respect to a threshold that has been varied from 0 to 30.

##### 5.1. A Toy Example

We start by explaining, through an illustrative example, how the proposed scheme can be implemented in practice. This toy example is inspired by Problem 3.13 provided in [13, pages 68–69], to illustrate the general notion of typical sequences.

We consider sensor nodes in the network. (While is not really a large number, 1024 (the number of possible sequences) is a sufficiently large number for the targeted PFA and PMD threshold, . If much smaller probabilities are targeted, then there may be a need for a larger number of sensor nodes.) Thus, the number of possible sequences is .

Let us *arbitrarily* assume that the network operator wants *both* probabilities (PFA and PMD) to be less than, say, .

According to Theorem 3, is an upper bound on PFA and PMD. Therefore, a *sufficient* condition for such requirement would be to fix . Now, we shall determine the set of all possible -typical sequences, that is, those whose probability is bounded as

###### 5.1.1. Computing the Shannon Entropy

A first step towards this goal is to compute . Figure 3 illustrates the Shannon entropy of any given report relative to a sensor node , , for different local detection thresholds and detection samples . has been computed according to (21). From these results, it appears that the entropy-maximizing threshold is increasing with the detection sample. As the number of typical sequences is nearly , our simulation results give insights to the network operator on how the threshold should be set to manage the size of the typical set: (i)if the network operator wants to reduce the number of possible -typical sequences, should be set to a relatively low value;(ii)if the network operator wants to increase the number of possible -typical sequences, should be set to a high value.

###### 5.1.2. Determining the -Typical Sequences

After computing according to (21), we compute the probability of occurrence of every sequence (1024 in total), according to the formula in (9), for and (thereby yielding an entropy ), as reported in Figure 4. On the same figure, we plot three thresholds.(i)The probability that a (true, i.e., when ) typical sequence occurs. This probability is . It can be verified that . (ii)The outer bound on the probability of an -typical sequence, with . The outer bound is .(iii)The inner bound on the probability of an -typical sequence, with . The inner bound is .

The sequences whose probabilities of occurrence are between the two far-most lines (i.e., when ) are the -typical sequences while the sequences that lie outside the two far-most lines are the non--typical sequences. As claimed in Theorem 2 and depicted in Figure 4, we can see that nontypical sequences constitute the majority of the possible sequences.

##### 5.2. Performance Evaluation

Figures 5 and 6 respectivly illustrate the global PFA and PMD ( and (14) and (15)) of the conventional fusion rule (-out-of-) versus the number of sensors in the network, for different values of . On the same figures, the *upper bound*, , on the global PFA and PMD for the proposed fusion rule is plotted.

First, by observing the performance of the conventional fusion rule (-out-of-), we confirm the tradeoff that we discussed earlier in Section 3.2. For instance, while settings where is large (, , the red curves) are ideal for cooperative spectrum sensing from the perspective of minimizing PFA (Figure 5), they are unattractive from the perspective of PMD (Figure 6), as the latter is *increasing* with the number of sensors (in other words, cooperation hurts). Similarly, decreasing (the blue curves) yields a global PMD that is decreasing with the number of sensors but also a PFA that is increasing with the number of sensors.

Contrarily, the proposed fusion rule achieves global PFA and PMD that are *both* decreasing with the number of sensors in the network. This confirms the theoretical result in Theorem 3 that the proposed fusion rule achieves asymptotically zero PFA and PMD probabilities when goes to infinity. In other words, the proposed fusion rule can simultaneously achieve arbitrarily small PFA and PMD for a sufficient number of sensors.

Further, we observe that with the same number of sensor nodes in the network, the proposed fusion rule achieves the lowest PFA and PMD, which also suggests that the proposed fusion rule is more reliable than the conventional fusion rule, for a given number of sensor nodes.

#### 6. Conclusions

In CR cooperative sensing schemes, wireless sensor nodes deployed in the network sense the licensed spectrum and send their local sensing decisions to a fusion center (FC) that makes a global decision on whether or not to allow the secondary user use the spectrum, based on a decision fusion rule. -out-of- rule is widely used in the literature owing to its practical simplicity. Regrettably, it cannot minimize both probabilities of false alarm and miss detection, which could have consequent effects on the performance of CR. In this work, based on the notion of typical sequences, we propose a novel fusion rule in which the false alarm and miss detection probabilities can be *simultaneously* made as small as desired (asymptotically zero as the number of sensors goes to infinity).