Abstract

In this paper, we study the estimators of the population total in adaptive cluster sampling by using the information of the auxiliary variable. The numerical examples showed that the ratio estimator in adaptive cluster sampling without replacement of networks is more efficient than the ratio estimators in adaptive cluster sampling without replacement of units.

1. Introduction

Adaptive cluster sampling, proposed by Thompson [1], is an efficient method for sampling rare and hidden clustered populations. In adaptive cluster sampling, an initial sample is selected by simple random sampling without replacement of units (Thompson [1]). The unbiased estimators for adaptive cluster sampling with an initial sample taken by simple random sampling without replacement of units are Hansen-Hurwitz and Horvitz-Thompson estimator. Although the initial sample is selected by simple random sampling without replacement of units, some networks in the sample may be selected more than once. An adaptive cluster sampling design exists with selection without replacement of networks (Salehi and Seber [2]). The unbiased estimator for adaptive cluster sampling without replacement of networks is Des Raj estimator.

The estimators mentioned above are provided by the variable of interest only. Sometimes other variables are related to the variable of interest. We can obtain additional information for estimating the population total. The use of an auxiliary variable is a common method to improve the precision of estimates of a population total. Dryver and Chao [3] proposed ratio estimators based on Hansen-Hurwitz and Horvitz-Thompson estimator in adaptive cluster sampling without replacement of units. In this study, we will study the estimator of population total in adaptive cluster sampling without replacement of networks using an auxiliary variable. Some comparisons are made using a simulation.

2. Adaptive Cluster Sampling without Replacement of Units

As in the finite population sampling situation, the population consists of units indexed by their labels . Unit is associated as a variable of interest . The vector of the population total -values may be written as . The estimator of the population total is studied.

For adaptive cluster sampling, an initial sample of units is selected by simple random sampling without replacement. The unbiased estimator of the population total of the variable of interest is formed by using Hansen-Hurwitz estimator and Horvitz-Thompson estimator.

2.1. The Hansen-Hurwitz Estimator

The Hansen-Hurwitz estimator is based on draw-by-draw probabilities that a unit’s network is intersected by the initial sample.

Let denote the initial sample size and denote the final sample size. Let denote the network that includes unit and let be the number of units in that network.

The Hansen-Hurwitz estimator of the population total of the variable of interest can be written as (Thompson and Seber [4]) where is the average of the -values in the network that includes unit of the initial sample; that is, The variance of is The unbiased estimator of is

2.2. The Horvitz-Thompson Estimator

The Horvitz-Thompson estimator is based on probabilities of the initial sample intersecting networks. The unbiased estimator of the population total of the variable of interest can be written as (Thompson and Seber [4]) where is the sum of -values in network ; that is, . is the total number of distinct networks in the population and is the total number of distinct networks in the sample. is an indicator variable that takes a value of 1 with probability if the initial sample intersects network and 0 otherwise. The inclusion probability of network is , and the probability that networks  and are both intersected is The variance of is and the unbiased estimator of is

2.3. Ratio Estimators in Adaptive Cluster Sampling

Dryver and Chao [3] proposed the ratio estimator in adaptive cluster sampling based on Hansen-Horvitz’s ratio estimator as where , , and is the average of the auxiliary variable in the network that includes unit of the initial sample; that is, .

The approximated variance of is The unbiased estimator of is

The ratio estimator in adaptive cluster sampling based on Horvitz-Thompson’s ratio estimator is where The approximated variance of is where The unbiased estimator of is where

3. Adaptive Cluster Sampling without Replacement of Networks

Although the initial sample is selected by simple random sampling without replacement of units, some networks in the sample may be selected more than once. An adaptive cluster sampling design exists with selection without replacement of networks (Salehi and Seber [2]).

Salehi and Seber [2] proposed a new sampling design, that is, adaptive cluster sampling with networks selected without replacement. Their procedure is as follows.

The first initial sample unit is selected by simple random sampling from the population. Then the first network is created and this network is removed from the population. Next, the second initial unit is selected by simple random sampling without replacement of the remaining units; then a second network is created. And finally, the process is continued until all of networks are selected.

Let denote the initial sample size. Let denote the network that includes unit and let be the number of units in that network.

Further, let be the first-draw probabilities for network which includes unit . Thus , where is the number of units in the network which includes unit . So is the conditional th draw probability for the th network which includes unit in the sample given the first networks selection.

By using a modified Des Raj estimator (Salehi and Seber [2]), an unbiased estimator for the population total of the variable of interest is where, for , and for let From Raj [5] we find that are mutually uncorrelated so that the variance of (Salehi and Seber [2]) is and the unbiased estimator of is

The variance of the modified Des Raj estimator, , is less than that of the Hansen-Hurwitz estimator, (Salehi and Seber [2]).

4. Proposed Ratio Estimator in Adaptive Cluster Sampling without Replacement of Networks

Some other variables are related to the variable of interest. We can obtain additional information for estimating the population total. The use of an auxiliary variable is a common method to improve the precision of estimates of a population total. In this section, proposed ratio estimator in adaptive cluster sampling without replacement of networks will be described.

The proposed ratio estimator based on Des Raj’s estimator is where and is the modified Des Raj estimator for .

To obtain the variance of , we write where is the population ratio.

If is reasonably large, is closed to and is approximately equal to ; therefore, Let So, Therefore, The approximated variance of is and the estimator of is

5. Examples

5.1. The Blue-Winged Teal Example

As an example, the blue-winged teal data (Smith et al. [6]) were used in this study. The population region of 5,000 km2 in an area of Central Florida was partitioned into fifty units of 100 km2. These data were set to be -values. The total of -values, , is 14,121 (see Figure 1).

The -values were simulation results from the model , where (see Figure 2).

For each iteration, an initial sample of units is selected by simple random sampling without replacement. The -values are obtained for keeping the sample network. In each of the sample networks, the -values are obtained. The condition for added units in the sample is defined by .

For each estimator, 20,000 iterations were performed to obtain an accuracy estimate. Initial SRS sizes were varied with , 8, 10, 15, and 20. The estimated variance of the estimate total is where is the value for the relevant estimator for sample .

For the blue-winged teal example, the estimated MSE of estimators under the adaptive cluster sampling without replacement of unit were calculated and listed in Table 1. The estimated MSE of estimators under adaptive cluster sampling without replacement of networks were shown in Table 2.

5.2. Simulation Example

As a second illustration, we have used the simulation -values and -values from Chutiman and Kumphon [7] were studied. The populations were shown in Figures 3 and 4.

For each iteration, an initial sample of units is selected by simple random sampling without replacement. The -values are obtained for keeping the sample network. In each sample network, the -values are obtained. The condition for added units in the sample is defined by .

For each estimator, 20,000 iterations were performed to obtain an accuracy estimate. Initial SRS sizes were varied , 10, 15, 20, 30, 40, and 50.

For the simulation example, the estimated MSE of estimators under the adaptive cluster sampling without replacement of unit were calculated and listed in Table 3. Table 4 shows the estimated MSE of estimators under adaptive cluster sampling without replacement of networks.

From the examples, we see that the estimated MSE of adaptive cluster sampling without replacement of networks is smaller than the estimated MSE of adaptive cluster sampling without replacement of units; that is,

For the population total in adaptive cluster sampling by using the information of the auxiliary variable, we see that the estimated MSE of the ratio estimator in adaptive cluster sampling without replacement of networks is smaller than the estimated MSE of the ratio estimator in adaptive cluster sampling without replacement of units; that is,

6. Conclusions

We discuss using the auxiliary information, that is, the ratio estimators in adaptive cluster sampling without replacement of units and without replacement of networks. For the adaptive cluster sampling without replacement of units, Horvitz-Thompson’s ratio estimator is better than Hansen-Hurwitz’s ratio estimator (Dryver and Chao [3]).

Although the initial units in adaptive cluster sampling are selected by simple random sampling without replacement, networks may be selected more than once. So the adaptive cluster sampling without replacement of units is equivalent to the adaptive cluster sampling without replacement of networks. Therefore, the modified Des Raj ratio estimator is better than both Hansen-Hurwitz’s ratio estimator and Horvitz-Thompson’s ratio estimator.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publishing of this paper.

Acknowledgments

The authors would like to profoundly thank Paveen Chutiman for his programming advice. They would also like to express their special thanks to Assistant Professor Siriluck Jemjitpornchai and Prasert Vangsantitrakul for their valuable comments and suggestions on the material of this paper and for their help in correcting English as well.