Abstract
Auxiliary variable is commonly used in survey sampling to improve the precision of estimates. Whenever there is auxiliary information available, we want to utilize it in the method of estimation to obtain the most efficient estimator. In this paper using multiauxiliary information we have proposed estimators based on geometric and harmonic mean. It was also shown that estimators based on harmonic mean and geometric mean are less biased than Olkin (1958) and Singh (1967) estimators under certain conditions. However, the MSE of Olkin (1958) estimator and geometric and harmonic estimators are same up to the first order of approximations.
1. Introduction
The problem of estimating the population mean in the presence of an auxiliary variable has been widely discussed in finite population sampling literature. Kadilar and Cingi [1], Singh et al. [2], Singh and Vishwakarma [3], Singh et al. [4], and Koyuncu and Kadilar [5] proposed estimators in stratified random sampling. Ghosh [6] and Rao [7] have suggested estimators in stratified random sampling with multiple characteristics.
Olkin [8] has considered the use of multiauxiliary variables positively correlated with the variable under study to build up a multivariate ratio estimator of population mean .
In this paper, we have considered the multiauxiliary variables. Olkinβs [8] estimator is based on the weighted arithmetic mean of βs and is given as where (i) βs are weights such that , (ii) βs are the population means of the auxiliary variables and assumed to be known, and (iii) is the sample mean of the study variable and are the sample means of the auxiliary variables based on a stratified random sample of size drawn from a population of size . Let the population of size, , be stratified into strata with th stratum containing units, where such that .
Following Olkinβs [8] estimator, several other estimators using multiauxiliary variables have been proposed in recent years. Singh [9] has extended Olkinβs [8] estimator to the case where auxiliary variables are negatively correlated with the variable under study. Srivastava [10] and Rao and Mudholkar [11] have given estimators, where some of the characters are positively and others are negatively correlated with the character under study. The main objective of presenting these estimators was to reduce the bias and mean square errors.
Motivated by Singh [9, 12] and Singh et al. [13], we propose an estimator in stratified sampling as We also propose two alternative estimators based on geometric mean and harmonic mean, as such that .
These estimators are based on the assumptions that the auxiliary characters are positively correlated with . Let be the correlation coefficient between and and the correlation coefficient between and .
2. BIAS and MSE of the Estimators
To obtain the bias and MSEβs of the estimators up to first order of approximation, we write such that , where, Also, In the same way and are defined.
Further, let and , ().
Using Taylorβs series expansion, under the usual assumptions, we obtain Subtracting from both sides of (2.4) and then taking expectation of both sides, we get the bias of the estimator up to the first order of approximation as Subtracting from both sides of (2.4), taking square, and then taking expectation of both sides, we get the MSE of the estimator up to the first order of approximation as In the same way using the Taylor series expansion under the usual assumptions, we obtain
To calculate the bias and mean square error, we considered the terms having powers up to second degree only as the calculations become more complicated when the higher-order terms are included.
So, from (2.7), the bias and mean square error of the estimates up to are obtained as We see that MSEβs of these estimators are same and the biases are different. In general We know that in case of univariate the usual ratio-type estimator for the th auxiliary variable is superior to the mean per unit estimator , when Comparing the variance of with the mean square error of all the three estimators, we note that the ratio estimators given in (1.1), (1.2), and (1.3) are more efficient than .
3. Comparison of Biases
The biases may be either positive or negative. So, for comparison, we have compared the absolute biases of the estimates when these are more efficient than the sample mean. The bias of the estimator of geometric mean is smaller than that of arithmetic mean: Squaring and simplifying (3.1), we observe that Thus the above inequality is true when both factors are either positive or negative. The first factor of (3.2) is positive, when In the same way, it can be shown that the second factor of (3.2) is also positive when When both factors are of (3.2) is negative, the sign of inequalities of (3.4) and (3.5) reversed.
Also comparing the square of the biases of geometric and harmonic estimator, we find that geometric, estimator is more biased than harmonic estimator.
Hence we may conclude that under the situations where arithmetic, geometric and harmonic estimator are more efficient than sample mean and the relation (3.5) or is satisfied, the biases of the estimates satisfy the relation Usually the weights are so chosen so as to minimize the MSE of an estimator subject to the condition
4. Empirical Study
In this section, we use the data set earlier used in Koyuncu and Kadilar [5].
To illustrate the efficiency of suggested estimators, we consider the data concerning the number of teachers as the study variable (), number of students (), and number of classes () in both primary and secondary schools as auxiliary variables for 923 districts at 6 regions (as 1: Marmara, 2: Agean, 3: Mediterranean, 4: Central Anatolia, 5: Black Sea, and 6: East and Southeast Anatolia) in Turkey in 2007 (source: The Turkish Republic Ministry of Education). The summary statistics of the data are given in Table 1. We used the Neyman allocation for allocating the samples to different strata [14].
5. Conclusion
From Table 2, we observe that the ratio estimator based on harmonic mean is less biased. However, the mean square errors of the estimators are same. Hence for this data set, we conclude that when more than one auxiliary variables are used for estimating the population parameters, it is better to use harmonic mean as an estimator in case of stratified random sampling.