Abstract

It is well accepted that suitable additional information improves the efficiency of an estimator but at the same time, it faced the situation of nonresponse. The two-occasion successive sampling is useful to handle the absence of a full response from respondents. In this present study, we have developed exponential estimators for population mean using a subsampling nonrespondent procedure. To show the efficacy of the recommended estimators, several properties are derived, and respective optimum replacement strategies are inferred. To evaluate the performance of the recommended estimators empirically, a computational study is carried out as well. It was found that the recommended estimators outperform the existing ones under the nonresponse in successive sampling.

1. Introduction

Incomplete information is a well-known problem in sample surveys, especially in socio-economic surveys of households, in which individual data are collected. The reasons for missing information may be migration, refusal to respond, not being available at the time of surveys performed, etc.

Jessen [1] initially encountered the problem and suggested the method of estimation under the successive sampling with partial replacement of units utilizing the complete information at the previous occasion. Furthermore, Patterson [2]; Rao and Graham [3]; Feng and Zou [4] studied the properties of different estimators under successive sampling. Biradar and Singh [5] and Singh and Vishwakarma [6]; Singh and Pal [7]; Sanahulla et al. [8]; Javaid et al. [9] and Pal et al. [10] used additional information for estimation under successive sampling.

The concept used in this paper is nonresponse. First time during the data collection, Hansen and Hurwitz [11] realize the problem of nonresponse at the estimation stage. He took this problem forward and realizes that while taking the subsamples from the nonrespondents group effect of nonresponse can be reduced. Furthermore, he developed the estimators utilizing the information from response and nonresponse groups together which was well accepted in the sampling theory.

Later on, several researchers including Chaudhary et al. [12]; Singh and Priyanka [13], and Pal and Singh [14] combined the concept of successive sampling and nonresponse and used it for estimation of population mean on current occasion on two-occasion successive sampling. Under different practical situations using the auxiliary variable, the aforesaid authors suggested some estimators when nonresponse was observed on the current occasion in two-occasion successive sampling.

The remainder of this paper is organized as follows: in section 2, terminology and notations are presented further the resent estimators and proposed estimators along with their properties are in sections 3 and 4, respectively. Section 5 provides the computational study, and concluding remarks are offered in section 6.

2. Terminology and Notations

Let be a finite population of N units, which has been sampled over two occasions. The character under study is denoted by on the first (second) occasion. It is assumed that information on an ancillary variable (with unknown population mean), which is positively correlated with the study variable, is readily available and almost stable over both the occasions. A simple random sample (without replacement) of n units is drawn on the first occasion. A random subsample of units is retained (matched) for its use on the second occasion. We assume that there is nonresponse at the current occasion, so that the population can be divided into two classes, those who will respond at the first attempt and those who will not respond. Let the sizes of these two classes be and , respectively. At the current (second) occasion, a simple random sample (without replacement) of units is drawn afresh from the entire population so that the sample size on the current (second) occasion is also . and , are the fractions of matched and fresh samples, respectively, at the current (second) occasion. We assume that in the unmatched portion of the sample on the current (second) occasion, units respond and units do not respond. Let and . Let denote the size of the subsample (of drawn from the nonresponding units in the unmatched (fresh) portion of the sample on the current (second) occasion (i.e., from the nonrespondents, on SRSWOR of units is selected with the inverse sampling rates , where , .

The notations used in the research paper are shown in the Table 1.

3. Recent Developments of Estimators

For estimating the current population mean , Singh et al. [15, 16] have given the estimators of set as

Singh et al. [15] also gave the estimators of set for estimating the current population mean as

Singh et al. [15] suggested the following estimators of population mean at the current (second) occasion by combining the estimators of sets and aswhere ; are unknown constants (scalars) to be determined under certain criteria. For detailed properties of the estimators , readers are referred to Singh et al. [15].

4. New Development of Estimators

4.1. Estimators Based on the Unmatched Portion of the Ample

Looking the formulation of the estimators given by (1) due to Singh et al. [15], it is clear that the estimator is defined in the situation, in which information on the auxiliary variable is obtained for all the sample units (drawn afresh from the entire population at the second occasion), and the population mean of the auxiliary variable is known, but some sample units fail to supply information on the study variable . We note that, when suggesting the estimators for the population mean on the current (second) occasion, Singh et al. [15] used only information on the sample mean of the auxiliary variable . However, one can also obtain the unbiased estimator of the population mean (without any extra effort) while in the process of obtaining the unbiased estimator of the population mean . Thus, in this situation, where information on the auxiliary variable is obtained for all the sample units , we have two unbiased estimators and , of the population mean of the auxiliary variable (see, Singh and Kumar [17]). With this background, we suggest the following estimators (using and together) for the population mean on the current (second) occasion based on unmatched portion of the sample of the size .where are, respectively, the estimates of ,

For the simplicity in calculation, Reddy (1974, [18]) and Ruiz Espejo [19] initially assumed that the coefficient of variation of study and auxiliary variables are equal. It is further assumed that under incomplete information, coefficient of variation of class is equal to coefficient of variation of population. Following this, we state that

Theorem 1. The MSE of the estimator to the fda is given by

Theorem 2. The MSE of the estimator to the fda is given by

Remark 1. The MSE of the estimators and is given while considering the assumption made by Singh et al. [15, 16] that the population correlation coefficient is equal to the nonresponse class correlation coefficient :

4.2. Estimators Based on the Matched Portion of the Sample

It is assumed that there is no nonresponse on the first occasion as well on the matched portion of the sample. Under the above assumption, we consider the following estimators based on the matched sample of size :where are the estimates of the population regression coefficients , respectively, based on the sample of size .

To the fda, the MSE of the proposed estimators is given by

Under the assumption , the MSEs (10) reduce to

4.3. Covariance between and

The covariance between (, ) is given by

It is to be noted that the expression (11) has been derived under the assumptions , , and .

4.4. Linear Combination of Estimators

We have suggested the following estimators for estimating the population mean at the current occasion by combining the different estimators at the matched and unmatched portions of the samples, respectively:where is suitably chosen constant to be determined under certain assumptions.

4.5. MSEs of the Estimators and

The MSEs of the estimators are derived up to fda, and under the assumption, , , and .

Theorem 3. MSEs of , 2) to the fda are obtained as

4.6. MMSEs of the Estimators , 2)

Since the MSE of the estimators in (15) and (16) is functions of unknown constants ’s , 2), therefore, the MSEs are minimized with respect to , 2), respectively, forand thus, the resulting MMSEs of the estimators , 2) are given aswhere , , ,, , , , , , , , , , , , , , ,

4.7. Optimum Replacement Policy

It is observed from (18) and (19) that the MMSEs of the estimators , 2) are the functions of , 2) (functions of sample to be drawn afresh at the second occasion); therefore, the optimum values of are obtained to estimate the population mean with minimum precision and lowest cost. To obtain the optimum values of , 2), we minimize the MMSEs of the proposed estimators , 2) given in (18) to (19), respectively, with respect to , 2) which result in quadratic equations in (, 2), and the respective salutations of , 2) say , 2) are given as follows:where

From (21) and (22), it is observed that real values of , 2) exist, iff the quantities under square roots are greater than or equals to zero, and for any combinations , , and , which satisfy the conditions of real situations, two real values of , 2) are possible, and hence, while choosing the values of , 2), it should be remembered that , 2). All other values of , 2) are inadmissible. Substituting the admissible values of (say) , 2) from (21) and (22) in (17) and (18), respectively, we have the optimum values of mean squared errors of , 2) which are shown as follows:

4.8. Efficiency Comparisons

The percent relative losses in efficiencies of the estimators , 2) are obtained with respect to the similar estimator and natural successive sampling estimator when the nonresponse no observed on any occasion. The estimator is for complete information and under the similar assumption as estimator , 2). Whereas the estimator is natural estimator under successive sampling, given bywhere , ,

Proceeding in similar manner as discussed for the estimators , 2), the optimum mean squared errors of the estimators , , 2) are derived aswhere (fraction of the sample for the estimator ),

Remark 2. For comparison of estimators and , it was advisable by Cochran [3] & Feng and Zou [4] to assume the intraclass correlation coefficient equal, that is, .

5. Numerical Illustrations

For , and different choices of and , Tables 2 & 3 give the optimum values of and percent relative losses , , 2; , 2) in the precision of estimators , 2) with respect to , , 2).

The percent relative losses in the precision of estimators , 2) with respect to , , 2), under their respective optimality conditions are given by

The following inferences may draw from Tables 2 and 3.

The above Table 2 depicts that(i)Considering the constant value of , , and , we observe that values of , , and decrease as increases. In other words, the intraclass correlation coefficient between y and z is inversely proportional to the new sample. That means the efficacy of the estimators under nonresponse depends positively on the auxiliary information. This attitude of the estimators is very important and extremely desirable.(ii)Considering the constant value of , , and , we observe that values of remain the same whereas and increase as increases.(iii)Considering the constant value of , , and , we observe that values of , , and increase as increases.(iv)Considering the constant value of , , and , we observe that values of , , and increase as increases. Table 3 depicts that nonresponse rate and sample size at current occasion are directly proportional and therefore the cost of the survey increases as well.(v)Considering the constant value of , , and , we observe that values of , , and decrease as increases. In other words, the intraclass correlation coefficient between y and z is inversely proportional to the new sample. That means the efficacy of the estimators under nonresponse depends positively on the auxiliary information. This attitude of the estimators is very important and extremely desirable.(vi)Considering the constant value of , , and , we observe that values of remain the same whereas and increase as increases.(vii)Considering the constant value of , , and , we observe that values of , , and increase as increases.(viii)Considering the constant value of , , and , we observe that values of , , and increase as increases. In other words, the nonresponse rate and sample size at the current occasion are directly proportional; consequently, the cost of the survey increases as well.

6. Conclusions

In this paper, we have proposed exponential type estimators for estimating the population mean under the unavailability of full response in two-occasion successive sampling using the additional information. An extensive theoretical and computational study led to the conclusion that both the proposed estimators are better than the usual estimators available in the literature, and moreover, the estimator is better than the estimator between the self-comparison. Therefore, the authors recommend the use of estimator over the other estimators considered in this paper in practice. Furthermore, the researchers can use the composition of the proposed estimators for their use [20, 21].

Data Availability

No external data were used to support this study.

Conflicts of Interest

The authors declare that there are no conflicts of interest.