Estimation of ﬁnite population mean using dual auxiliary variable for non-response using simple random sampling

: This paper addresses the issue of estimating the population mean for non-response using simple random sampling. A new family of estimators is proposed for estimating the population mean with auxiliary information on the sample mean and the rank of the auxiliary variable. Bias and mean square errors of existing and proposed estimators are obtained using the ﬁrst order of measurement. Theoretical comparisons are made of the performance of the proposed and existing estimators. We show that the proposed family of estimators is more e ﬃ cient than existing estimators in the literature under the given constraints using these theoretical comparisons.


Introduction
In survey sampling, the appropriate use of auxiliary information is known to enhance the accuracy of an estimator of the unknown population parameter. This information (auxiliary) can be used to select a random sample using SRSWR or SRSWOR. Auxiliary information gives us a sort of technique in terms of ratio, product, regression, and other methods, it is therefore necessary to have a representative part of the population, when the population of interest is more homogeneous, than simple random sampling can be used to select units. A considerable amount of work was done on estimating the population mean by simple random sampling, a number of important references include [2, 5, 7, 11-13, 16-20, 22-24, 26, 27, 30, 32] and the references cited therein, have suggested different types of estimators to estimate the population mean and population distribution function in the presence of non-response.
As a practical matter, one of the main problems with surveys is that they suffer from non-response, non-response has a lot of ways to happen. Examples are language problems, non-availability of response, incorrect return address and input from another person, censorship or clustering is a problem across several data. The statistician has recognized for quite few time that ignoring the stochastic nature of incompleteness or non-response may change the nature of the data. Several factors affect the non-response rate for a survey, some of these factors are the type of information collected, the official status of the investigating agency, the extent of the publicity, legal requirements of respondents, the duration of the enumerator's visit and the length of withdrawal period etc.
A great deal of work has been done on the estimation of the population mean to check non-response bias and increase efficiency of estimators by different authors. The issue of non-response in sample surveys is more common and prevalent in mail surveys than in special interview surveys. [10] was the first to address the issue of incomplete samples in the postal or telephone surveys. For certain related work, we refer to [1-3, 12-15, 17, 18, 20, 21, 23-26, 28, 31, 32] and the references cited therein.
On the line of [11] a new family of estimators is proposed for the estimation of population mean in the presence of non-response. We will prove theoretically and numerically that the proposed family of estimators is more precise than the existing estimators.
The rest of the paper is set out as follows: In Section 2, some notations are introduced by SRS with non-responding. In Section 3, the existing estimators examined for the two non-response situations. A new family of estimators is presented in Section 4 under both non-response situations using simple random sampling. The existing and proposed estimators are theoretically compared in Section 5. In Section 6, the existing and proposed family of estimators are compared numerically. Section 7 condenses the principal discovery and culminate the document.

Notations
Suppose Ω = {U 1 , U 2 , ..., U N } denotes be a finite population of N distinct units that is bisect into two groups, respondents and non-respondents, having sizes N 1 and N 2 , where N = N 1 + N 2 . Thus we denote Ω 1 = {U 1 , U 2 , ..., U N 1 } for the response group and Ω 2 = {U 1 , U 2 , ..., U N 2 } for the non-response group. In order to estimate the population mean, a sample of n is taken from the underlying population by simple random sampling without replacement (SRSWOR), and for which units n 1 are responding and n 2 = n − n 1 are not responding. It is also assumed that the sample size n 1 is drawn from the response group of Ω 1 and n 2 is drawn from the non-response group of Ω 2 . Moreover a sample of size r = n 2 /k units, where k > 1 is drawn by simple random sampling without replacement from n 2 , and the temporal response is obtained from all r units.
Let Y, X, Z, be the study, auxiliary and ranks of the auxiliary variable.
The population and sample mean of Y. X = N i=1 X i /N,X = n i=1 X i /n: The population and sample mean of X. Z = N i=1 Z i /N,Ẑ = n i=1 Z i /n: The population and sample mean of Z.
The population mean of Y for non-response group.
The population mean of X for non-response group.
The population mean of Z for non-response group. Y (1) = n 1 i=1 Y i /n 1 denote the sample mean based on n 1 responding units out of n units. X (1) = n 1 i=1 X i /n 1 denote the sample mean based on n 1 responding units out of n units. Z (1) = n 1 i=1 Z i /n 1 be the sample mean based on n 1 responding units out of n units. Y (2r) = r i=1 Y i /r be the sample mean based on r reacting units out of n 2 non-response units. X (2r) = r i=1 X i /r be the sample mean based on r reacting units out of n 2 non-response units. Z (2r) = r i=1 Z i /r denote the sample mean based on r reacting units out of n 2 non-response units.
variance of Y, X, and Z.
The population variance of Y, X, and Z for non-response group. (2) , C X(2) = S X(2) /X (2) , C Z(2) = S Y(2) /Z (2) : Be the population coefficient of variation of Y, X and Z for non-response group.
The population correlation coefficient between (Y, X), (Y, Z), and (X, Z) for non-response group.
The population coefficient of multiple determination of Y on X and Z for non-response group.
are unbiased estimators ofX andZ respectively under non-response with corresponding variances respectively. In order to obtain the properties of the proposed estimator, we consider the following relative error terms. Let , and for i = 1, 2. Where E(·) represents the mathematical expectation of (·). Let where r, s, t, u = 0, 1, 2. Here, Usually in case of non-response, two situations are more likely to happen, namely non-response on Y only (say Situation-I) and non-response on both Y, X and Z (say Situation-II).

Existing estimators
In this portion, some existing estimates of the population mean for non-response are briefly reviewed for both situations.

Situation-I
When non-response occurs in only one study variable, say Y (1) The estimator of the typical ratio of theȲ is given as: The properties ofŶ R , are given by: respectively.
(2) The typical product estimatorȲ is given as: The properties ofŶ * P , are given as: The typical difference estimator for theȲ is given as: The minimal variance ofŶ * D at d (opt) = (ȲV 110 )/(XV 020 ) is given as: Here in (3.6) can be written as: (4) Following [27], a difference-type estimator ofȲ iŝ The properties ofŶ * R,D , are given by: and By simplify Eq (3.10) the value of k 1 and k 2 , are given as: respectively. The minimal MSE ofŶ * R,D at the optimal values is given by: Equation (3.11) may be written as (3.12) (5) Following [4], is given as:Ŷ (3.14) The biases and MSEs ofŶ * BT,R andŶ * BT,P , are given as: and Following [29], a generalized ratio-type exponential estimator ofȲ iŝ The properties ofŶ * S , are given as: where θ = aX/(aX + b).
(7) Following [8], a generalized class of ratio-type exponential estimators ofȲ is given as: The properties ofŶ * GK , are given as: The optimum values of k 1 and k 2 determined by simplifying (23), are given as: The simplified minimum MSE ofŶ * GK at the optimum values of k 1 and k 2 is given by Here (3.20) may be written as (3.21)

Situation-II
When non response is occur in both study and auxiliary variables, say Y and X.
(7) Following [8], estimators ofȲ is given by: The properties ofŶ * * GK , are given by: The ideal values of k 1 and k 2 is expressing by (3.40), The minimal MSE ofŶ GK at the optimal values of k 1 and k 2 is given by: (3.41) Equation (3.41) may be written as: (3.42)

Proposed estimator in non-response using simple random sampling
The proper use of ancillary variable improve the accuracy of estimator in the design and estimation stages. Complete auxiliary information is frequently supplied along with the sample frame for social, economic, and natural surveys. When the study variable and the auxiliary variable have a sufficient amount of connection, the rankings of the auxiliary variable are also correlated with the values of the auxiliary variable. Consequently, The categorised auxiliary variable (which includes the auxiliary variable's rank) can be treated as a new auxiliary variable, and this information can help an estimator perform better. Because of We present an improved family of estimators for predicting the population mean that requires additional information on the study and auxiliary variable sample means, as well as the ranks of the auxiliary variable under non-response using simple random sampling.

Situation-I
When non-response occur only in study variable. On the lines of [11], the proposed improved estimator ofȲ in the presence of non-response using SRS, sayŶ * S uggested is given as: where w 1 , w 2 , and w 3 are unknown constant. The proposed estimatorŶ S uggested j can be rewritten aŝ Simplifying (4.2), we have The properties ofŶ * S uggested , are given as: The optimal values of w 1 , w 2 , and w 3 determined by minimizing (4.4), are The minimal MSE ofŶ * S uggested at optimal values of w 1 , w 2 and w 3 is given by: (4.5) Equation (4.5) me be written as

Situation-I
In this section, we performed a comparison of the adapted and proposed estimators, when nonresponse is available in the study variable.

Numerical investigation
In this section, the mathematical result is shown to verify the effectiveness of all estimators as compared to existing estimators. Four data sets are under consideration. The data description and mean square error are listed in Tables 2 and 3. The percent efficiency of estimatorŶ i w.r.tŶ S RS : where i = R, P, . . . , S ugeested. The MSEs and PREs of mean estimators, computed from two populations, are given in Tables 4-11. Population I. (Source: [9]) Y: The egg assemble in 1990, X: Value per dozen in 1991.  (2) 940.7629 S X (2) 19.53920 S Z (2) 3.605551 Population II. (Source: [9]) Y: Eggs assemble in 1990, X: Value per dozen in 1990. 940.7629 S X (2) 18.25925 S Z (2) 3.605551

Conclusions
In this paper, a new family of estimators for estimating the population mean with information on the auxiliary variable in the form of the sample mean and ranks of the auxiliary variable in the presence of non-response has been devised. The suggested family of estimators a mathematical expressions for biases and minimum MSEs have been generated up to the first order of approximation and compared both theoretically and numerically with the [6,10,22], the conventional difference, [8,27,29] estimators under Situation-I and Situation-II. It has been observed that the proposed family of estimators is more efficient in both non-response situations.