Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Finite population distribution function estimation with dual use of auxiliary information under simple and stratified random sampling

  • Sardar Hussain,

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Writing – original draft

    Affiliation Department of Statistics, Quaid-I-Azam University, Islamabad, Pakistan

  • Sohaib Ahmad,

    Roles Methodology, Writing – original draft

    Affiliation Department of Statistics, Quaid-I-Azam University, Islamabad, Pakistan

  • Mariyam Saleem,

    Roles Writing – review & editing

    Affiliation Department of Statistics, Government College Univerisity, Lahore, Pakistan

  • Sohail Akhtar

    Roles Writing – review & editing

    s.akhtar@gcu.edu.pk

    Affiliation Department of Statistics, Government College Univerisity, Lahore, Pakistan

Abstract

The main purpose of this paper is to propose two new estimators for estimating the finite population distribution function under simple and stratified random sampling schemes using supplementary information on the distribution function, mean and ranks of the auxiliary variable. The mathematical expressions for the bias and mean squared error of the proposed estimators are derived under the first order of approximation. The theoretical and empirical studies showed that the proposed estimators uniformly perform better than the existing estimators in terms of the percentage relative efficiency.

1 Introduction

In survey sampling, the use of suitable auxiliary information improves the precision of estimators of the unknown population parameter(s). Several estimators of population parameters, including the population mean, median, total, distribution function, quantiles, etc., exist in the literature, and requires, supplementary information on one or more auxiliary variables along with the information on the study variable. A number of studies have been published on the estimation of the population mean. Some important references to the population mean estimation using auxiliary information include Murthy [1], Sisodia and Dwivedi [2], Srivastava and Jhajj [3], Rao [4], Upadhyaya and Singh [5], Singh [6], Kadilar and Cingi [7], Kadilar and Cingi [8], Gupta and Shabbir [9], Grover and Kaur [10], Grover and Kaur [11], Lu [12], Muneer et al. [13], Shabbir and Gupta [14], and Gupta and Yadav [15]. In these studies, the authors have proposed improved ratio, product, and regression type estimators for estimating the finite population mean. These authors have used a single auxiliary variable for the estimation procedure. In a recent study, Haq et al. [16] suggested using ranks of the auxiliary varible as an additional auxiliary variable to increase the precision of the estimator of the population mean in simple random sampling. However, to the best of our knowledge, there is no study concerning the use of two auxiliary information variables for the estimation of the finite population distribution function.

The problem of estimating the finite population cumulative distribution function (CDF) arises when the interest lies in finding out the proportion of the values of the study variable that are less or equal to a certain value. There are situations where estimating the CDF is deemed necessary. For example, for a nutritionist, it is interesting to know the proportion of the population that consumes 25% or more of the calorie intake from saturated fat. In the literature, many authors have estimated the CDF using information about one or more auxiliary variables. Chambers and Dunstan [17] suggested an estimator for estimating the CDF that requires information both on the study and auxiliary variables. Similarly, Rao et al. [18] and Rao [19] proposed ratio and difference/regression estimators for estimating the CDF under a general sampling design. Kuk [20] suggested a kernel method for estimating the CDF using the auxiliary information. Ahmed and Abu-Dayyeh [21] estimated the CDF using the information on multiple auxiliary variables. Rueda et al. [22] used a calibration approach to develop an estimator for estimating the CDF. Singh et al. [23] considered the problem of estimating the CDF and quantiles with the use of at the estimation stage of a survey. Moreover, Yaqub and Shabbir [24] considered a generalised class of estimators for estimating the CDF in the presence of non-response. Chen and Chen [25] investigated the injury severities of truck drivers in single-and multi-vehicle accidents on rural highways while Zeng et al. [26] worked on a multivariate random-parameter Tobit model for analysing highway crash rates by injury severity, and Yaqub and Shabbir [24] considered a generalised class of estimators for estimating the CDF in the presence of non-response. Dong et al. [27] investigated the differences of single-vehicle and multi-vehicle accident probability using a mixed logit model, Chen et al. [28] worked on an analysis of hourly crash likelihood using an unbalanced panel data mixed logit model and real-time driving environmental big data, Zeng et al. [29] suggested jointly modelling area-level crash rates by severity, and Zeng et al. [30] used spatial joint analysis for zonal daytime and night-time crash frequencies using a Bayesian bivariate conditional autoregressive model. However, these estimators only used one auxiliary variate.

In this paper, we propose two new families of estimators for estimating the CDF using the information on the distribution function, ranks, and mean of the auxiliary variable under simple random sampling and stratified random sampling. The bias and mean squared errors (MSEs) of the existing and proposed estimators of the CDF are derived under the first order of approximation. The theoretical and numerical comparisons showed that the proposed estimators are more precise than the existing adapted estimators when estimating the CDF of a finite population.

2 Notation in simple random sampling

Consider a finite population Ω = {1,2,…,N} of N distinct units. In order to estimate the finite population distribution function, a sample of size n units is drawn from Ω using simple random sampling without replacement. Suppose Y and X are the study and auxiliary variables, respectively. Let Z denote the ranks of X, I(Yy) is indicator variable based on Y and I(Xx) is indicator variable based on X. Similarly, and ( and ) are the population and sample distribution functions of Y (X), respectively. Let and ( and ) are the population and sample means of X (Z), respectively.

In order to obtain the biases and mean squared errors (MSEs) of the adapted and proposed estimators of F(y), we consider the following relative error terms. Let such that ei = 0 for i = 0,1,2,3, where E(⋅) is the mathematical expectation of (⋅). Let where λ = (Nn)/(nN), ,

,

,

,

C1 = S1/F(y), C2 = S2/F(x),

, , R12 = S12/(F(y)F(x)),

, , , , ,

,

,

,

.

In addition, let be the multiple correlation coefficient of I(Yy) on I(Xx) and X, and let be the multiple correlation coefficient of I(Yy) on I(Xx) and Z, under simple random sampling.

3 Adapted estimators in simple random sampling

In this section, some estimators of finite population mean are adapted for estimating the finite CDF under simple random sampling. The biases and MSEs of these adapted estimators are derived under the first order of approximation.

1. The traditional unbiased estimator of F(y) is (1)

The variance of is (2)

2. Cochran [31] adapted ratio estimator of F(y) is (3)

The bias and MSE of , to the first order of approximation, are (4)

If R12>C2/(2C1), then is better than in terms of MSE.

3. Murthy [32] adapted product estimator of F(y) is (5)

The bias and MSE of , to the first order of approximation, are (6)

If −C2/(2C1)>R12, then is better than in terms of MSE.

4. The adapted difference estimator of F(y) is (7) where k is an unknown constant. Here, is an unbiased estimator of F(y). The minimum variance of at the optimum value k(opt) = (F(y)V1100)/(F(x)V0200) is (8)

Here, (8) may be written as (9)

5. Rao [4] adapted difference-type estimator of F(y) is (10) where k1 and k2 are unknown constants. The bias and MSE of , to the first order of approximation, are (11)

The optimum values of k1 and k2, determined by minimizing (11), are

The minimum MSE of at the optimum values of k1 and k2 is (12)

Here, (12) may be written as (13)

6. Singh et al. [33] adapted generalized ratio-type exponential estimator of F(y) is (14) where a and b are known constants. The bias and MSE of , to the first order of approximation, are (15) where θ = aF(x)/(aF(x)+b).

7. Grover and Kaur [11] adapted generalized class of ratio-type exponential estimator of F(y) is (16) where k3 and k4 are unknown constants. The bias and MSE of , to the first order of approximation, are (17)

The optimum values of k3 and k4, determined by minimizing (17), are

The simplified minimum MSE of at the optimum values of k3 and k4 is (18)

Here, (18) may be written as (19) which shows that is more precise than .

4 Proposed estimators in simple random sampling

The precision of an estimator increases by using suitable auxiliary information at the estimation stage. In previous studies, the sample distribution function of the auxiliary variable was used to improve the efficiencies of the existing distribution function estimators. In a recent study, Haq et al. [16] suggested using ranks of the auxiliary variable as an additional auxiliary variable to increase the precision of an estimator of the population mean. On similar lines, we use additional auxiliary information on sample means of the auxiliary and ranked-auxiliary variables along with the sample distribution function estimators of F(y) and F(x) to estimate the finite CDF. For this purpose, we propose two families of estimators for estimating F(y).

4.1 First proposed family of estimators

For the first family of estimators, the sample mean of the auxiliary variable is used as an additional auxiliary variable; whilst in the second family of estimators, the sample mean of the ranked auxiliary variable is used as an additional auxiliary variable.

On the lines of and , first proposed family of estimators for estimating F(y) is given by (20) where k5, k6 and k7 are unknown constants, a(≠0) and b are either two real numbers or function of known population parameters of I(Xx), like R12, β2 (coefficient of kurtosis), C2, etc.

The estimator can also be written as (21)

Simplifying (21) and keeping terms only up to the second power of ei’s, we can write (22)

The bias and MSE of , to the first order of approximation, are (23)

The optimum values of k5, k6 and k7, determined by minimizing (23), are

The simplified minimum MSE of at the optimum values of k5, k6 and k7 is (24) where .

Here, (24) may be written as (25) where

It can be seen that is more precise than .

4.2 Second proposed family of estimators

On similar lines, second proposed family of estimators for estimating F(y) is given by (26) where k8, k9 and k10 are unknown constants, a(≠0) and b are either two real numbers or functions of known population parameters of I(Xx), like R12, β2 (coefficient of kurtosis), C2, etc. The estimator can also be written as (27)

Simplifying (27) and keeping terms only up to the second power of ei’s, we can write (28)

The bias and MSE of , to the first order of approximation, are (29)

The optimum values of k8, k9 and k10, determined by minimizing (29), are

The simplified minimum MSE of at the optimum values of k8, k9 and k10 is (30) where .

Here, (30) may be written as (31) where

It is clear that is more precise than .

In Table 1, we put some members of the Singh et al. [33], Grover and Kaur [11], and proposed families of estimators with selected choices of a and b.

thumbnail
Table 1. Some members of the adapted and proposed distribution function estimators.

https://doi.org/10.1371/journal.pone.0239098.t001

5 Efficiency comparisons in simple random sampling

In this section, the adapted and proposed estimators of F(y) are compared in terms of the minimum MSEs. [(i)]

1. From (2) and (25),

2. From (4) and (25),

3. From (6) and (25),

4. From (9) and (25),

5. From (13) and (25),

6. From (15) and (25),

7. From (19) and (25),

8. From (2) and (31),

9. From (4) and (31),

10. From (6) and (31),

11. From (9) and (31),

12. From (13) and (31),

13. From (15) and (31),

14. From (19) and (31),

The proposed families of estimators are always more precise than the adapted estimators as the above conditions (i)–(xiv) are always true.

6 Empirical study in simple random sampling

In this section, we conduct a numerical study to investigate the performances of the adapted and CDF estimators. For this purpose, five populations are considered. The summary statistics of these populations are reported in Table 2. The percentage relative efficiency (PRE) of an estimator with respect to is where i = 2,3,…,9.

The PREs of distribution function estimators, computed from five populations, are given in Tables 711.

Population I (Source: Singh, [6])

Y: Duration of sleep (in minutes) and

X: Age of old persons.

Population II (Source: Gujarati, [34])

Y: The eggs produced in 1990 (millions) and

X: The price per dozen (cents) in 1990.

Population III (Source: Murthy, [1])

Y: The output of the factory and

X: The number of workers.

Population IV (Source: Sarndal, [35])

Y: Population in 1983 (in million) and

X: Population in 1980 (in million).

Population V (Source: Koyuncu and Kadilar, [36])

Y: Number of teachers and

X: number of students.

From the numerical results, presented in Tables 37, it is observed that the PREs of all families of estimators change with the choice of a or b. It is further noted that the proposed families of estimators are more precise than the adapted distribution function estimators of Cochran [31], Murthy [32], Rao [4], Singh et al. [33] and Grover and Kaur [11], in terms of PRE. It can be seen that, for data sets I, III, IV and V, the second proposed family of the estimators perform better than the first proposed family of estimators, and for data set-II, it is also observed that the first proposed family of the estimators perform better than the second proposed family of estimators.

thumbnail
Table 3. PREs of distribution function estimators using Population I.

https://doi.org/10.1371/journal.pone.0239098.t003

thumbnail
Table 4. PREs of distribution function estimators using Population II.

https://doi.org/10.1371/journal.pone.0239098.t004

thumbnail
Table 5. PREs of distribution function estimators using Population III.

https://doi.org/10.1371/journal.pone.0239098.t005

thumbnail
Table 6. PREs of distribution function estimators using Population IV.

https://doi.org/10.1371/journal.pone.0239098.t006

thumbnail
Table 7. PREs of distribution function estimators using Population V.

https://doi.org/10.1371/journal.pone.0239098.t007

Here we take five data sets for numerical illustration. We selected different sample sizes from these populations and, then, we used simple random sampling. The MSEs (minimum) of the proposed families of estimators are pointed out in Eqs 25 and 31. Finally, the adapted estimators and proposed families of estimators were compared with each other with respect to their PRE values. These results are set out in Tables 37. In Table 2, we see the summary statistics about the populations. We can also note from the numerical results presented in Tables 37 that the PREs of all families of estimators change with the choice of a or b. It is further noted that the proposed families of estimators are more precise than the adapted distribution function estimators of Cochran [31], Murthy [32], Rao [4], Singh et al. [33] and Grover and Kaur [11], in terms of the PRE. It can be seen that, for data sets I, III, IV and V, the second proposed family of the estimators perform better than the first proposed family of estimators, and, for data set II, it is also observed that the first proposed family of the estimators performs better than the second proposed family of estimators.

7 Notation in stratified random sampling

Consider a finite population Ω = {1,2,…,N} of N distinct units, which is divided into L homogeneous strata, where the size of hth stratum is Nh, for h = 1,2,…,L, such that . Let Y and X be the study and auxiliary variables which take values yh and xh, respectively, where i = 1,2,…,Nh and h = 1,2,…,L, for estimating finite population distribution function, assume a sample of size nh is drawn from the th stratum using simple random sampling without replacement, such that , where n is the sample size.

Let and ( and ) be the population (sample) distribution functions of Y and X under stratified random sampling, respectively, where Wh = Nh/N, , , ,

. Let and ( and ) be the population (sample) means of X and Z under stratified random sampling, respectively, where , , , .

In order to obtain the biases and MSEs of the adapted and proposed estimators of F(y) under stratified random sampling, the following relative error terms are considered. Let such that E(ei) = 0 for i = 1,2,3,4, where E(⋅) is the mathematical expectation of (⋅). Let Where

8 Adapted estimators in stratified random sampling

In this section, some estimators of finite population mean are adapted for estimating the finite CDF under stratified random sampling. The biases and MSEs of these adapted estimators are derived under the first order of approximation.

1. The traditional unbiased estimator of F(y) is (32)

The variance of is (33)

2. Cochran [31] adapted ratio estimator of F(y) is (34)

The bias and MSE of , to the first order of approximation, are (35)

3. Murthy [32] adapted product estimator of F(y) is (36)

The bias and MSE of , to the first order of approximation, are (37)

4. The adapted difference estimator of F(y) is (38) where m is an unknown constant. Here, is an unbiased estimator of F(y). The minimum variance of at the optimum value m(opt) = (F(y)ψ1100)/(F(x)ψ0200) is (39)

Here, (39) may be written as (40)

5. Rao [4] adapted difference-type estimator of F(y) is (41) where m1 and m2 are unknown constants. The bias and MSE of , to the first order of approximation, are (42)

The optimum values of m1 and m2, determined by minimizing (42), are

The minimum MSE of at the optimum values of m1 and m2 is (43)

Here, (43) may be written as (44)

6. Singh et al. [33] adapted generalized ratio-type exponential estimator of F(y) is (45) where ast and bst are known constants. The bias and MSE of , to the first order of approximation, are (46) where Θ = astF(x)/(astF(x)+bst).

7. Grover and Kaur [11] adapted generalized class of ratio-type exponential estimator of F(y) is (47) where m3 and m4 are unknown constants. The bias and MSE of , to the first order of approximation, are (48)

The optimum values of m3 and m4, determined by minimizing (48), are

The simplified minimum MSE of at the optimum values of m3 and m4 is (49)

Here, (49) may be written as (50) which shows that is more precise than .

9 Proposed estimators in stratified random sampling

9.1 First proposed family of estimators

On the lines of and , first proposed family of estimators for estimating F(y) is given by (51) where m5, m6 and m7 are unknown constants, ast(≠0) and bst are either two real numbers or functions of known population parameters of I(Xx), like R12, (coefficient of kurtosis), , etc.

The estimator can also be written as (52)

Simplifying (52) and keeping terms only up to the second power of ζi’s, we can write (53)

The bias and MSE of , to the first order of approximation, are (54)

The optimum values of m5, m6 and m7, determined by minimizing (54), are

The simplified minimum MSE of at the optimum values of m5, m6 and m7 is (55) where .

Here, (55) may be written as (56) where

It can be seen that is more precise than

9.2 Second proposed family of estimators

On similar lines, second proposed family of estimators for estimating F(y) is given by (57) where m8, m9 and m10 are unknown constants, ast (≠0) and bst are either two real numbers or functions of known population parameters of I(Xx), like R12, (coefficient of kurtosis), , etc.

The estimator can also be written as (58)

Simplifying (58) and keeping terms only up to the second power of ζi’s, we can write (59)

The bias and MSE of , to the first order of approximation, are (60)

The optimum values of m8, m9 and m10, determined by minimizing (60), are

The simplified minimum MSE of at the optimum values of m8, m9 and m10 is (61) where .

Here, (61) may be written as (62) where respectively, which shows that is more precise than .

In Table 8, we put some members of the Singh et al. [33], Grover and Kaur [11], and proposed families of estimators with selected choices of a and b.

thumbnail
Table 8. Some members of the proposed and adapted estimators.

https://doi.org/10.1371/journal.pone.0239098.t008

10 Efficiency comparisons in stratified random sampling

In this section, the adapted and proposed estimators of F(y) are compared in terms of the minimum MSEs. [(i)]

1. From (33) and (56),

2. From (35) and (56),

3. From (37) and (56),

4. From (40) and (56),

5. From (44) and (56),

6. From (46) and (56),

7. From (50) and (56),

8. From (33) and (62),

9. From (35) and (62),

10. From (37) and (62),

11. From (40) and (62),

12. From (44) and (62),

13. From (46) and (62),

14. From (50) and (62),

The proposed families of estimators are always more precise than the adapted estimators as the above conditions (i)–(xiv) are always true.

11 Empirical study in stratified random sampling

In this section, we conduct a numerical study to investigate the performances of the adapted and proposed CDF estimators. For this purpose, five populations are considered. The summary statistics of these populations are reported in Tables 912. The PRE of an estimator with respect to is where i = 2,3,…,9.

The PREs of distribution function estimators, computed from four populations, are given in Tables 912.

Population I (Source: Koyuncu and Kadilar, [36])

Y: The number of teachers and

X: The number of students in both primary and secondary schools in Turkey in 2007 for 923 districts in six regions.

Population II (Source: Koyuncu and Kadilar, [36])

Y: The number of teachers and

X: The number of classes in both primary and secondary schools in Turkey in 2007 for 923 districts in six regions.

Population III (Source: Kadilar and Cingi, [37])

Y: Apple production amount in 1999 and

X: The number of apple trees in 1999.

Population IV (Source: Kadilar and Cingi, [37])

Y: Apple production amount in 1999 and

X: Apple production amount in 1998.

From the numerical results, presented in Tables 1316, it is observed that the PREs of all families of estimators change with the choices of a and b. It is further noted that the proposed families of estimators are more precise than the adapted distribution function estimators of Cochran [31], Murthy [32], Rao [4], Singh et al. [33] and Grover and Kaur [11], in terms of PRE. It can be seen that, for all data sets, the second proposed family of the estimators perform better than the first proposed family of estimators.

thumbnail
Table 13. PREs of distribution function estimators using Population I.

https://doi.org/10.1371/journal.pone.0239098.t013

thumbnail
Table 14. PREs of distribution function estimators using Population II.

https://doi.org/10.1371/journal.pone.0239098.t014

thumbnail
Table 15. PREs of distribution function estimators using Population III.

https://doi.org/10.1371/journal.pone.0239098.t015

thumbnail
Table 16. PREs of distribution function estimators using Population IV.

https://doi.org/10.1371/journal.pone.0239098.t016

We computed sample size in stratum h. Here we took sample sizes of 180, 180, 140 and 140 from four populations. Then we used stratified random sampling, and the MSEs (minimum) of the proposed families of estimators were computed in Eqs 56 and 62, respectively. Lastly, the adapted estimators and proposed families of estimators were compared with each other with respect to their PRE values. The PRE results are shown in Tables 1316. In Tables 912, we can observe the descriptive statistics regarding the populations, strata, and sample size. From the numerical results, presented in Tables 1316, it is observed that the PREs of all families of estimators change with the choices of a and b. It is further noted that the proposed families of estimators are more precise than the adapted distribution function estimators of Cochran [31], Murthy [32], Rao [4], Singh et al. [33] and Grover and Kaur [11], in terms of the PRE. It can be seen that, for all data sets, the second proposed family of estimators perform better than the first proposed family of estimators. We can also see a rise in the value of PREs when the value of a = 1; and b = R12; a = R12 and b = C2; a = R12 and b = β2(st), and see a slight fall in PREs when a = 1 and b = NF(x)).

12 Conclusion

In this paper, we have proposed two new families of estimators for estimating the finite population distribution function under simple and stratified random sampling schemes. The proposed estimators required supplementary information about the sample mean and the ranks of the auxiliary variable. The biases and MSEs of the proposed families of estimators were derived using the first order approximation. Based on theoretical and numerical comparative studies, it can be concludethat the proposed families of estimators are more precise than their existing counterparts. Thus, we recommend using the sample mean and the ranks of the auxiliary variable with the proposed families of estimators for estimating the finite population distribution function under simple or stratified random sampling. It would be interesting to extend the suggested estimators to two-phase and stratified two-phase sampling schemes. Furthermore, the proposed estimators could also be generalised by utilising information about multi-auxillary variables.

References

  1. 1. Murthy MN. Sampling Theory and Methods. Statistical Pub. 1967 Society,Calcutta.
  2. 2. Sisodia BV, Dwivedi VK. Modified ratio estimator using coefficient of variation of auxiliary variable. Journal-Indian Society of Agricultural Statistics. 1981.
  3. 3. Srivastava SK, Jhajj HS. A class of estimators of the population mean using multiauxiliary information. Calcutta Statistical Association Bulletin. 1983 Mar;32(1–2):47–56.
  4. 4. Rao TJ. On certail methods of improving ration and regression estimators. Communications in Statistics-Theory and Methods. 1991 Jan 1;20(10):3325–40.
  5. 5. Upadhyaya LN, Singh HP. Use of transformed auxiliary variable in estimating the finite population mean. Biometrical Journal: Journal of Mathematical Methods in Biosciences. 1999 Sep;41(5):627–36.
  6. 6. Singh S. Advanced Sampling Theory With Applications: How Michael"" Selected"" Amy. Springer Science & Business Media; 2003.
  7. 7. Kadilar C, Cingi H. Estimator of a population mean using two auxiliary variables in simple random sampling. International Mathematical Journal. 2004;5:357–67.
  8. 8. Kadilar C, Cingi H. Improvement in estimating the population mean in simple random sampling. Applied Mathematics Letters. 2006 Jan 1;19(1):75–9.
  9. 9. Gupta S, Shabbir J. On improvement in estimating the population mean in simple random sampling. Journal of Applied Statistics. 2008 May 1;35(5):559–66.
  10. 10. Grover LK, Kaur P. Ratio type exponential estimators of population mean under linear transformation of auxiliary variable: theory and methods. South African Statistical Journal. 2011 Jan 1;45(2):205–30.
  11. 11. Grover LK, Kaur P. A generalized class of ratio type exponential estimators of population mean under linear transformation of auxiliary variable. Communications in Statistics-Simulation and Computation. 2014 Jan 1;43(7):1552–74.
  12. 12. Lu J. Efficient estimator of a finite population mean using two auxiliary variables and numerical application in agricultural, biomedical, and power engineering. Mathematical Problems in Engineering. 2017 Jan 1;2017. pmid:29578548
  13. 13. Muneer S, Shabbir J, Khalil A. Estimation of finite population mean in simple random sampling and stratified random sampling using two auxiliary variables. Communications in Statistics-Theory and Methods. 2017 Mar 4;46(5):2181–92.
  14. 14. Shabbir J, Gupta S. Estimation of finite population mean in simple and stratified random sampling using two auxiliary variables. Communications in Statistics-Theory and Methods. 2017 Oct 18;46(20):10135–48.
  15. 15. Gupta RK, Yadav SK. Improved estimation of population mean using information on size of the sample. American Journal of Mathematics and Statistics. 2018;8(2):27–35.
  16. 16. Haq A, Khan M, Hussain Z. A new estimator of finite population mean based on the dual use of the auxiliary information. Communications in Statistics-Theory and Methods. 2017 May 3;46(9):4425–36.
  17. 17. Chambers RL, Dunstan R. Estimating distribution functions from survey data. Biometrika. 1986 Dec 1;73(3):597–604.
  18. 18. Rao JN, Kovar JG, Mantel HJ. On estimating distribution functions and quantiles from survey data using auxiliary information. Biometrika. 1990 Jun 1:365–75.
  19. 19. Rao JN. Estimating totals and distribution functions using auxiliary information at the estimation stage. Journal of Official Statistics. 1994 Jun 1;10(2):153.
  20. 20. Kuk AY. A kernel method for estimating finite population distribution functions using auxiliary information. Biometrika. 1993 Jun 1;80(2):385–92.
  21. 21. Ahmed MS. and Abu-Dayyeh W. Estimation of finite-population distribution function using multivariate auxiliary information. Statistics in Transition. 2001 5(3):501–507.
  22. 22. Rueda M, Martínez S, Martínez H, Arcos A. Estimation of the distribution function with calibration methods. Journal of statistical planning and inference. 2007 Feb 1;137(2):435–48.
  23. 23. Singh HP, Singh S, Kozak M. A family of estimators of finite-population distribution function using auxiliary information. Acta applicandae mathematicae. 2008 Nov 1;104(2):115–30.
  24. 24. Yaqub M, Shabbir J. Estimation of population distribution function in the presence of non-response. Hacettepe Journal of Mathematics and Statistics. 2018 Apr 1;47(2):471–511.
  25. 25. Chen F, Chen S. Injury severities of truck drivers in single-and multi-vehicle accidents on rural highways. Accident Analysis & Prevention. 2011 Sep 1;43(5):1677–88.
  26. 26. Zeng Q, Wen H, Huang H, Pei X, Wong SC. A multivariate random-parameters Tobit model for analyzing highway crash rates by injury severity. Accident Analysis & Prevention. 2017 Feb 1;99:184–91.
  27. 27. Dong B, Ma X, Chen F, Chen S. Investigating the differences of single-vehicle and multivehicle accident probability using mixed logit model. Journal of Advanced Transportation. 2018 Jan 1;2018. pmid:32908973
  28. 28. Chen F, Chen S, Ma X. Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. Journal of safety research. 2018 Jun 1;65:153–9. pmid:29776524
  29. 29. Zeng Q, Guo Q, Wong SC, Wen H, Huang H, Pei X. Jointly modeling area-level crash rates by severity: a Bayesian multivariate random-parameters spatio-temporal Tobit regression. Transportmetrica A: Transport Science. 2019 Nov 29;15(2):1867–84.
  30. 30. Zeng Q, Wen H, Wong SC, Huang H, Guo Q, Pei X. Spatial joint analysis for zonal daytime and nighttime crash frequencies using a Bayesian bivariate conditional autoregressive model. Journal of Transportation Safety & Security. 2020 Apr 20;12(4):566–85. pmid:27648455
  31. 31. Cochran WG. The estimation of the yields of cereal experiments by sampling for the ratio of grain to total produce. The journal of agricultural science. 1940 Apr;30(2):262–75.
  32. 32. Murthy MN. Product method of estimation. Sankhyā: The Indian Journal of Statistics, Series A. 1964 Jul 1:69–74.
  33. 33. Singh R, Chauhan P, Sawan N, Smarandache F. Improvement in estimating the population mean using exponential estimator in simple random sampling. International Journal of Statistics & Economics. 2009 3(A09):13–18
  34. 34. Gujarati DN. Basic econometrics. Tata McGraw-Hill Education; 2009.
  35. 35. Särndal CE. Methods for estimating the precision of survey estimates when imputation has been used. Survey methodology. 1992;18(2):241–52.
  36. 36. Koyuncu N, Kadilar C. Ratio and product estimators in stratified random sampling. Journal of statistical planning and inference. 2009 Aug 1;139(8):2552–8.
  37. 37. Kadilar C, Cingi HB. Ratio estimators in stratified random sampling. Biometrical Journal: Journal of Mathematical Methods in Biosciences. 2003 Mar;45(2):218–25.