On continuity correction for RSS-structured cluster randomized designs with binary outcomes

Ahn, Soohyun; Wang, Xinlei; Wang, Mumu; Lim, Johan

doi:10.1007/s40300-021-00226-5

Published: 28 January 2022

METRON, Volume 80, pages 383–397 (2022)

Soohyun Ahn¹, Xinlei Wang² (ORCID: orcid.org/0000-0002-8561-6511), Mumu Wang² & Johan Lim³

Abstract

Continuity correction is commonly used to improve inference for binary data when the event of interest is rare or the sample size is small. A standard approach to reducing the bias in logit estimation is to add a small constant to both the event and nonevent counts. The 0.5 adjustment is known to render the estimator unbiased up to order \(K^{-1}\), where \(K\) is the size of a simple random sample. However, for general designs beyond simple random sampling, the bias in estimating the logit no longer vanishes at order \(K^{-1}\). In this paper, we derive the correction factor that makes the first-order term of the bias vanish for general designs. We then apply it to estimate the logit when the data come from ranked set sampling (RSS) embedded in a cluster randomized design (CRD). An RSS-structured CRD (RSS-CRD), introduced by Wang et al. (J Am Stat Assoc 111: 1576–1590, 2016), is a new two-stage design for more efficient estimation of treatment effects. We propose two methods to estimate the correction factors derived for RSS-CRDs. We numerically compare the proposed methods with those using the default factor 0.5 in terms of bias and mean squared error for estimating the treatment effect, and finally make recommendations for practitioners.
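
As a minimal illustration of the simple-random-sampling case mentioned in the abstract (not the RSS-CRD correction factors derived in the paper, whose formulas are not reproduced in this preview), the following Python sketch computes the exact bias of the corrected empirical logit \(\log\{(X + c)/(K - X + c)\}\) when \(X\) is binomial with \(K\) trials and event probability \(p\). Under this binomial model a standard Taylor expansion gives a leading bias of \((c - 1/2)(1 - 2p)/\{K p (1 - p)\}\), which is why \(c = 0.5\) removes the \(K^{-1}\) term. The function names `corrected_logit` and `exact_bias`, and the values \(K = 50\) and \(p = 0.2\), are assumptions chosen only for illustration.

```python
from math import comb, log


def corrected_logit(x, k, c=0.5):
    """Empirical logit with constant c added to event and nonevent counts."""
    return log((x + c) / (k - x + c))


def exact_bias(k, p, c):
    """Exact bias of the corrected logit when X ~ Binomial(k, p).

    The expectation is computed by summing over all k + 1 outcomes,
    so no simulation error is involved. Requires c > 0 and 0 < p < 1.
    """
    true_logit = log(p / (1 - p))
    expected = sum(
        comb(k, x) * p**x * (1 - p) ** (k - x) * corrected_logit(x, k, c)
        for x in range(k + 1)
    )
    return expected - true_logit


if __name__ == "__main__":
    # Illustrative (assumed) setting: K = 50 units, event probability 0.2.
    for c in (0.25, 0.5, 1.0):
        print(f"c = {c:4.2f}  bias = {exact_bias(50, 0.2, c):+.5f}")
```

In this assumed setting the bias is negative for \(c = 0.25\), positive for \(c = 1\), and smallest in magnitude for \(c = 0.5\), consistent with the first-order argument. For designs other than simple random sampling, the abstract indicates that a different factor is needed.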

Availability of data and material

Supplemental material for this article is available online.

References

Anscombe, F.J.: On estimating binomial response relations. Biometrika 43, 461–465 (1956)
Bhaumik, D.K., Amatya, A., Normand, S.L.T., et al.: Meta-analysis of rare binary adverse event data. J. Am. Stat. Assoc. 107(498), 555–567 (2012)
Blyth, C.R., Still, H.A.: Binomial confidence intervals. J. Am. Stat. Assoc. 78, 108–116 (1983)
Bradburn, M.J., Deeks, J.J., Berlin, J.A., et al.: Much ado about nothing: a comparison of the performance of meta-analytical methods with rare events. Stat. Med. 26(1), 53–77 (2007)
Chen, H., Stasny, E.A., Wolfe, D.A.: Ranked set sampling for efficient estimation of a population proportion. Stat. Med. 24, 3239–3385 (2005)
Cox, D.R.: The Analysis of Binary Data. Methuen, London (1970)
Dumbgen, L., Zamanzade, E.: Inference on a distribution function from ranked set samples. Ann. Inst. Stat. Math. 72(3), 157–185 (2020)
Fleiss, J.H.: Statistical Methods for Rates and Proportions. Wiley, New York (1981)
Frey, J., Wang, L.: Edf-based goodness-of-fit tests for ranked-set sampling. Can. J. Stat. 42(3), 451–469 (2014)
Frey, J., Zhang, Y.: Testing perfect rankings in ranked-set sampling with binary data. Can. J. Stat. 45(3), 326–339 (2017)
Frey, J., Zhang, Y.: An algorithm with applications in ranked-set sampling. J. Stat. Comput. Simul. 88(3), 471–481 (2018)
Frey, J., Zhang, Y.: Robust confidence intervals for a proportion using ranked-set sampling. J. Korean Stat. Soc. 50, 1009–1028 (2021)
Gart, J.J., Pettigrew, H.M., Thomas, D.G.: The effect of bias, variance estimation, skewness and kurtosis of the empirical logit on weighted least squares analyses. Biometrika 72, 179–190 (1985)
Grizzle, J.E.: Continuity correction in the \(\chi^{2}\)-test for \(2\times 2\) tables. Am. Stat. 21, 28–32 (1967)
Haldane, J.B.S.: The estimation and significance of the logarithm of a ratio of frequencies. Ann. Hum. Genet. 20, 309–311 (1955)
Hamdan, M.A.: On the continuity correction. Technometrics 16(4), 631–632 (1974)
Li, L., Wang, X.: Meta-analysis of rare binary events in treatment groups with unequal variability. Stat. Methods Med. Res. (2017). https://doi.org/10.1177/0962280217721246
Newcombe, R.G.: Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat. Med. 17, 857–872 (1998)
Ozturk, O.: Combining ranking information in judgment post stratified and ranked set sampling designs. Environ. Ecol. Stat. 19, 73–93 (2012)
Ozturk, O.: Two-stage cluster samples with ranked set sampling designs. Ann. Inst. Stat. Math. 71(1), 63–91 (2019)
Ozturk, O., Balakrishnan, N.: An exact control-versus-treatment comparison test based on ranked set samples. Biometrics 65(4), 1213–1222 (2009)
Pearson, E.: The choice of statistical test illustrated on the interpretation of data classed in a \(2\times 2\) tables. Biometrika 34, 139–167 (1947)
Pirie, W.R., Hamdan, M.A.: Some revised continuity corrections for discrete distributions. Biometrics 28(3), 693–701 (1972)
Plackett, R.L.: The continuity correction in \(2\times 2\) tables. Biometrika 51, 327–337 (1964)
Sweeting, M., Sutton, A.J., Lambert, P.: What to add to nothing? Use and avoidance of continuity corrections in meta analysis of sparse data. Stat. Med. 23(9), 1351–1375 (2004)
Terpstra, J.: On estimating a population proportion via ranked set sampling. Biom. J. 46, 264–272 (2004)
Wang, X., Ahn, S., Lim, J.: Unbalanced ranked set sampling in cluster randomized studies. J. Stat. Plan 187, 1–16 (2017)
Wang, X., Lim, J., Stokes, L.: Using ranked set sampling with cluster randomized designs for improved inference on treatment effects. J. Am. Stat. Assoc. 111, 1576–1590 (2016)
Wang, X., Wang, M., Lim, J., et al.: Using ranked set sampling with binary outcomes in cluster randomized designs. Can. J. Stat. 48(3), 342–365 (2020)
Yates, F.: Contingency tables involving small numbers and the \(\chi^{2}\) test. J. R. Stat. Soc. 1, 217–235 (1934)
Zamanzade, E., Mahdizadeh, M.: Using ranked set sampling with extreme ranks in estimating the population proportion. Stat. Methods Med. Res. 29(1), 165–177 (2020)
Zamanzade, E., Wang, X.: Estimation of population proportion for judgment post-stratification. Comput. Stat. Data Anal. 112, 257–269 (2017)

Funding

This work was supported by the National Research Foundation of Korea (Grant nos. NRF-2017R1D1A1B03032073 and NRF-2019R1F1A1056779).

Author information

Authors and Affiliations

Department of Mathematics, Ajou University, Suwon, Korea
Soohyun Ahn
Department of Statistical Science, Southern Methodist University, Dallas, TX, 75275, USA
Xinlei Wang & Mumu Wang
Department of Statistics, Seoul National University, Seoul, 08826, Korea
Johan Lim

Corresponding author

Correspondence to Xinlei Wang.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Ahn, S., Wang, X., Wang, M. et al. On continuity correction for RSS-structured cluster randomized designs with binary outcomes. METRON 80, 383–397 (2022). https://doi.org/10.1007/s40300-021-00226-5

Received: 18 December 2020
Accepted: 09 December 2021
Published: 28 January 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s40300-021-00226-5

Mathematics Subject Classifications

Primary 62D05 Secondary 62F10
