A Review on Sample Size Determination for Cronbach’s Alpha Test: A Simple Guide for Researchers

Background Reliability studies are commonly used in questionnaire development studies and questionnaire validation studies. This study reviews the sample size guideline for Cronbach’s alpha test. Methods Manual sample size calculation using Microsoft Excel software and sample size tables were tabulated based on a single coefficient alpha and the comparison of two coefficients alpha. Results For a single coefficient alpha test, the approach by assuming the Cronbach’s alpha coefficient equals to zero in the null hypothesis will yield a smaller sample size of less than 30 to achieve a minimum desired effect size of 0.7. However, setting the coefficient of Cronbach’s alpha larger than zero in the null hypothesis could be necessary and this will yield larger sample size. For comparison of two coefficients of Cronbach’s alpha, a larger sample size is needed when testing for smaller effect sizes. Conclusions In the assessment of the internal consistency of an instrument, the present study proposed the Cronbach’s alpha’s coefficient to be set at 0.5 in the null hypothesis and hence larger sample size is needed. For comparison of two coefficients’ of Cronbach’s alpha, justification is needed whether testing for extremely low and extremely large effect sizes are scientifically necessary.


Introduction
Cronbach's alpha is a measure of the internal consistency or reliability between several items, measurements or ratings. In other words, it estimates how reliable are the responses of a questionnaire (or domain of a questionnaire), an instrumentation or rating evaluated by subjects which will indicate the stability of the tools. Alpha was developed by Cronbach (1), which was originally used to measure the reliability of a psychometric instrument. The value of Cronbach's alpha ranges from zero to one with the higher values implying the items are measuring the same dimension. In contrary, if the Cronbach's alpha value is low (near to 0), it means some or all of the items are not measuring the same dimension (2-3).

Cronbach's Alpha Test
Discussion on Cronbach's alpha test in terms of concept, applications, statistical test and sample size determination have been widely discussed in the literatures (7)(8)(9)(10)(11). Some of the discussions related to sample size are summarised in Table 1 (6,(12)(13)(14)(15)(16). The present study emphasizes the review of sample size determination for Cronbach's alpha and focuses on sample size determination introduced by Bonett (6,12).

Hypothesis Testing for a Single Cronbach's Alpha Test
The null hypothesis is H 0 : CA0 = CA1 and the possible selection of alternative hypotheses are: H a : CA0 ≠ CA1 (this option yields a two-tailed test) H a : CA0 < CA1 (this option yields a one-tailed test) H a : CA0 > CA1 (this option also yields a one-tailed test) CA0 refers to the value of Cronbach's alpha in null hypothesis and CA1 refers to the value of Cronbach's alpha in alternative hypothesis. Most commonly CA0 is assumed and set at 0 in which smaller sample sizes are derived since the gap between CA0 and CA1 is larger. Setting up the CA0 as not equal to 0 is uncommon in research but it is very useful in testing and comparing the hypothesis of two different alpha coefficients (13).

Hypothesis Testing for Comparison of Two Cronbach's Alpha Test
The null hypothesis is H 0 : ρ a = ρ b and the possible selection of alternative hypotheses are: H a : ρ a ≠ ρ b (this option yields a twotailed test) H a : ρ a < ρ b (this option yields a onetailed test) H a : ρ a > ρ b (this option also yields a one-tailed test) ρ a refers to the value of Cronbach's alpha in population "a" and ρ b refers to the value of Cronbach's alpha in population "b".
For example, the Depression Anxiety and Stress Scale (DASS-21) that is used to measure the magnitude of respondents' stress level condition for over the past week, has seven items measured by using a four-point Likert scale (ranging from zero to three) (4). To ascertain whether the items are reliable in measuring the same dimension, a test for Cronbach's alpha may be used. The Cronbach's alpha test is usually applied to test the consistency and stability of the questionnaires which measure latent variables. Although Cronbach's alpha test may be applied in situations other than questionnaire development or validation, there is limited literature of its application in such scenarios.
Cronbach's alpha has been applied in research to develop clinical tool. For instance, a study by Berg et al. (5) developed an instrument to measure the balancing ability in elderly patients. The study used a scale consisting of 14 movements of patients such as sitting unsupported, change of position and etc. The movements were evaluated by five physical therapists and they were given a score from zero to four for each movement. The high degree of internal consistency in this study showed that the scale measures the same dimension One of the common issues in inferential studies is to determine the sufficient sample size. The lack of knowledge in sample size determination and unfamiliarity with sample size softwares are usually the challenges encountered by researchers especially among those who do not have adequate knowledge in statistics. Sufficient sample size is needed so that research conducted can provide reliable and reproducible evidence that can detect the desired consistency or stability of an instrument (questionnaire). It is an important aspect in research to avoid lack of power of the test due to underestimation of sample size and also prevent the waste of resources due to overestimation of sample size (6).
The purpose of this present paper is to provide a simple guide for medical researchers to plan sample size estimation for their studies that involve Cronbach's alpha test. The sample sizes are determined and presented in the form of tables to estimate the minimum number of samples needed to obtain the desired value of Cronbach's alpha. Following that, the discussion of this paper emphasizes on the application of the formula and the sample size tables. test or one-tailed test settings. In this paper, the more-commonly used two-tailed test was chosen with the aim to detect the difference between the two Cronbach's alpha. There are three things that need to be considered when determining the sample size for Cronbach's alpha test: the number of items or raters (k), the value of Cronbach's alpha at null hypothesis (CA0) and the expected value of Cronbach's alpha (CA1). The value for CA0 and CA1 could be any value ranges from −1 to 1, however CA1 should not be equal to CA0 (CA1 ≠ CA0).
In order to illustrate the above formula with an example; there are 15 items in a questionnaire of which the reliability of its measurements need to be measured (CA0 and CA1 are identified at 0.0 and 0.7, respectively). Power is set at 90% and the value of alpha at 0.05. The minimum sample size required based on formula (i) and (ii) is as shown below:

Sample Size Calculation Based on Formula by Bonett (6)
Sample size for Coefficient alpha or Cronbach's alpha was calculated using Microsoft Excel. The formulation was from Bonett (6) based on formula given; Sample size was calculated based on power of 0.80 and 0.90 (Power = 1-β) while probability of type I error (α) was set at 0.05 at all time. There are options on whether to use two-tailed This paper suggests the sample size of 30 is sufficient on condition that first (largest) eigenvalue obtained from Principal Component Analysis (PCA) of the sample data set is higher than 6.00. However, if that first eigenvalues are between 3.00 and 6.00, then required minimum sample size is 100. 4 Cronbach's alpha reliability: Interval estimation, hypothesis testing, and sample size planning The results of a simulation study demonstrated that the proposed method performed better than alternative methods.

5.
Statistical methods-scale reliability analysis with small samples, Birmingham City University, Centre for Academic Success Samuels P Provides guideline for reliability analysis considering small samples.

6.
The RCSI sample size handbook: a rough guide This study summarized that sample size of 30 can measure reliability using Cronbach's alpha considering the scale items have strong correlations.
ten items in a questionnaire in group 1 and 2 for which the reliability of its measurements need to be measured (ρ a and ρ b are identified at 0.3 and 0.7, respectively). Power is set at 90% and the value of alpha at 0.05. The minimum sample size required based on formula (iii) and (iv) is as shown below: Calculations:

Results
Based on pre-specified alpha, power and effect size, the minimum sample size requirements are shown in Table 2 to Table 3. The pre-specified alpha and power are usually known (0.05 and 80.0% or 90.0%, respectively) and thus, the minimum sample size required is mainly determined by the value of k and the differences of the coefficients in null hypothesis and alternative hypothesis. The minimum requirement for sample size remains relatively unchanged when the value of k increases. Larger sample size was observed when the differences between the coefficients in the null and alternative hypotheses were smaller.
For one coefficient of Cronbach's alpha test, if the coefficient of Cronbach's alpha equals to zero in the null hypothesis, it is possible to derive a smaller sample size of less than 30 to achieve a minimum desired effect size of 0.7 or more (Table 2). However, setting the coefficient of Cronbach's alpha larger than zero (e.g., CA0 = 0.50) in the null hypothesis could be necessary in some cases and this will yield a larger sample size. (Table 2). For a comparison of two coefficients of Cronbach's alpha in different Calculations: 17.53 18 n . = Therefore, the minimum sample size required for this case study is approximately 18 samples.

Sample Size Calculation Based on Formula by Bonett (12)
Sample size for comparing two Coefficient alpha or Cronbach's alpha was also calculated using Microsoft Excel. The formulation was from Bonett (12) based on formula given; Sample size was calculated based on power of 0.80 and 0.90 (Power = 1 -β) while probability of type I error (α) was set at 0.05 at all time. There are options on whether to use two-tailed test or one-tailed test settings. In this paper, the more-commonly used two-tailed test was chosen with the aim to detect the difference between the two Cronbach's alpha. There are four things that need to be considered when determining the sample size for Cronbach's alpha test: the number of items or raters in group one (k 1 ), the number of items or raters in group two (k 2 ), the value of Cronbach's alpha at null hypothesis or group one (ρ a ) and the expected or group two value of Cronbach's alpha (ρ b ). The value of k 1 and k 2 are set to be similar and present as k. The value for ρ a and ρ b could be any value ranges from −1 to 1, however ρ b should not be equal to ρ a (ρ b ≠ ρ a ). In order to illustrate the above formula with an example; there are target a high coefficient of Cronbach's alpha, however, it is impractical to do so as there is a possibility that it may not be achieved in an actual study. It is more reasonable to target a desired Cronbach's alpha between 0.7 to 0.8 with an acceptable sample size.
An example of a sample size statement is as followed: "The aim of the study is to evaluate the internal consistency of particular questionnaire "A". Questionnaire "A" has 15 items with a fivepoint Likert scale for every item. The coefficient of Cronbach's alpha in the null hypothesis and alternative hypothesis are assumed to be equal to 0.0 and 0.7, respectively. Based on alpha value fixed at 0.05, the minimum sample size requirement is 14 in order to achieve power of 80.0%. The calculation is based on the formula introduced by Bonett (6)." If the questionnaire has a different number of items, researchers will need to calculate the sample size based on the formula in Equation (i).

Sample Size Determination for a Single Coefficient of Cronbach's Alpha Test when CA0 is Assumed to be More than Zero
Hypothesis testing when CA0 > 0 is rare in literature. However, the concept of testing the null hypothesis with CA0 > 0 may be useful in cases when researchers set a high target of internal consistency of an instrument. Some statistical soft-wares do not provide statistical test to assess internal consistency when CA0 is assumed to be larger than zero. However, with the sample size calculation, researchers can calculate and determine the sufficient sample size to assess the internal consistency when CA0 is assumed to be larger than zero.
For example, when k = 3 and CA0 = 0.50, based on alpha < 0.05 and power of at least 80.0%, a minimum sample size of 31 is sufficient to detect CA1 at 0.80. A larger minimum sample size is needed (n = 93) to detect CA1 at 0.70 (Table 2). An example of a sample size statement is as followed: "The aim of the study is to determine whether a particular questionnaire "B" has high magnitude of internal inconsistency. Questionnaire "B" has 10 items with a 5-point Likert scale for every item. The null hypothesis has set the coefficient of Cronbach's alpha is at 0.50 while in the alternative hypothesis, the coefficient of Cronbach's alpha is set at 0.80. Based on alpha of 0.05, the minimum sample size requirement is 23 to be able to detect at least 80.0% power of the test. The calculation is based on the formula introduced by Bonett (6)." groups, a larger sample size is needed to detect smaller effect size when the coefficients in the null hypothesis (ρ a ) and alternative hypothesis (ρ b ) are nearly about the same (Table 3).

Discussions
For a single coefficient of Cronbach's alpha test, the application of the sample size tables can be used into two situations, which are sample size determination when CA0 can be assumed equal to zero and sample size determination when CA0 is assumed to be more than zero.

Sample Size Determination for a Single Coefficient of Cronbach's Alpha Test when CA0 can be Assumed Equal to Zero
In the questionnaire reliability studies, the coefficient of Cronbach's alpha in the null hypothesis (CA0) is always assumed to be equaled to zero, implying that researchers assumed that there is no internal consistency of the whole items or for a particular domain of a questionnaire. Then, the coefficient of Cronbach's alpha in the alternative hypothesis (CA1) will assume that there is a magnitude of internal consistency (either low, moderate or strong) of the questionnaire or a particular domain of a questionnaire. The minimum sample size requirement depends on the desired magnitude of internal consistency in the alternative hypothesis.
Higher differences of Cronbach's alpha coefficients in the null hypothesis and alternative hypothesis will yield the lower sample size. Based on Table 2, in order to achieve an acceptable Cronbach's alpha of 0.7, when the values of alpha and power are fixed at 0.05 and 80.0%, respectively, the sample size ranges from four (k = 100 and CA0 = 0.0 versus CA1 = 0.95) to 52 (k = 3 and CA0 = 0.0 versus CA1 = 0.5), depending on the number of k (number of items). Hence, Cronbach's alpha test is suitable for conducting pilot studies when the CA0 is usually assumed to be zero. In studies related to questionnaire development and validation, Cronbach's alpha is a common indicator for internal consistency and has always been evaluated in the pilot studies (17)(18)(19)(20).
Based on the calculation, it is possible that the sample size can be as small as four with assumption of a very high internal consistency such as 0.95 or more (k = 100 and CA0 = 0.0 versus CA1 = 0.95). Although researchers may       The present article recommends testing hypothesis with CA0 = 0.5 is necessary to test whether an instrument has an excellent internal consistency. However, it is a choice by the researchers to set the value of CA0, including setting CA0 as equals to zero. But, solely targeting a large difference of Cronbach's alphas between CA0 and CA1 to get a lower sample size is not recommended. Therefore, justification is needed whether testing such effect size is scientifically necessary.

Sample Size Determination when to Compare Two Cronbach's Alpha in Two Different Groups
Hypothesis testing to compare two Cronbach's alphas in two different groups is also rare in literature. However, this hypothesis testing is useful if the researcher aims to compare two coefficients of Cronbach's alphas in two different groups. Although some statistical softwares might not provide statistical test to assess such hypothesis, researchers can calculate and determine sufficient sample size for the planned study to compare two Cronbach's alphas in two different groups.

Conflicts of Interest
None.

Funds
None. For example, when k = 3 and ρ a = 0.50, based on alpha = 0.05 and power = 80.0%, a minimum sample size of 183 is sufficient to detect ρ b at 0.70. A lower minimum sample size is needed (n = 59) to detect ρ b at 0.80 (Table 2). An example of a sample size statement is as followed: "The aim of the study is to determine whether the internal consistency of particular questionnaire "B" in group "two" is higher than in group "one". Questionnaire "B" has 15 items with a 5-point Likert scale for every item. The coefficient of Cronbach's alpha in group "one" was reported at 0.70 while the coefficient of Cronbach's alpha in group "two" is hypothesised to be at least 0.80. Based on alpha of 0.05, the minimum sample size requirement is 207 to be able to detect at least 80.0% power of the test. The calculation is based on the formula introduced by Bonett (12)."

Conclusion
This article provides sample size tables and simple guides for sample size estimation for Cronbach's alpha. This guideline is useful for researchers especially in medical field where the recommended tables will ease them in estimating sample size for reliability studies. For one coefficient of Cronbach's alpha test, the assumption of "the CA0 equals to zero" will usually yield a smaller sample size, and that is appropriate for pilot studies. However, to ensure an instrument has an excellent internal consistency, testing the hypothesis with CA0 larger than zero (CA0 > 0), such as CA0 = 0.5 is recommended.
Besides that, the calculation of sample size based on comparing two Cronbach's alpha values in two different groups is also presented. Justifications are needed to decide whether the testing for extremely low or extremely large effect sizes are scientifically necessary. Hopefully, this guideline could help the researchers in sample size estimation for their researches to assess the reliability of an instrument.