Principles of sample size calculation

In most areas in life, it is difficult to work with populations and hence researchers work with samples. The calculation of the sample size needed depends on the data type and distribution. Elements include consideration of the alpha error, beta error, clinically meaningful difference, and the variability or standard deviation. The final number arrived at should be increased to include a safety margin and the dropout rate. Over and above this, sample size calculations must take into account all available data, funding, support facilities, and ethics of subjecting patients to research.


Elements in Sample Size Calculation
Sample size calculations begin with an understanding of the type of data and distribution we are dealing with.Very broadly, data are divided into quantitative (numerical) and categorical (qualitative) data.For the former, information on the mean responses in the two groups' u1 and u2 are required as also the common standard deviation for the two groups.For categorical data, p1 and p2 or information on proportions of successes in the two groups is needed.This information is usually obtained either from the published literature, a pilot study, or at times guesstimated.The other two key components are the alpha and beta error.Because the estimated sample size represents the minimum number of subjects required for the study, a "safety factor" should be added.The size of the safety factor is again an educated guess.Additions for drop-outs/attrition during the course of the study should also be made.Apart from this, an understanding of whether the data are normally distributed (follows the Gaussian or bell-shaped curve) or otherwise is also needed. [2]derstanding of Key Terms [3] The calculation of sample size based on power considerations requires that an investigator specify the points given below.The first three items are under the control of the investigator: The size of the effect that is clinically worthwhile to detect (d).This for numerical data is the difference between u1 and u2 for quantitative data and p1 and p2 for categorical data.This is also called the clinical meaningful difference, which will make the physician change his or her practice.

The probability of falsely rejecting a true null hypothesis (α-error).
This is also called the false positive error and is the probability of finding a difference where none exists.This error is perceived to be the more dangerous of the two errors, since it can impact clinical practice.It is also called the regulator's error.The alpha error is linked to the P-value or probability value and is conventionally set at 5%.

The probability of failing to reject a false null hypothesis (β-error).
This is also called the false negative error and is the probability of NOT finding a difference when one actually exists.This is conventionally set either at 10% or 20% and is also called the investigator's error.
The standard deviation of the population being studied (SD or σ).This is the variability or spread associated with quantitative data.
Standard values of the alpha and beta error are given in the solved examples and can be found in most statistics books.The examples below can be solved by hand using simple or scientific calculators.The website and the pdf file, http://www.idfbridges.org/files/BRIDGES-sample-size-calculation-andexample-of-budget.pdf,provide an easy tool of how to use online sample size calculators.

Few Solved Examples
(A) Sample size for one mean, normal distribution Problem: An emergency medicine physician wants to know if the mean heart rate after a particular type of trauma differs from the healthy population rate of 72 beats/min.He considers a mean difference of 6 beats/min to be clinically meaningful.He also chooses 9.1 beats/min as the variation based on a previously published study.How many patients will be needed to carry out the study at 5% significance and 80% power?
In this example, the following data are given to us: • the size of the effect that is clinically worthwhile to detect Problem: A new treatment for hypertension is being compared with placebo.How many patients will be required at 90% power and 5% significance to detect an average difference of 5 mmHg between the Rx group and placebo group assuming a standard deviation (a measure of interpatient variability) to be 10 mm?
In this example, the following data are given to us: • the size of the effect that is clinically worthwhile to detect The sample size calculation should be done with the help of a statistician.However, the present article provides the basic understanding of the principles behind the sample size calculation.This would help in providing the required inputs to the clinicians while interacting with the statistician.
(d) = 5 mm • the probability of falsely rejecting a true null hypothesis (α) = 0.05, Z α = 1.96 • the probability of failing to reject a false null hypothesis (β) = 0.80, Z β = 1.282• the standard deviation of the population being studied (SD or σ) = 10 mm.n = 84 patients per group.(C)Sample size for two proportions, categorical dataThe BASIL study-bypass versus angioplasty: The statistical calculations were based on the 3-year survival value of 50% in the angioplasty and 65% in the bypass group.At 5% significance and 90% power, how many patients would be needed to detect a difference between the two groups?(Lancet 2005;366:1925-34).In this example, the following data are given to us: • the size of the effect that is clinically worthwhile to detect (d) = 15% or 0.15 • the probability of falsely rejecting a true null hypothesis (α) = 0.05, Z α = 1.96 • the probability of failing to reject a false null hypothesis (β) = 0.80, Z β = 1.282.p1 = 0.65, p2=.50, p = 0.575, i.e., there will be 233 patients per group.