G*Power Sample Size Calculator

Calculate the required sample size for your statistical analysis using G*Power parameters

Statistical Test Type

Effect Size (Cohen’s d, f, or w) Typical values: Small (0.2), Medium (0.5), Large (0.8)

Significance Level (α)

Statistical Power (1 – β) Typically 0.8 (80%) or 0.9 (90%)

Tail(s)

One-tailed

Two-tailed

Number of Groups

Number of Predictors

Calculation Results

Required Sample Size per Group:

Total Sample Size:

Statistical Power Achieved:

Critical t-value:

Non-centrality Parameter:

Comprehensive Guide: How to Calculate Sample Size Using G*Power

Determining the appropriate sample size is a critical step in research design that directly impacts the validity, reliability, and generalizability of your study findings. G*Power is a free, widely-used statistical power analysis program developed by researchers at Heinrich-Heine-Universität Düsseldorf that helps researchers calculate sample sizes, power, and effect sizes for various statistical tests.

Why Sample Size Calculation Matters

Statistical Power: Ensures your study has sufficient power (typically 80% or higher) to detect true effects
Resource Allocation: Helps optimize budget and time by avoiding overly large or insufficient samples
Ethical Considerations: Prevents exposing more participants than necessary to research procedures
Publication Requirements: Most journals require power analyses in research proposals

Key Concepts in Power Analysis

Before using G*Power, understand these fundamental concepts:

Effect Size (d, f, w, etc.): The magnitude of the difference or relationship you expect to find. Cohen (1988) provided benchmarks:
- Small effect: d = 0.2, f = 0.1, w = 0.1
- Medium effect: d = 0.5, f = 0.25, w = 0.3
- Large effect: d = 0.8, f = 0.4, w = 0.5
Significance Level (α): Probability of rejecting the null hypothesis when it’s true (Type I error). Common value is 0.05.
Statistical Power (1 – β): Probability of correctly rejecting the null hypothesis when it’s false. Target power is typically 0.8 or 0.9.
Tails: One-tailed tests look for effects in one direction; two-tailed tests look in both directions.

Step-by-Step Guide to Using G*Power for Sample Size Calculation

Step	Action	Example for t-test
1	Select Test Family	t-tests
2	Select Statistical Test	Means: Difference between two independent means (two groups)
3	Select Type of Power Analysis	A priori: Compute required sample size
4	Set Tails	Two-tailed (most common)
5	Enter Effect Size	0.5 (medium effect)
6	Set α Err Prob	0.05
7	Set Power	0.80
8	Set Allocation Ratio	1 (equal group sizes)
9	Calculate	Click “Calculate”

Common Statistical Tests and Their Parameters

Test Type	Effect Size Measure	Typical Medium Effect	When to Use
Independent Samples t-test	Cohen’s d	0.5	Compare means between two independent groups
Paired Samples t-test	Cohen’s dz	0.5	Compare means from the same group at different times
One-way ANOVA	Cohen’s f	0.25	Compare means among 3+ independent groups
Chi-square Test	Cohen’s w	0.3	Test relationships between categorical variables
Pearson Correlation	r	0.3	Measure linear relationship between two continuous variables
Linear Regression	Cohen’s f²	0.15	Predict continuous outcome from one or more predictors

Practical Example: Calculating Sample Size for a Clinical Trial

Let’s walk through a concrete example of calculating sample size for a randomized controlled trial comparing a new drug to a placebo:

Research Question: Does Drug X reduce blood pressure more than placebo?
Test Type: Independent samples t-test (two groups)
Effect Size: Based on pilot data, we expect a medium effect (Cohen’s d = 0.5)
Significance Level: α = 0.05 (standard)
Power: 0.80 (80% chance of detecting the effect if it exists)
Tails: Two-tailed (we’re interested in any difference, not just drug being better)

Entering these parameters into G*Power yields:

Required sample size per group: 64 participants
Total sample size: 128 participants (64 per group)
Critical t-value: 1.997
Non-centrality parameter: 2.828
Actual power: 0.803 (slightly above our target)

This means we need to recruit 128 participants total (64 in the drug group and 64 in the placebo group) to have an 80% chance of detecting a medium-sized effect if one truly exists, with only a 5% chance of falsely concluding there’s a difference when there isn’t (Type I error).

Advanced Considerations in Sample Size Calculation

While the basic calculation is straightforward, several advanced factors can influence your sample size requirements:

Attrition Rate: Account for participant dropout by increasing your sample size. If you expect 20% attrition, divide your required sample size by 0.8.
Unequal Group Sizes: If your groups will have unequal sizes, adjust the allocation ratio in G*Power (e.g., 2:1 instead of 1:1).
Covariates: For ANCOVA designs, including covariates can reduce required sample size by accounting for variance.
Cluster Designs: For cluster-randomized trials, you’ll need to account for intraclass correlation (ICC).
Multiple Comparisons: If performing multiple tests, consider adjusting your alpha level (e.g., Bonferroni correction).

Common Mistakes to Avoid

Researchers frequently make these errors in power analysis:

Overestimating Effect Sizes: Using effect sizes from published studies (which often overestimate true effects) can lead to underpowered studies. Consider using meta-analytic effect sizes when available.
Ignoring Attrition: Failing to account for participant dropout can leave you with insufficient power at the analysis stage.
Using One-tailed Tests Inappropriately: One-tailed tests should only be used when you’re certain about the direction of the effect and have strong theoretical justification.
Neglecting Assumption Checks: Power calculations assume your data will meet statistical assumptions (normality, homoscedasticity, etc.).
Not Reporting Power Analyses: Always document your power analysis in your methods section for transparency.

Alternative Tools and Resources

While G*Power is the most comprehensive free tool, several alternatives exist:

PASS Sample Size Software: Commercial software with extensive capabilities (NCSS)
PowerAndSampleSize.com: Free online calculators for basic tests
R packages: pwr, WebPower, and simr for simulation-based power analysis
Stata: Built-in power commands like power, sampsi, and poweroneway
SAS: PROC POWER procedure for comprehensive power analysis

Ethical Implications of Sample Size

The U.S. Department of Health & Human Services emphasizes that appropriate sample size is an ethical imperative in human subjects research. Key ethical considerations include:

Minimizing Harm: Sufficient power ensures participants aren’t exposed to research risks for an underpowered study that won’t yield meaningful results.
Scientific Validity: IRBs require studies to be scientifically valid, which includes adequate power.
Resource Allocation: Oversized studies waste resources and may expose unnecessary participants to research procedures.
Informed Consent: Participants should be informed about the study’s power and how it affects the likelihood of detecting true effects.

Frequently Asked Questions

Q: What if I can’t reach the calculated sample size?

A: If you can’t achieve the ideal sample size, you have several options:

Increase your effect size by using more sensitive measures or more extreme groups
Accept lower statistical power (but document this limitation)
Use a more efficient statistical test (e.g., ANCOVA instead of ANOVA)
Consider a qualitative or mixed-methods approach if quantitative power is insufficient

Q: How does sample size affect confidence intervals?

A: Larger sample sizes produce narrower confidence intervals, giving more precise estimates of population parameters. The margin of error in a confidence interval is inversely related to the square root of the sample size.

Q: Can I calculate sample size for non-parametric tests in G*Power?

A: G*Power primarily handles parametric tests. For non-parametric tests like Mann-Whitney U or Kruskal-Wallis, you can:

Use the equivalent parametric test in G*Power, then increase sample size by ~15% to account for reduced power of non-parametric tests
Use specialized software like PASS or simulation methods in R
Consult tables in non-parametric statistics textbooks (e.g., Siegel & Castellan)

Q: How do I calculate sample size for longitudinal studies?

A: For repeated measures or growth curve models:

Use G*Power’s “repeated measures ANOVA” option for simple designs
For complex models, use specialized software like Optimal Design or simulation in R
Account for attrition at each time point (e.g., if you expect 10% attrition at each of 3 time points, your initial sample needs to be ~35% larger)
Consider the correlation between repeated measures – higher correlations reduce required sample size

Conclusion and Best Practices

Calculating appropriate sample sizes using G*Power is a critical skill for researchers across disciplines. Remember these best practices:

Always perform a priori power analysis during study planning
Base effect sizes on pilot data or meta-analyses rather than guesses
Document all power analysis parameters in your methods section
Consider both statistical and practical significance – a statistically significant but tiny effect may not be meaningful
Re-evaluate power if your study design changes (e.g., adding predictors or measurement time points)
Use sensitivity analyses to explore how varying parameters affect required sample size
Consult with a statistician for complex designs or if you’re unsure about assumptions

By following these guidelines and using tools like G*Power effectively, you can design studies that are statistically rigorous, ethically sound, and resource-efficient. Proper sample size calculation is an investment that pays dividends in the quality and impact of your research findings.

How To Calculate Sample Size Using Gpower