Background: Effect sizes quantify the magnitude of group differences, yet hearing aid research still relies on Cohen’s benchmarks (0.20, 0.50, 0.80). These cutoffs are not field-specific and may misrepresent typical effects. Objective: To characterize the distribution of effect sizes in adult hearing aid research and use these data to estimate sample sizes required to achieve varying levels of statistical power. Methods: A systematic search of PubMed, CINAHL, and Embase identified English-language randomized controlled trials (RCTs) involving adults with mild-to-severe hearing loss using commercially available air-conduction hearing devices. Eligible outcomes included standardized self-reports and behavioral measures. Absolute Hedges’ g values were calculated, with the 25th, 50th, and 75th percentiles representing small, medium, and large effects. A priori power analyses estimated required sample sizes. Results: From 15,066 records, 33 trials (4,471 participants) met the inclusion criteria; of these, 17 trials provided 63 effect sizes. Across all outcomes, the 25th, 50th, and 75th percentiles indicating that Cohen’s standardized benchmarks may portray effects in hearing aid research as smaller than they typically are. Estimated sample sizes showed that few published studies met 80% power for a medium effect. Conclusions: The empirical distribution of effect sizes in hearing aid RCTs is shifted towards smaller numerical values relative to Cohen’s conventional benchmark. Therefore, using Cohen’s generic benchmarks makes an effect appear smaller than it is in the context of this field . We recommend using the effect sizes of 0.1, 0.2, and 0.5 for small, medium, and large effect sizes, respectively, when interpreting hearing aid trial results. Adoption of these empirically derived benchmarks will improve the accuracy of interpretation, guide more realistic sample size planning, and enhance the replicability of future trials.
Tried to make this a bit clearer too