Statistical Analysis and Handling of Missing Data in Cluster Randomized Trials: a Systematic Review

Overview

Journal Trials

Publisher Biomed Central

Specialties General Medicine
Pharmacology

Date 2016 Feb 11

PMID 26862034

Citations 41

Authors

Mallorie H Fiero

Shuang Huang

Eyal Oren

Melanie L Bell

Affiliations

Soon will be listed here.

Abstract

Background: Cluster randomized trials (CRTs) randomize participants in groups, rather than as individuals and are key tools used to assess interventions in health research where treatment contamination is likely or if individual randomization is not feasible. Two potential major pitfalls exist regarding CRTs, namely handling missing data and not accounting for clustering in the primary analysis. The aim of this review was to evaluate approaches for handling missing data and statistical analysis with respect to the primary outcome in CRTs.

Methods: We systematically searched for CRTs published between August 2013 and July 2014 using PubMed, Web of Science, and PsycINFO. For each trial, two independent reviewers assessed the extent of the missing data and method(s) used for handling missing data in the primary and sensitivity analyses. We evaluated the primary analysis and determined whether it was at the cluster or individual level.

Results: Of the 86 included CRTs, 80 (93%) trials reported some missing outcome data. Of those reporting missing data, the median percent of individuals with a missing outcome was 19% (range 0.5 to 90%). The most common way to handle missing data in the primary analysis was complete case analysis (44, 55%), whereas 18 (22%) used mixed models, six (8%) used single imputation, four (5%) used unweighted generalized estimating equations, and two (2%) used multiple imputation. Fourteen (16%) trials reported a sensitivity analysis for missing data, but most assumed the same missing data mechanism as in the primary analysis. Overall, 67 (78%) trials accounted for clustering in the primary analysis.

Conclusions: High rates of missing outcome data are present in the majority of CRTs, yet handling missing data in practice remains suboptimal. Researchers and applied statisticians should carry out appropriate missing data methods, which are valid under plausible assumptions in order to increase statistical power in trials and reduce the possibility of bias. Sensitivity analysis should be performed, with weakened assumptions regarding the missing data mechanism to explore the robustness of results reported in the primary analysis.

Citing Articles

Estimating marginal treatment effect in cluster randomized trials with multi-level missing outcomes.

Chang C, Wang R Biometrics. 2024; 80(4).

PMID: 39656746 PMC: 11629964. DOI: 10.1093/biomtc/ujae135.

Influence of El Niño southern oscillation on precipitation variability in Northeast Thailand.

Chueasa B, Humphries U, Waqas M MethodsX. 2024; 13:102954.

PMID: 39315397 PMC: 11417571. DOI: 10.1016/j.mex.2024.102954.

Terrorism group prediction using feature combination and BiGRU with self-attention mechanism.

Abdalsalam M, Li C, Dahou A, Kryvinska N PeerJ Comput Sci. 2024; 10:e2252.

PMID: 39314736 PMC: 11419613. DOI: 10.7717/peerj-cs.2252.

Risk of bias assessment tool for systematic review and meta-analysis of the gut microbiome.

Lampeter T, Love C, Tang T, Marella A, Lee H, Oganyan A Gut Microbiome (Camb). 2024; 4:e13.

PMID: 39295908 PMC: 11406368. DOI: 10.1017/gmb.2023.12.

Assessing treatment effect heterogeneity in the presence of missing effect modifier data in cluster-randomized trials.

Blette B, Halpern S, Li F, Harhay M Stat Methods Med Res. 2024; 33(5):909-927.

PMID: 38567439 PMC: 11041086. DOI: 10.1177/09622802241242323.

References

Zeger S, Liang K . Longitudinal data analysis for discrete and continuous outcomes. Biometrics. 1986; 42(1):121-30. View

Wears R . Advanced statistics: statistical methods for analyzing cluster and cluster-randomized data. Acad Emerg Med. 2002; 9(4):330-41. DOI: 10.1111/j.1553-2712.2002.tb01332.x. View

Freiberger E, Blank W, Salb J, Geilhof B, Hentschke C, Landendoerfer P . Effects of a complex intervention on fall risk in the general practitioner setting: a cluster randomized controlled trial. Clin Interv Aging. 2013; 8:1079-88. PMC: 3749819. DOI: 10.2147/CIA.S46218. View

Hemming K, Haines T, Chilton P, Girling A, Lilford R . The stepped wedge cluster randomised trial: rationale, design, analysis, and reporting. BMJ. 2015; 350:h391. DOI: 10.1136/bmj.h391. View

Cornfield J . Randomization by group: a formal analysis. Am J Epidemiol. 1978; 108(2):100-2. DOI: 10.1093/oxfordjournals.aje.a112592. View

Ma J, Akhtar-Danesh N, Dolovich L, Thabane L . Imputation strategies for missing binary outcomes in cluster randomized trials. BMC Med Res Methodol. 2011; 11:18. PMC: 3055218. DOI: 10.1186/1471-2288-11-18. View

Campbell M, Donner A, Klar N . Developments in cluster randomized trials and Statistics in Medicine. Stat Med. 2006; 26(1):2-19. DOI: 10.1002/sim.2731. View

Moher D, Liberati A, Tetzlaff J, Altman D . Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. J Clin Epidemiol. 2009; 62(10):1006-12. DOI: 10.1016/j.jclinepi.2009.06.005. View

Campbell M, Elbourne D, Altman D . CONSORT statement: extension to cluster randomised trials. BMJ. 2004; 328(7441):702-8. PMC: 381234. DOI: 10.1136/bmj.328.7441.702. View

10.

Nauta J, Knol D, Adriaensens L, Wolt K, van Mechelen W, Verhagen E . Prevention of fall-related injuries in 7-year-old to 12-year-old children: a cluster randomised controlled trial. Br J Sports Med. 2013; 47(14):909-13. DOI: 10.1136/bjsports-2012-091439. View

11.

Campbell M, Mollison J, Steen N, Grimshaw J, Eccles M . Analysis of cluster randomized trials in primary care: a practical approach. Fam Pract. 2000; 17(2):192-6. DOI: 10.1093/fampra/17.2.192. View

12.

Bell M, Fairclough D . Practical and statistical issues in missing data for longitudinal patient-reported outcomes. Stat Methods Med Res. 2013; 23(5):440-59. DOI: 10.1177/0962280213476378. View

13.

Shakeshaft A, Doran C, Petrie D, Breen C, Havard A, Abudeen A . The effectiveness of community action in reducing risky alcohol consumption and harm: a cluster randomised controlled trial. PLoS Med. 2014; 11(3):e1001617. PMC: 3949675. DOI: 10.1371/journal.pmed.1001617. View

14.

Scott N, McPherson G, Ramsay C, Campbell M . The method of minimization for allocation to clinical trials. a review. Control Clin Trials. 2002; 23(6):662-74. DOI: 10.1016/s0197-2456(02)00242-8. View

15.

Ma J, Raina P, Beyene J, Thabane L . Comparison of population-averaged and cluster-specific models for the analysis of cluster randomized trials with missing binary outcomes: a simulation study. BMC Med Res Methodol. 2013; 13:9. PMC: 3560270. DOI: 10.1186/1471-2288-13-9. View

16.

Taljaard M, Donner A, Klar N . Imputation strategies for missing continuous outcomes in cluster randomized trials. Biom J. 2008; 50(3):329-45. DOI: 10.1002/bimj.200710423. View

17.

Sox H, Goodman S . The methods of comparative effectiveness research. Annu Rev Public Health. 2012; 33:425-45. DOI: 10.1146/annurev-publhealth-031811-124610. View

18.

Fiero M, Huang S, Bell M . Statistical analysis and handling of missing data in cluster randomised trials: protocol for a systematic review. BMJ Open. 2015; 5(5):e007378. PMC: 4431058. DOI: 10.1136/bmjopen-2014-007378. View

19.

Zlotkin S, Newton S, Aimone A, Azindow I, Amenga-Etego S, Tchum K . Effect of iron fortification on malaria incidence in infants and young children in Ghana: a randomized trial. JAMA. 2013; 310(9):938-47. DOI: 10.1001/jama.2013.277129. View

20.

Simpson J, Klar N, Donnor A . Accounting for cluster randomization: a review of primary prevention trials, 1990 through 1993. Am J Public Health. 1995; 85(10):1378-83. PMC: 1615612. DOI: 10.2105/ajph.85.10.1378. View