Cluster Randomised Trials with a Binary Outcome and a Small Number of Clusters: Comparison of Individual and Cluster Level Analysis Method
Overview
Health Services
Affiliations
Background: Cluster randomised trials (CRTs) are often designed with a small number of clusters, but it is not clear which analysis methods are optimal when the outcome is binary. This simulation study aimed to determine (i) whether cluster-level analysis (CL), generalised linear mixed models (GLMM), and generalised estimating equations with sandwich variance (GEE) approaches maintain acceptable type-one error including the impact of non-normality of cluster effects and low prevalence, and if so (ii) which methods have the greatest power. We simulated CRTs with 8-30 clusters, altering the cluster-size, outcome prevalence, intracluster correlation coefficient, and cluster effect distribution. We analysed each dataset with weighted and unweighted CL; GLMM with adaptive quadrature and restricted pseudolikelihood; GEE with Kauermann-and-Carroll and Fay-and-Graubard sandwich variance using independent and exchangeable working correlation matrices. P-values were from a t-distribution with degrees of freedom (DoF) as clusters minus cluster-level parameters; GLMM pseudolikelihood also used Satterthwaite and Kenward-Roger DoF.
Results: Unweighted CL, GLMM pseudolikelihood, and Fay-and-Graubard GEE with independent or exchangeable working correlation matrix controlled type-one error in > 97% scenarios with clusters minus parameters DoF. Cluster-effect distribution and prevalence of outcome did not usually affect analysis method performance. GEE had the least power. With 20-30 clusters, GLMM had greater power than CL with varying cluster-size but similar power otherwise; with fewer clusters, GLMM had lower power with common cluster-size, similar power with medium variation, and greater power with large variation in cluster-size.
Conclusion: We recommend that CRTs with ≤ 30 clusters and a binary outcome use an unweighted CL or restricted pseudolikelihood GLMM both with DoF clusters minus cluster-level parameters.
Design of field trials for the evaluation of transmissible vaccines in animal populations.
Sheen J, Kennedy-Shaffer L, Levy M, Metcalf C PLoS Comput Biol. 2025; 21(2):e1012779.
PMID: 39899630 PMC: 11790233. DOI: 10.1371/journal.pcbi.1012779.
Tan R, Kavishe G, Kulinkina A, Renggli S, Luwanda L, Mangu C PLOS Digit Health. 2024; 3(12):e0000694.
PMID: 39715234 PMC: 11666054. DOI: 10.1371/journal.pdig.0000694.
Hemming K, Thompson J, Taljaard M, Watson S, Kasza J, Thompson J Trials. 2024; 25(1):818.
PMID: 39695707 PMC: 11653799. DOI: 10.1186/s13063-024-08653-1.
Demystifying estimands in cluster-randomised trials.
Kahan B, Blette B, Harhay M, Halpern S, Jairath V, Copas A Stat Methods Med Res. 2024; 33(7):1211-1232.
PMID: 38780480 PMC: 11348634. DOI: 10.1177/09622802241254197.
Offorha B, Walters S, Jacques R BMC Med Res Methodol. 2023; 23(1):293.
PMID: 38093221 PMC: 10717070. DOI: 10.1186/s12874-023-02107-z.