» Articles » PMID: 37343895

Multiple Imputation of Missing Data Under Missing at Random: Compatible Imputation Models Are Not Sufficient to Avoid Bias if They Are Mis-specified

Overview
Publisher Elsevier
Specialty Public Health
Date 2023 Jun 21
PMID 37343895
Authors
Affiliations
Soon will be listed here.
Abstract

Objectives: Epidemiological studies often have missing data, which are commonly handled by multiple imputation (MI). Standard (default) MI procedures use simple linear covariate functions in the imputation model. We examine the bias that may be caused by acceptance of this default option and evaluate methods to identify problematic imputation models, providing practical guidance for researchers.

Study Design And Setting: Using simulation and real data analysis, we investigated how imputation model mis-specification affected MI performance, comparing results with complete records analysis (CRA). We considered scenarios in which imputation model mis-specification occurred because (i) the analysis model was mis-specified or (ii) the relationship between exposure and confounder was mis-specified.

Results: Mis-specification of the relationship between outcome and exposure, or between exposure and confounder, can cause biased CRA and MI estimates (in addition to any bias in the full-data estimate due to analysis model mis-specification). MI by predictive mean matching can mitigate model mis-specification. Methods for examining model mis-specification were effective in identifying mis-specified relationships.

Conclusion: When using MI methods that assume data are MAR, compatibility between the analysis and imputation models is necessary, but not sufficient to avoid bias. We propose a step-by-step procedure for identifying and correcting mis-specification of imputation models.

Citing Articles

Challenge of missing data in observational studies: investigating cross-sectional imputation methods for assessing disease activity in axial spondyloarthritis.

Georgiadis S, Pons M, Rasmussen S, Hetland M, Linde L, Di Giuseppe D RMD Open. 2025; 11(1).

PMID: 39979039 PMC: 11843021. DOI: 10.1136/rmdopen-2024-004844.


Multiple imputation using auxiliary imputation variables that only predict missingness can increase bias due to data missing not at random.

Curnow E, Cornish R, Heron J, Carpenter J, Tilling K BMC Med Res Methodol. 2024; 24(1):231.

PMID: 39375597 PMC: 11457445. DOI: 10.1186/s12874-024-02353-9.


Effectiveness of early pharmaceutical interventions in symptomatic COVID-19 patients: A randomized clinical trial.

Azhar S, Akram J, Latif W, Ibanez N, Mumtaz S, Rafi A Pak J Med Sci. 2024; 40(5):800-810.

PMID: 38827854 PMC: 11140354. DOI: 10.12669/pjms.40.5.8757.


Handling of outcome missing data dependent on measured or unmeasured background factors in micro-randomized trial: Simulation and application study.

Kondo M, Oba K Digit Health. 2024; 10:20552076241249631.

PMID: 38698826 PMC: 11064756. DOI: 10.1177/20552076241249631.


Categorisation of continuous covariates for stratified randomisation: How should we adjust?.

Sullivan T, Morris T, Kahan B, Cuthbert A, Yelland L Stat Med. 2024; 43(11):2083-2095.

PMID: 38487976 PMC: 7616414. DOI: 10.1002/sim.10060.

References
1.
Hughes R, Heron J, Sterne J, Tilling K . Accounting for missing data in statistical analyses: multiple imputation is not always the answer. Int J Epidemiol. 2019; 48(4):1294-1304. PMC: 6693809. DOI: 10.1093/ije/dyz032. View

2.
White I, Royston P, Wood A . Multiple imputation using chained equations: Issues and guidance for practice. Stat Med. 2011; 30(4):377-99. DOI: 10.1002/sim.4067. View

3.
Carpenter J, Smuk M . Missing data: A statistical framework for practice. Biom J. 2021; 63(5):915-947. PMC: 7615108. DOI: 10.1002/bimj.202000196. View

4.
van Buuren S . Multiple imputation of discrete and continuous data by fully conditional specification. Stat Methods Med Res. 2007; 16(3):219-42. DOI: 10.1177/0962280206074463. View

5.
Vickers A . Whose data set is it anyway? Sharing raw data from randomized trials. Trials. 2006; 7:15. PMC: 1489946. DOI: 10.1186/1745-6215-7-15. View