» Articles » PMID: 10347858

Applications of Multiple Imputation in Medical Studies: from AIDS to NHANES

Overview
Publisher Sage Publications
Specialties Public Health
Science
Date 1999 May 29
PMID 10347858
Citations 83
Authors
Affiliations
Soon will be listed here.
Abstract

Rubin's multiple imputation is a three-step method for handling complex missing data, or more generally, incomplete-data problems, which arise frequently in medical studies. At the first step, m (> 1) completed-data sets are created by imputing the unobserved data m times using m independent draws from an imputation model, which is constructed to reasonably approximate the true distributional relationship between the unobserved data and the available information, and thus reduce potentially very serious nonresponse bias due to systematic difference between the observed data and the unobserved ones. At the second step, m complete-data analyses are performed by treating each completed-data set as a real complete-data set, and thus standard complete-data procedures and software can be utilized directly. At the third step, the results from the m complete-data analyses are combined in a simple, appropriate way to obtain the so-called repeated-imputation inference, which properly takes into account the uncertainty in the imputed values. This paper reviews three applications of Rubin's method that are directly relevant for medical studies. The first is about estimating the reporting delay in acquired immune deficiency syndrome (AIDS) surveillance systems for the purpose of estimating survival time after AIDS diagnosis. The second focuses on the issue of missing data and noncompliance in randomized experiments, where a school choice experiment is used as an illustration. The third looks at handling nonresponse in United States National Health and Nutrition Examination Surveys (NHANES). The emphasis of our review is on the building of imputation models (i.e. the first step), which is the most fundamental aspect of the method.

Citing Articles

Modeling the determinants of attrition in a two-stage epilepsy prevalence survey in Nairobi using machine learning.

Mwanga D, Kipchirchir I, Muhua G, Newton C, Kadengye D Glob Epidemiol. 2025; 9:100183.

PMID: 39926376 PMC: 11804775. DOI: 10.1016/j.gloepi.2025.100183.


The relationship of serum klotho levels and triglyceride glucose index-related indicators.

Zhou Y, Wang Y, Li F, Shi Y, Wu T, Li Y Lipids Health Dis. 2024; 23(1):399.

PMID: 39639327 PMC: 11619470. DOI: 10.1186/s12944-024-02379-4.


Early Prediction of Cardiac Arrest in the Intensive Care Unit Using Explainable Machine Learning: Retrospective Study.

Kim Y, Seo W, Lee S, Koo J, Kim G, Song H J Med Internet Res. 2024; 26:e62890.

PMID: 39288404 PMC: 11445627. DOI: 10.2196/62890.


J-shaped relationship between stress hyperglycemia ratio and 90-day and 180-day mortality in patients with a first diagnosis of acute myocardial infarction: analysis of the MIMIC-IV database.

Hu B, Chen X, Wang Y, Wei X, Feng J, Hou L Diabetol Metab Syndr. 2024; 16(1):132.

PMID: 38880917 PMC: 11181615. DOI: 10.1186/s13098-024-01380-2.


Combined associations of regular exercise and work-related moderate-to-vigorous physical activity with occupational stress responses: a cross-sectional study.

Abe T, Okuyama K, Motohiro A, Shiratsuchi D, Isomura M Front Sports Act Living. 2024; 6:1386775.

PMID: 38783865 PMC: 11111849. DOI: 10.3389/fspor.2024.1386775.