» Articles » PMID: 16596579

Gaussianization-based Quasi-imputation and Expansion Strategies for Incomplete Correlated Binary Responses

Overview
Journal Stat Med
Publisher Wiley
Specialty Public Health
Date 2006 Apr 6
PMID 16596579
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

New quasi-imputation and expansion strategies for correlated binary responses are proposed by borrowing ideas from random number generation. The core idea is to convert correlated binary outcomes to multivariate normal outcomes in a sensible way so that re-conversion to the binary scale, after performing multiple imputation, yields the original specified marginal expectations and correlations. This conversion process ensures that the correlations are transformed reasonably which in turn allows us to take advantage of well-developed imputation techniques for Gaussian outcomes. We use the phrase 'quasi' because the original observations are not guaranteed to be preserved. We argue that if the inferential goals are well-defined, it is not necessary to strictly adhere to the established definition of multiple imputation. Our expansion scheme employs a similar strategy where imputation is used as an intermediate step. It leads to proportionally inflated observed patterns, forcing the data set to a complete rectangular format. The plausibility of the proposed methodology is examined by applying it to a wide range of simulated data sets that reflect alternative assumptions on complete data populations and missing-data mechanisms. We also present an application using a data set from obesity research. We conclude that the proposed method is a promising tool for handling incomplete longitudinal or clustered binary outcomes under ignorable non-response mechanisms.

Citing Articles

Confidence Intervals for the Area Under the Receiver Operating Characteristic Curve in the Presence of Ignorable Missing Data.

Cho H, Matthews G, Harel O Int Stat Rev. 2019; 87(1):152-177.

PMID: 31007356 PMC: 6472951. DOI: 10.1111/insr.12277.


Are characteristics of the medical home associated with diabetes care costs?.

Flottemesch T, Hudson Scholle S, OConnor P, Solberg L, Asche S, Pawlson L Med Care. 2012; 50(8):676-84.

PMID: 22710277 PMC: 4641308. DOI: 10.1097/MLR.0b013e3182551793.


Simulation of massive public health data by power polynomials.

Demirtas H, Hedeker D, Mermelstein R Stat Med. 2012; 31(27):3337-46.

PMID: 22532052 PMC: 3650647. DOI: 10.1002/sim.5362.


Multiple imputation inference for multivariate multilevel continuous data with ignorable non-response.

Yucel R Philos Trans A Math Phys Eng Sci. 2008; 366(1874):2389-403.

PMID: 18407897 PMC: 3227146. DOI: 10.1098/rsta.2008.0038.