» Articles » PMID: 33770823

The Effect of Random-effects Misspecification on Classification Accuracy

Overview
Journal Int J Biostat
Specialty Public Health
Date 2021 Mar 26
PMID 33770823
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Mixed models are a useful way of analysing longitudinal data. Random effects terms allow modelling of patient specific deviations from the overall trend over time. Correlation between repeated measurements are captured by specifying a joint distribution for all random effects in a model. Typically, this joint distribution is assumed to be a multivariate normal distribution. For Gaussian outcomes misspecification of the random effects distribution usually has little impact. However, when the outcome is discrete (e.g. counts or binary outcomes) generalised linear mixed models (GLMMs) are used to analyse longitudinal trends. Opinion is divided about how robust GLMMs are to misspecification of the random effects. Previous work explored the impact of random effects misspecification on the bias of model parameters in single outcome GLMMs. Accepting that these model parameters may be biased, we investigate whether this affects our ability to classify patients into clinical groups using a longitudinal discriminant analysis. We also consider multiple outcomes, which can significantly increase the dimensions of the random effects distribution when modelled simultaneously. We show that when there is severe departure from normality, more flexible mixture distributions can give better classification accuracy. However, in many cases, wrongly assuming a single multivariate normal distribution has little impact on classification accuracy.

Citing Articles

Relationship between prostate-specific antigen, alkaline phosphatase levels, and time-to-tumor shrinkage: understanding the progression of prostate cancer in a longitudinal study.

Liaqat M, Khan R, Fischer F, Kamal S BMC Urol. 2024; 24(1):137.

PMID: 38956570 PMC: 11221162. DOI: 10.1186/s12894-024-01522-8.


Outcomes associated with unrecognized acute kidney injury in postoperative pediatric cardiac patients.

Kimura S, Shimizu K, Iwasaki T, Kanazawa T, Morimatsu T, Hatano T Pediatr Nephrol. 2023; 38(8):2861-2871.

PMID: 36929386 DOI: 10.1007/s00467-023-05925-x.

References
1.
Neuhaus J, McCulloch C, Boylan R . A note on type II error under random effects misspecification in generalized linear mixed models. Biometrics. 2011; 67(2):654-6. PMC: 3079788. DOI: 10.1111/j.1541-0420.2010.01474.x. View

2.
Marshall G, De la Cruz-Mesia R, Quintana F, Baron A . Discriminant analysis for longitudinal data with multiple continuous responses and possibly missing data. Biometrics. 2008; 65(1):69-80. DOI: 10.1111/j.1541-0420.2008.01016.x. View

3.
Litiere S, Alonso A, Molenberghs G . The impact of a misspecified random-effects distribution on the estimation and the performance of inferential procedures in generalized linear mixed models. Stat Med. 2007; 27(16):3125-44. DOI: 10.1002/sim.3157. View

4.
Komarek A, Hansen B, Kuiper E, van Buuren H, Lesaffre E . Discriminant analysis using a multivariate linear mixed model with a normal mixture in the random effects distribution. Stat Med. 2010; 29(30):3267-83. DOI: 10.1002/sim.3849. View

5.
Hughes D, Komarek A, Czanner G, Garcia-Finana M . Dynamic longitudinal discriminant analysis using multiple longitudinal markers of different types. Stat Methods Med Res. 2016; 27(7):2060-2080. PMC: 5985589. DOI: 10.1177/0962280216674496. View