Impact of Diagnostic Misclassification on Estimation of Genetic Correlations Using Genome-wide Genotypes
Overview
Affiliations
Disorders that share genetic risk factors often are placed in closely related diagnostic categories and treated similarly. Until recently, evidence for shared genetic etiology derived from classical research strategies--coaggregation in family and twin studies. Accumulating sufficient numbers of families was often problematic. However, in the era of genome-wide genotyping, we can now directly estimate the degree of sharing of genetic risk factors between disorders. This strategy is practical even for very rare disorders, where it is infeasible to ascertain informative families. Importantly, the estimates of genetic correlations from genome-wide genotypes are derived using such distant relatives that contamination by shared environmental factors seems unlikely. However, any method that seeks to quantify the shared etiology of disorders assumes they can be distinguished diagnostically from one another without error. Here we investigate the impact of misdiagnosis on estimates of genetic correlation both from traditional family data and from genome-wide genotypes of case-control samples from unrelated individuals. Our analyses show similar results for levels of misdiagnosis in both types of data. In both scenarios, genetic variances and heritabilities tend to be slightly underestimated but genetic correlations are overestimated, sometimes substantially so. For example, two genetically distinct but equally heritable disorders each with prevalence 1%, can generate false-positive estimates of genetic correlations of >0.2 in the presence of 10% reciprocal misdiagnosis. Strategies for minimizing the effects of misdiagnosis in cross-disorder genetic studies are discussed.
Cai N, Verhulst B, Andreassen O, Buitelaar J, Edenberg H, Hettema J Mol Psychiatry. 2024; .
PMID: 39730880 DOI: 10.1038/s41380-024-02878-x.
Westergaard D, Jorgensen F, Waaben J, Jung A, Lademann M, Hansen T Nat Commun. 2024; 15(1):7457.
PMID: 39198472 PMC: 11358290. DOI: 10.1038/s41467-024-51795-8.
Structural models of genome-wide covariance identify multiple common dimensions in autism.
de Hoyos L, Barendse M, Schlag F, van Donkelaar M, Verhoef E, Shapland C Nat Commun. 2024; 15(1):1770.
PMID: 38413609 PMC: 10899248. DOI: 10.1038/s41467-024-46128-8.
LaBianca S, Brikell I, Helenius D, Loughnan R, Mefford J, Palmer C Nat Genet. 2023; 56(2):234-244.
PMID: 38036780 PMC: 11439085. DOI: 10.1038/s41588-023-01593-7.
Genomic SEM applied to explore etiological divergences in bipolar subtypes.
Lawrence J, Breunig S, Foote I, Tallis C, Grotzinger A Psychol Med. 2023; 54(6):1152-1159.
PMID: 37885278 PMC: 11552689. DOI: 10.1017/S0033291723002957.