
Phenotype Integration Improves Power and Preserves Specificity in Biobank-based Genetic Studies of Major Depressive Disorder

Overview
Journal: Nat Genet
Specialty: Genetics
Date: 2023 Nov 21
PMID: 37985818
Abstract

Biobanks often contain several phenotypes relevant to diseases such as major depressive disorder (MDD), with partly distinct genetic architectures. Researchers face complex tradeoffs between shallow (large sample size, low specificity/sensitivity) and deep (small sample size, high specificity/sensitivity) phenotypes, and the optimal choices are often unclear. Here we propose to integrate these phenotypes to combine the benefits of each. We use phenotype imputation to integrate information across hundreds of MDD-relevant phenotypes, which significantly increases genome-wide association study (GWAS) power and polygenic risk score (PRS) prediction accuracy of the deepest available MDD phenotype in UK Biobank, LifetimeMDD. We demonstrate that imputation preserves specificity in its genetic architecture using a novel PRS-based pleiotropy metric. We further find that integration via summary statistics also enhances GWAS power and PRS predictions, but can introduce nonspecific genetic effects depending on input. Our work provides a simple and scalable approach to improve genetic studies in large biobanks by integrating shallow and deep phenotypes.
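As a rough illustration of the phenotype-imputation idea described above, the following minimal sketch fills in a sparsely observed "deep" phenotype from fully observed "shallow" phenotypes. It uses an iterative rank-truncated SVD as a generic stand-in; the abstract does not state which imputation algorithm the authors use, so the method, the function name `impute_low_rank`, the rank, and the simulated data are all assumptions made for illustration.

```python
import numpy as np


def impute_low_rank(Y, rank=2, n_iter=100, tol=1e-6):
    """Fill NaN entries of Y (individuals x phenotypes) by iterative
    rank-truncated SVD. Hypothetical helper, not the authors' method."""
    Y = np.asarray(Y, dtype=float)
    missing = np.isnan(Y)
    # Initialise missing entries with the mean of each phenotype's observed values.
    col_means = np.nanmean(Y, axis=0)
    filled = np.where(missing, col_means, Y)
    for _ in range(n_iter):
        U, s, Vt = np.linalg.svd(filled, full_matrices=False)
        low_rank = (U[:, :rank] * s[:rank]) @ Vt[:rank]
        # Observed entries are kept fixed; only missing entries are updated.
        updated = np.where(missing, low_rank, Y)
        change = np.linalg.norm(updated - filled) / max(np.linalg.norm(filled), 1e-12)
        filled = updated
        if change < tol:
            break
    return filled


# Simulated data (assumption): two latent factors drive six phenotypes.
rng = np.random.default_rng(0)
n = 500
factors = rng.normal(size=(n, 2))
loadings = np.array([
    [1.0, 0.8, 0.6, 0.9, 0.2, 0.1],
    [0.5, -0.3, 0.7, 0.1, 0.9, -0.8],
])
Y_true = factors @ loadings + 0.1 * rng.normal(size=(n, 6))

# Column 0 plays the role of the deep phenotype: hidden for about 70% of individuals.
Y_obs = Y_true.copy()
hidden = rng.random(n) < 0.7
Y_obs[hidden, 0] = np.nan

Y_imp = impute_low_rank(Y_obs, rank=2)

# Correlation between imputed and true values on the hidden entries only.
r = np.corrcoef(Y_imp[hidden, 0], Y_true[hidden, 0])[0, 1]
print(f"correlation on hidden deep-phenotype entries: {r:.3f}")
```

In a real analysis the imputed column would then serve as the GWAS or PRS target; checking that such imputation preserves the specificity of the genetic signal is a separate step that this sketch does not cover.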

Citing Articles

Trans-ancestral rare variant association study with machine learning-based phenotyping for metabolic dysfunction-associated steatotic liver disease.

Chen R, Petrazzini B, Duffy A, Rocheleau G, Jordan D, Bansal M. Genome Biol. 2025; 26(1):50.

PMID: 40065360 PMC: 11892324. DOI: 10.1186/s13059-025-03518-5.


Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes.

Kunkel D, Sorensen P, Shankar V, Morgante F. PLoS Genet. 2025; 21(1):e1011519.

PMID: 39775068 PMC: 11741642. DOI: 10.1371/journal.pgen.1011519.


Assessment and ascertainment in psychiatric molecular genetics: challenges and opportunities for cross-disorder research.

Cai N, Verhulst B, Andreassen O, Buitelaar J, Edenberg H, Hettema J. Mol Psychiatry. 2024.

PMID: 39730880 DOI: 10.1038/s41380-024-02878-x.


Genetic liability estimated from large-scale family data improves genetic prediction, risk score profiling, and gene mapping for major depression.

Dybdahl Krebs M, Georgii Hellberg K, Lundberg M, Appadurai V, Ohlsson H, Pedersen E. Am J Hum Genet. 2024; 111(11):2494-2509.

PMID: 39471805 PMC: 11568754. DOI: 10.1016/j.ajhg.2024.09.009.


Expanding drug targets for 112 chronic diseases using a machine learning-assisted genetic priority score.

Chen R, Duffy A, Petrazzini B, Vy H, Stein D, Mort M. Nat Commun. 2024; 15(1):8891.

PMID: 39406732 PMC: 11480483. DOI: 10.1038/s41467-024-53333-y.

