Extreme Polygenicity of Complex Traits Is Explained by Negative Selection
Overview
Affiliations
Complex traits and common diseases are extremely polygenic, their heritability spread across thousands of loci. One possible explanation is that thousands of genes and loci have similarly important biological effects when mutated. However, we hypothesize that for most complex traits, relatively few genes and loci are critical, and negative selection-purging large-effect mutations in these regions-leaves behind common-variant associations in thousands of less critical regions instead. We refer to this phenomenon as flattening. To quantify its effects, we introduce a mathematical definition of polygenicity, the effective number of independently associated SNPs (M), which describes how evenly the heritability of a trait is spread across the genome. We developed a method, stratified LD fourth moments regression (S-LD4M), to estimate M, validating that it produces robust estimates in simulations. Analyzing 33 complex traits (average N = 361k), we determined that heritability is spread ∼4× more evenly among common SNPs than among low-frequency SNPs. This difference, together with evolutionary modeling of new mutations, suggests that complex traits would be orders of magnitude less polygenic if not for the influence of negative selection. We also determined that heritability is spread more evenly within functionally important regions in proportion to their heritability enrichment; functionally important regions do not harbor common SNPs with greatly increased causal effect sizes, due to selective constraint. Our results suggest that for most complex traits, the genes and loci with the most critical biological effects often differ from those with the strongest common-variant associations.
Turecek P, Fort J, Flegr J Proc Biol Sci. 2025; 292(2042):20242756.
PMID: 40040457 PMC: 11880841. DOI: 10.1098/rspb.2024.2756.
Spence J, Mostafavi H, Ota M, Milind N, Gjorgjieva T, Smith C bioRxiv. 2025; .
PMID: 39935885 PMC: 11812597. DOI: 10.1101/2024.12.12.628073.
Comparing statistical learning methods for complex trait prediction from gene expression.
Klimkowski Arango N, Morgante F PLoS One. 2025; 20(2):e0317516.
PMID: 39932918 PMC: 11813155. DOI: 10.1371/journal.pone.0317516.
A scalable variational inference approach for increased mixed-model association power.
Loya H, Kalantzis G, Cooper F, Palamara P Nat Genet. 2025; 57(2):461-468.
PMID: 39789286 PMC: 11821521. DOI: 10.1038/s41588-024-02044-7.
Davis C, Jinwala Z, Hatoum A, Toikumo S, Agrawal A, Rentsch C JAMA Netw Open. 2025; 8(1):e2453913.
PMID: 39786773 PMC: 11718552. DOI: 10.1001/jamanetworkopen.2024.53913.