Extreme Polygenicity of Complex Traits Is Explained by Negative Selection

Overview

Journal Am J Hum Genet

Publisher Cell Press

Specialty Genetics

Date 2019 Aug 13

PMID 31402091

Citations 120

Authors

Luke J OConnor

Armin P Schoech

Farhad Hormozdiari

Steven Gazal

Nick Patterson

Alkes L Price

Affiliations

Soon will be listed here.

Abstract

Complex traits and common diseases are extremely polygenic, their heritability spread across thousands of loci. One possible explanation is that thousands of genes and loci have similarly important biological effects when mutated. However, we hypothesize that for most complex traits, relatively few genes and loci are critical, and negative selection-purging large-effect mutations in these regions-leaves behind common-variant associations in thousands of less critical regions instead. We refer to this phenomenon as flattening. To quantify its effects, we introduce a mathematical definition of polygenicity, the effective number of independently associated SNPs (M), which describes how evenly the heritability of a trait is spread across the genome. We developed a method, stratified LD fourth moments regression (S-LD4M), to estimate M, validating that it produces robust estimates in simulations. Analyzing 33 complex traits (average N = 361k), we determined that heritability is spread ∼4× more evenly among common SNPs than among low-frequency SNPs. This difference, together with evolutionary modeling of new mutations, suggests that complex traits would be orders of magnitude less polygenic if not for the influence of negative selection. We also determined that heritability is spread more evenly within functionally important regions in proportion to their heritability enrichment; functionally important regions do not harbor common SNPs with greatly increased causal effect sizes, due to selective constraint. Our results suggest that for most complex traits, the genes and loci with the most critical biological effects often differ from those with the strongest common-variant associations.

Citing Articles

Crusaders, monks and family fortunes: evolutionary models of male homosexuality and related phenomena.

Turecek P, Fort J, Flegr J Proc Biol Sci. 2025; 292(2042):20242756.

PMID: 40040457 PMC: 11880841. DOI: 10.1098/rspb.2024.2756.

Specificity, length, and luck: How genes are prioritized by rare and common variant association studies.

Spence J, Mostafavi H, Ota M, Milind N, Gjorgjieva T, Smith C bioRxiv. 2025; .

PMID: 39935885 PMC: 11812597. DOI: 10.1101/2024.12.12.628073.

Comparing statistical learning methods for complex trait prediction from gene expression.

Klimkowski Arango N, Morgante F PLoS One. 2025; 20(2):e0317516.

PMID: 39932918 PMC: 11813155. DOI: 10.1371/journal.pone.0317516.

A scalable variational inference approach for increased mixed-model association power.

Loya H, Kalantzis G, Cooper F, Palamara P Nat Genet. 2025; 57(2):461-468.

PMID: 39789286 PMC: 11821521. DOI: 10.1038/s41588-024-02044-7.

Utility of Candidate Genes From an Algorithm Designed to Predict Genetic Risk for Opioid Use Disorder.

Davis C, Jinwala Z, Hatoum A, Toikumo S, Agrawal A, Rentsch C JAMA Netw Open. 2025; 8(1):e2453913.

PMID: 39786773 PMC: 11718552. DOI: 10.1001/jamanetworkopen.2024.53913.

References

Huang H, Fang M, Jostins L, Mirkov M, Boucher G, Anderson C . Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature. 2017; 547(7662):173-178. PMC: 5511510. DOI: 10.1038/nature22969. View

Campbell C, Ogburn E, Lunetta K, Lyon H, Freedman M, Groop L . Demonstrating stratification in a European American population. Nat Genet. 2005; 37(8):868-72. DOI: 10.1038/ng1607. View

Hormozdiari F, Gazal S, van de Geijn B, Finucane H, Ju C, Loh P . Leveraging molecular quantitative trait loci to understand the genetic architecture of diseases and complex traits. Nat Genet. 2018; 50(7):1041-1047. PMC: 6030458. DOI: 10.1038/s41588-018-0148-2. View

Lek M, Karczewski K, Minikel E, Samocha K, Banks E, Fennell T . Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016; 536(7616):285-91. PMC: 5018207. DOI: 10.1038/nature19057. View

Keightley P, Hill W . Quantitative genetic variability maintained by mutation-stabilizing selection balance in finite populations. Genet Res. 1988; 52(1):33-43. DOI: 10.1017/s0016672300027282. View

Boyle E, Li Y, Pritchard J . An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017; 169(7):1177-1186. PMC: 5536862. DOI: 10.1016/j.cell.2017.05.038. View

Iossifov I, ORoak B, Sanders S, Ronemus M, Krumm N, Levy D . The contribution of de novo coding mutations to autism spectrum disorder. Nature. 2014; 515(7526):216-21. PMC: 4313871. DOI: 10.1038/nature13908. View

Yang J, Bakshi A, Zhu Z, Hemani G, Vinkhuyzen A, Lee S . Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat Genet. 2015; 47(10):1114-20. PMC: 4589513. DOI: 10.1038/ng.3390. View

Neale B, Kou Y, Liu L, Maayan A, Samocha K, Sabo A . Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature. 2012; 485(7397):242-5. PMC: 3613847. DOI: 10.1038/nature11011. View

10.

Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx B . Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016; 48(3):245-52. PMC: 4767558. DOI: 10.1038/ng.3506. View

11.

Visscher P, Wray N, Zhang Q, Sklar P, McCarthy M, Brown M . 10 Years of GWAS Discovery: Biology, Function, and Translation. Am J Hum Genet. 2017; 101(1):5-22. PMC: 5501872. DOI: 10.1016/j.ajhg.2017.06.005. View

12.

Zhang Y, Qi G, Park J, Chatterjee N . Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits. Nat Genet. 2018; 50(9):1318-1326. DOI: 10.1038/s41588-018-0193-x. View

13.

Fromer M, Pocklington A, Kavanagh D, Williams H, Dwyer S, Gormley P . De novo mutations in schizophrenia implicate synaptic networks. Nature. 2014; 506(7487):179-84. PMC: 4237002. DOI: 10.1038/nature12929. View

14.

Hunter D, Altshuler D, Rader D . From Darwin's finches to canaries in the coal mine--mining the genome for new biology. N Engl J Med. 2008; 358(26):2760-3. DOI: 10.1056/NEJMp0804318. View

15.

Wray N, Wijmenga C, Sullivan P, Yang J, Visscher P . Common Disease Is More Complex Than Implied by the Core Gene Omnigenic Model. Cell. 2018; 173(7):1573-1580. DOI: 10.1016/j.cell.2018.05.051. View

16.

Niemi M, Martin H, Rice D, Gallone G, Gordon S, Kelemen M . Common genetic variants contribute to risk of rare severe neurodevelopmental disorders. Nature. 2018; 562(7726):268-271. PMC: 6726472. DOI: 10.1038/s41586-018-0566-4. View

17.

Stahl E, Wegmann D, Trynka G, Gutierrez-Achury J, Do R, Voight B . Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis. Nat Genet. 2012; 44(5):483-9. PMC: 6560362. DOI: 10.1038/ng.2232. View

18.

Hirschhorn J . Genomewide association studies--illuminating biologic pathways. N Engl J Med. 2009; 360(17):1699-701. DOI: 10.1056/NEJMp0808934. View

19.

Zuk O, Schaffner S, Samocha K, Do R, Hechter E, Kathiresan S . Searching for missing heritability: designing rare variant association studies. Proc Natl Acad Sci U S A. 2014; 111(4):E455-64. PMC: 3910587. DOI: 10.1073/pnas.1322563111. View

20.

Simons Y, Bullaughey K, Hudson R, Sella G . A population genetic interpretation of GWAS findings for human quantitative traits. PLoS Biol. 2018; 16(3):e2002985. PMC: 5871013. DOI: 10.1371/journal.pbio.2002985. View