» Articles » PMID: 30398642

Localizing and Classifying Adaptive Targets with Trend Filtered Regression

Overview
Journal Mol Biol Evol
Specialty Biology
Date 2018 Nov 7
PMID 30398642
Citations 24
Authors
Affiliations
Soon will be listed here.
Abstract

Identifying genomic locations of natural selection from sequence data is an ongoing challenge in population genetics. Current methods utilizing information combined from several summary statistics typically assume no correlation of summary statistics regardless of the genomic location from which they are calculated. However, due to linkage disequilibrium, summary statistics calculated at nearby genomic positions are highly correlated. We introduce an approach termed Trendsetter that accounts for the similarity of statistics calculated from adjacent genomic regions through trend filtering, while reducing the effects of multicollinearity through regularization. Our penalized regression framework has high power to detect sweeps, is capable of classifying sweep regions as either hard or soft, and can be applied to other selection scenarios as well. We find that Trendsetter is robust to both extensive missing data and strong background selection, and has comparable power to similar current approaches. Moreover, the model learned by Trendsetter can be viewed as a set of curves modeling the spatial distribution of summary statistics in the genome. Application to human genomic data revealed positively selected regions previously discovered such as LCT in Europeans and EDAR in East Asians. We also identified a number of novel candidates and show that populations with greater relatedness share more sweep signals.

Citing Articles

Digital Image Processing to Detect Adaptive Evolution.

Amin M, Hasan M, DeGiorgio M Mol Biol Evol. 2024; 41(12).

PMID: 39565932 PMC: 11631197. DOI: 10.1093/molbev/msae242.


Population size rescaling significantly biases outcomes of forward-in-time population genetic simulations.

Dabi A, Schrider D Genetics. 2024; 229(1):1-57.

PMID: 39503241 PMC: 11708920. DOI: 10.1093/genetics/iyae180.


Tree Sequences as a General-Purpose Tool for Population Genetic Inference.

Whitehouse L, Ray D, Schrider D Mol Biol Evol. 2024; 41(11).

PMID: 39460991 PMC: 11600592. DOI: 10.1093/molbev/msae223.


Genome-Wide Analysis of Genetic Diversity and Selection Signatures in Zaobei Beef Cattle.

Shi L, Zhang P, Liu Q, Liu C, Cheng L, Yu B Animals (Basel). 2024; 14(16).

PMID: 39199980 PMC: 11350888. DOI: 10.3390/ani14162447.


Tree sequences as a general-purpose tool for population genetic inference.

Whitehouse L, Ray D, Schrider D bioRxiv. 2024; .

PMID: 39185244 PMC: 11343121. DOI: 10.1101/2024.02.20.581288.


References
1.
Sherry T . Identifying migratory birds' population bottlenecks in time and space. Proc Natl Acad Sci U S A. 2018; 115(14):3515-3517. PMC: 5889679. DOI: 10.1073/pnas.1802174115. View

2.
Sabeti P, Fry B, Lohmueller J, Hostetter E, Cotsapas C, Xie X . Genome-wide detection and characterization of positive selection in human populations. Nature. 2007; 449(7164):913-8. PMC: 2687721. DOI: 10.1038/nature06250. View

3.
Jensen J, Kim Y, Bauer DuMont V, Aquadro C, Bustamante C . Distinguishing between selective sweeps and demography using DNA polymorphism data. Genetics. 2005; 170(3):1401-10. PMC: 1451184. DOI: 10.1534/genetics.104.038224. View

4.
Tenesa A, Navarro P, Hayes B, Duffy D, Clarke G, Goddard M . Recent human effective population size estimated from linkage disequilibrium. Genome Res. 2007; 17(4):520-6. PMC: 1832099. DOI: 10.1101/gr.6023607. View

5.
Scally A, Durbin R . Revising the human mutation rate: implications for understanding human evolution. Nat Rev Genet. 2012; 13(10):745-53. DOI: 10.1038/nrg3295. View