» Articles » PMID: 35404934

A Spatially Aware Likelihood Test to Detect Sweeps from Haplotype Distributions

Overview
Journal PLoS Genet
Specialty Genetics
Date 2022 Apr 11
PMID 35404934
Authors
Affiliations
Soon will be listed here.
Abstract

The inference of positive selection in genomes is a problem of great interest in evolutionary genomics. By identifying putative regions of the genome that contain adaptive mutations, we are able to learn about the biology of organisms and their evolutionary history. Here we introduce a composite likelihood method that identifies recently completed or ongoing positive selection by searching for extreme distortions in the spatial distribution of the haplotype frequency spectrum along the genome relative to the genome-wide expectation taken as neutrality. Furthermore, the method simultaneously infers two parameters of the sweep: the number of sweeping haplotypes and the "width" of the sweep, which is related to the strength and timing of selection. We demonstrate that this method outperforms the leading haplotype-based selection statistics, though strong signals in low-recombination regions merit extra scrutiny. As a positive control, we apply it to two well-studied human populations from the 1000 Genomes Project and examine haplotype frequency spectrum patterns at the LCT and MHC loci. We also apply it to a data set of brown rats sampled in NYC and identify genes related to olfactory perception. To facilitate use of this method, we have implemented it in user-friendly open source software.

Citing Articles

Computational Genomics and Its Applications to Anthropological Questions.

Witt K, Villanea F Am J Biol Anthropol. 2025; 186 Suppl 78:e70010.

PMID: 40071816 PMC: 11898561. DOI: 10.1002/ajpa.70010.


Sweeps in space: leveraging geographic data to identify beneficial alleles in .

Rehmann C, Small S, Ralph P, Kern A bioRxiv. 2025; .

PMID: 39975147 PMC: 11839090. DOI: 10.1101/2025.02.07.637123.


Genomic signatures of adaptation in native lizards exposed to human-introduced fire ants.

Assis B, Sullivan A, Marciniak S, Bergey C, Garcia V, Szpiech Z Nat Commun. 2025; 16(1):89.

PMID: 39746982 PMC: 11695932. DOI: 10.1038/s41467-024-55020-4.


Digital Image Processing to Detect Adaptive Evolution.

Amin M, Hasan M, DeGiorgio M Mol Biol Evol. 2024; 41(12).

PMID: 39565932 PMC: 11631197. DOI: 10.1093/molbev/msae242.


Tree Sequences as a General-Purpose Tool for Population Genetic Inference.

Whitehouse L, Ray D, Schrider D Mol Biol Evol. 2024; 41(11).

PMID: 39460991 PMC: 11600592. DOI: 10.1093/molbev/msae223.


References
1.
Mughal M, Koch H, Huang J, Chiaromonte F, DeGiorgio M . Learning the properties of adaptive regions with functional data analysis. PLoS Genet. 2020; 16(8):e1008896. PMC: 7480868. DOI: 10.1371/journal.pgen.1008896. View

2.
Schrider D, Kern A . S/HIC: Robust Identification of Soft and Hard Sweeps Using Machine Learning. PLoS Genet. 2016; 12(3):e1005928. PMC: 4792382. DOI: 10.1371/journal.pgen.1005928. View

3.
Sabeti P, Fry B, Lohmueller J, Hostetter E, Cotsapas C, Xie X . Genome-wide detection and characterization of positive selection in human populations. Nature. 2007; 449(7164):913-8. PMC: 2687721. DOI: 10.1038/nature06250. View

4.
DeGiorgio M, Huber C, Hubisz M, Hellmann I, Nielsen R . SweepFinder2: increased sensitivity, robustness and flexibility. Bioinformatics. 2016; 32(12):1895-7. DOI: 10.1093/bioinformatics/btw051. View

5.
Charlesworth D, Charlesworth B, Morgan M . The pattern of neutral molecular variation under the background selection model. Genetics. 1995; 141(4):1619-32. PMC: 1206892. DOI: 10.1093/genetics/141.4.1619. View