» Articles » PMID: 17110489

Controlling the False-positive Rate in Multilocus Genome Scans for Selection

Overview
Journal Genetics
Specialty Genetics
Date 2006 Nov 18
PMID 17110489
Citations 92
Authors
Affiliations
Soon will be listed here.
Abstract

Rapid typing of genetic variation at many regions of the genome is an efficient way to survey variability in natural populations in an effort to identify segments of the genome that have experienced recent natural selection. Following such a genome scan, individual regions may be chosen for further sequencing and a more detailed analysis of patterns of variability, often to perform a parametric test for selection and to estimate the strength of a recent selective sweep. We show here that not accounting for the ascertainment of loci in such analyses leads to false inference of natural selection when the true model is selective neutrality, because the procedure of choosing unusual loci (in comparison to the rest of the genome-scan data) selects regions of the genome with genealogies similar to those expected under models of recent directional selection. We describe a simple and efficient correction for this ascertainment bias, which restores the false-positive rate to near-nominal levels. For the parameters considered here, we find that obtaining a test with the expected distribution of P-values depends on accurately accounting both for ascertainment of regions and for demography. Finally, we use simulations to explore the utility of relying on outlier loci to detect recent selective sweeps. We find that measures of diversity and of population differentiation are more effective than summaries of the site-frequency spectrum and that sequencing larger regions (2.5 kbp) in genome-scan studies leads to more power to detect recent selective sweeps.

Citing Articles

iHDSel software: The price equation and the population stability index to detect genomic patterns compatible with selective sweeps. An example with SARS-CoV-2.

Carvajal-Rodriguez A Biol Methods Protoc. 2024; 9(1):bpae089.

PMID: 39679303 PMC: 11646571. DOI: 10.1093/biomethods/bpae089.


A whole-genome scan for evidence of recent positive and balancing selection in aye-ayes () utilizing a well-fit evolutionary baseline model.

Soni V, Terbot 2nd J, Versoza C, Pfeifer S, Jensen J bioRxiv. 2024; .

PMID: 39605496 PMC: 11601216. DOI: 10.1101/2024.11.08.622667.


Digital Image Processing to Detect Adaptive Evolution.

Amin M, Hasan M, DeGiorgio M Mol Biol Evol. 2024; 41(12).

PMID: 39565932 PMC: 11631197. DOI: 10.1093/molbev/msae242.


Genomes of Microtus Rodents Highlight the Importance of Olfactory and Immune Systems in Their Fast Radiation.

Gouy A, Wang X, Kapopoulou A, Neuenschwander S, Schmid E, Excoffier L Genome Biol Evol. 2024; 16(11).

PMID: 39445808 PMC: 11579656. DOI: 10.1093/gbe/evae233.


Biases in ARG-Based Inference of Historical Population Size in Populations Experiencing Selection.

Marsh J, Johri P Mol Biol Evol. 2024; 41(7).

PMID: 38874402 PMC: 11245712. DOI: 10.1093/molbev/msae118.


References
1.
Watterson G . On the number of segregating sites in genetical models without recombination. Theor Popul Biol. 1975; 7(2):256-76. DOI: 10.1016/0040-5809(75)90020-9. View

2.
Akashi H . Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics. 1995; 139(2):1067-76. PMC: 1206357. DOI: 10.1093/genetics/139.2.1067. View

3.
Akey J, Eberle M, Rieder M, Carlson C, Shriver M, Nickerson D . Population history and natural selection shape patterns of genetic variation in 132 genes. PLoS Biol. 2004; 2(10):e286. PMC: 515367. DOI: 10.1371/journal.pbio.0020286. View

4.
Nielsen R, Williamson S, Kim Y, Hubisz M, Clark A, Bustamante C . Genomic scans for selective sweeps using SNP data. Genome Res. 2005; 15(11):1566-75. PMC: 1310644. DOI: 10.1101/gr.4252305. View

5.
Wright S, Vroh Bi I, Schroeder S, Yamasaki M, Doebley J, McMullen M . The effects of artificial selection on the maize genome. Science. 2005; 308(5726):1310-4. DOI: 10.1126/science.1107891. View