» Articles » PMID: 23335334

Correcting Coalescent Analyses for Panel-based SNP Ascertainment

Overview
Journal Genetics
Specialty Genetics
Date 2013 Jan 22
PMID 23335334
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Single-nucleotide polymorphism (SNP) data are routinely obtained by sequencing a region of interest in a small panel, constructing a chip with probes specific to sites found to vary in the panel, and using the chip to assay subsequent samples. The size of the chip is often reduced by removing low-frequency alleles from the set of SNPs. Using coalescent estimation of the scaled population size parameter, Θ, as a test case, we demonstrate the loss of information inherent in this procedure and develop corrections for coalescent analysis of SNPs obtained via a panel. We show that more accurate Θ-estimates can be recovered if the panel size is known, but at considerable computational cost as the panel individuals must be explicitly modeled in the analysis. We extend this technique to apply to the case where rare alleles have been omitted from the SNP panel. We find that when appropriate corrections for panel ascertainment and rare-allele omission are used, the biases introduced by ascertainment are largely correctable, but recovered estimates are less accurate than would be obtained with fully sequenced data. This method is then applied to recombinant multiple population data to investigate the effects of recombination and migration on the estimate of Θ.

Citing Articles

Effects of single nucleotide polymorphism ascertainment on population structure inferences.

Dokan K, Kawamura S, Teshima K G3 (Bethesda). 2021; 11(9).

PMID: 33871576 PMC: 8496283. DOI: 10.1093/g3journal/jkab128.


Use of modern tomato breeding germplasm for deciphering the genetic control of agronomical traits by Genome Wide Association study.

Bauchet G, Grenier S, Samson N, Bonnet J, Grivet L, Causse M Theor Appl Genet. 2017; 130(5):875-889.

PMID: 28188333 DOI: 10.1007/s00122-017-2857-9.


Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies.

Leache A, Banbury B, Felsenstein J, Nieto-Montes de Oca A, Stamatakis A Syst Biol. 2015; 64(6):1032-47.

PMID: 26227865 PMC: 4604835. DOI: 10.1093/sysbio/syv053.


How do SNP ascertainment schemes and population demographics affect inferences about population history?.

McTavish E, Hillis D BMC Genomics. 2015; 16:266.

PMID: 25887858 PMC: 4428227. DOI: 10.1186/s12864-015-1469-5.


Correcting for sequencing error in maximum likelihood phylogeny inference.

Kuhner M, McGill J G3 (Bethesda). 2014; 4(12):2545-52.

PMID: 25378476 PMC: 4267948. DOI: 10.1534/g3.114.014365.

References
1.
Watterson G . On the number of segregating sites in genetical models without recombination. Theor Popul Biol. 1975; 7(2):256-76. DOI: 10.1016/0040-5809(75)90020-9. View

2.
Kimura M . A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980; 16(2):111-20. DOI: 10.1007/BF01731581. View

3.
Adams A, Hudson R . Maximum-likelihood estimation of demographic parameters using the frequency spectrum of unlinked single-nucleotide polymorphisms. Genetics. 2004; 168(3):1699-712. PMC: 1448761. DOI: 10.1534/genetics.104.030171. View

4.
Nielsen R . Population genetic analysis of ascertained SNP data. Hum Genomics. 2004; 1(3):218-24. PMC: 3525085. DOI: 10.1186/1479-7364-1-3-218. View

5.
Ewens W . The sampling theory of selectively neutral alleles. Theor Popul Biol. 1972; 3(1):87-112. DOI: 10.1016/0040-5809(72)90035-4. View