Tree-guided Bayesian Inference of Population Structures
Motivation: Inferring population structures using genetic data sampled from a group of individuals is a challenging task. Many methods either consider a fixed population number or ignore the correlation between populations. As a result, they can lose sensitivity and specificity in detecting subtle stratifications. In addition, when a large number of genetic markers are used, many existing algorithms perform rather inefficiently.
Result: We propose a new Bayesian method to infer population structures using multiple unlinked single nucleotide polymorphisms (SNPs). Our approach explicitly models the correlation between populations through a tree hierarchy, and treats the population number as a random variable. Using both simulated and real datasets of worldwide samples, we demonstrate that incorporating a tree can consistently improve the power to detect subtle population stratifications. A tree-based model often involves a large number of unknown parameters, and the corresponding estimation procedure can be highly inefficient. We therefore implement a partition method to analytically integrate out all nuisance parameters in the tree. As a result, our method can analyze large SNP datasets with a significantly improved convergence rate.
Availability: http://www.stat.psu.edu/~yuzhang/tips.tar.
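To illustrate what analytically integrating out nuisance parameters can look like, the following is a minimal sketch and not the authors' implementation. It assumes biallelic SNPs, an independent Beta(a, b) prior on each population's allele frequency, and no tree prior linking populations; the abstract does not specify the actual model. The function names (`log_marginal`, `log_partition_score`) and the toy counts are hypothetical. Under these assumptions the allele frequency integrates out in closed form, so a candidate partition of samples into populations can be scored without sampling the frequencies.

```python
from math import lgamma


def log_beta(a, b):
    """Log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_marginal(n_alt, n_ref, a=1.0, b=1.0):
    """Log probability of an ordered allele sequence at one SNP in one
    population, with the allele frequency integrated out under an assumed
    Beta(a, b) prior: B(a + n_alt, b + n_ref) / B(a, b)."""
    return log_beta(a + n_alt, b + n_ref) - log_beta(a, b)


def log_partition_score(counts_by_pop, a=1.0, b=1.0):
    """Sum the per-SNP log marginals over all populations of a partition.

    counts_by_pop: one list per population, each holding (n_alt, n_ref)
    allele counts per SNP. SNPs are treated as unlinked (independent).
    """
    return sum(
        log_marginal(n_alt, n_ref, a, b)
        for pop in counts_by_pop
        for n_alt, n_ref in pop
    )


# Toy data (hypothetical): two SNPs, 20 allele copies in total per SNP.
# "split" assigns the samples to two populations with opposite frequencies;
# "merged" pools the same alleles into a single population.
split = [[(9, 1), (1, 9)], [(1, 9), (9, 1)]]
merged = [[(10, 10), (10, 10)]]

split_score = log_partition_score(split)
merged_score = log_partition_score(merged)
print("split :", split_score)
print("merged:", merged_score)
```

In this toy case the two-population partition scores higher than the pooled one, because the allele counts differ sharply between the two groups. A tree-based model as described in the abstract would additionally tie the population frequencies together through the tree, which this sketch omits.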