Tree-guided Bayesian Inference of Population Structures

Overview
Journal Bioinformatics
Specialty Biology
Date 2008 Feb 26
PMID 18296461
Citations 5
Abstract

Motivation: Inferring population structures using genetic data sampled from a group of individuals is a challenging task. Many methods either assume a fixed number of populations or ignore the correlation between populations. As a result, they can lose sensitivity and specificity in detecting subtle stratifications. In addition, when a large number of genetic markers are used, many existing algorithms become computationally inefficient.

Results: We propose a new Bayesian method to infer population structures using multiple unlinked single nucleotide polymorphisms (SNPs). Our approach explicitly models the correlation between populations through a tree hierarchy, and treats the number of populations as a random variable. Using both simulated and real datasets of worldwide samples, we demonstrate that incorporating a tree can consistently improve the power to detect subtle population stratifications. A tree-based model often involves a large number of unknown parameters, and the corresponding estimation procedure can be highly inefficient. We therefore implement a partition method to analytically integrate out all nuisance parameters in the tree. As a result, our method can analyze large SNP datasets with a significantly improved convergence rate.
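To illustrate what analytically integrating out nuisance parameters can look like, the sketch below collapses per-SNP allele frequencies under a Beta prior. This is not the tree-based partition method of the paper: it assumes a flat model with independent populations, biallelic SNPs coded 0/1, and a Beta(a, b) prior, and every function name and default value is hypothetical.

```python
from math import lgamma


def log_beta(a, b):
    """Log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_marginal(n1, n0, a=1.0, b=1.0):
    """Log probability of one SNP's observed alleles in one population.

    n1 and n0 are the counts of allele 1 and allele 0. The population
    allele frequency is integrated out under an assumed Beta(a, b) prior,
    giving B(a + n1, b + n0) / B(a, b) for the ordered sequence of alleles.
    """
    return log_beta(a + n1, b + n0) - log_beta(a, b)


def partition_log_marginal(populations, a=1.0, b=1.0):
    """Sum the collapsed log likelihood over populations and unlinked SNPs.

    `populations` is a list of populations, each a list of (n1, n0)
    count pairs, one pair per SNP.
    """
    return sum(
        log_marginal(n1, n0, a, b)
        for snp_counts in populations
        for n1, n0 in snp_counts
    )


# Toy data (made up): two SNPs, six allele copies, where the first three
# copies carry allele 1 and the last three carry allele 0 at both SNPs.
merged = [[(3, 3), (3, 3)]]                  # one population
split = [[(3, 0), (3, 0)], [(0, 3), (0, 3)]]  # two populations

print(partition_log_marginal(merged))
print(partition_log_marginal(split))
```

Because no allele frequency has to be sampled, a sampler built on such a collapsed likelihood only needs to explore population assignments. In this toy comparison the two-population partition receives the higher marginal likelihood.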

Availability: http://www.stat.psu.edu/~yuzhang/tips.tar.

Citing Articles

High evolutionary potential of marine zooplankton.

Peijnenburg K, Goetze E. Ecol Evol. 2014; 3(8):2765-81.

PMID: 24567838 PMC: 3930040. DOI: 10.1002/ece3.644.


De novo inference of stratification and local admixture in sequencing studies.

Zhang Y. BMC Bioinformatics. 2013; 14 Suppl 5:S17.

PMID: 23734678 PMC: 3622634. DOI: 10.1186/1471-2105-14-S5-S17.


Joint inference of population assignment and demographic history.

Choi S, Hey J. Genetics. 2011; 189(2):561-77.

PMID: 21775468 PMC: 3189801. DOI: 10.1534/genetics.111.129205.


Characterization of a Bayesian genetic clustering algorithm based on a Dirichlet process prior and comparison among Bayesian clustering methods.

Onogi A, Nurimoto M, Morita M. BMC Bioinformatics. 2011; 12:263.

PMID: 21708038 PMC: 3161044. DOI: 10.1186/1471-2105-12-263.


The computer program STRUCTURE does not reliably identify the main genetic clusters within species: simulations and implications for human population structure.

Kalinowski S. Heredity (Edinb). 2010; 106(4):625-32.

PMID: 20683484 PMC: 3183908. DOI: 10.1038/hdy.2010.95.