» Articles » PMID: 39323740

Exact Decoding of a Sequentially Markov Coalescent Model in Genetics

Overview
Journal J Am Stat Assoc
Specialty Public Health
Date 2024 Sep 26
PMID 39323740
Authors
Affiliations
Soon will be listed here.
Abstract

In statistical genetics, the sequentially Markov coalescent (SMC) is an important family of models for approximating the distribution of genetic variation data under complex evolutionary models. Methods based on SMC are widely used in genetics and evolutionary biology, with significant applications to genotype phasing and imputation, recombination rate estimation, and inferring population history. SMC allows for likelihood-based inference using hidden Markov models (HMMs), where the latent variable represents a genealogy. Because genealogies are continuous, while HMMs are discrete, SMC requires discretizing the space of trees in a way that is awkward and creates bias. In this work, we propose a method that circumvents this requirement, enabling SMC-based inference to be performed in the natural setting of a continuous state space. We derive fast, exact procedures for frequentist and Bayesian inference using SMC. Compared to existing methods, ours requires minimal user intervention or parameter tuning, no numerical optimization or E-M, and is faster and more accurate.

Citing Articles

Accelerated Bayesian inference of population size history from recombining sequence data.

Terhorst J bioRxiv. 2024; .

PMID: 38585997 PMC: 10996539. DOI: 10.1101/2024.03.25.586640.


The solution surface of the Li-Stephens haplotype copying model.

Jin Y, Terhorst J Algorithms Mol Biol. 2023; 18(1):12.

PMID: 37559098 PMC: 10410957. DOI: 10.1186/s13015-023-00237-z.

References
1.
Kamm J, Terhorst J, Song Y . Efficient computation of the joint sample frequency spectra for multiple populations. J Comput Graph Stat. 2017; 26(1):182-194. PMC: 5319604. DOI: 10.1080/10618600.2016.1159212. View

2.
Paul J, Steinrucken M, Song Y . An accurate sequentially Markov conditional sampling distribution for the coalescent with recombination. Genetics. 2011; 187(4):1115-28. PMC: 3070520. DOI: 10.1534/genetics.110.125534. View

3.
Palamara P, Terhorst J, Song Y, Price A . High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability. Nat Genet. 2018; 50(9):1311-1317. PMC: 6145075. DOI: 10.1038/s41588-018-0177-x. View

4.
Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis G . Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet. 2012; 44(8):955-9. PMC: 3696580. DOI: 10.1038/ng.2354. View

5.
Spence J, Steinrucken M, Terhorst J, Song Y . Inference of population history using coalescent HMMs: review and outlook. Curr Opin Genet Dev. 2018; 53:70-76. PMC: 6296859. DOI: 10.1016/j.gde.2018.07.002. View