» Articles » PMID: 36656906

Fast and Accurate Joint Inference of Coancestry Parameters for Populations And/or Individuals

Overview
Journal PLoS Genet
Specialty Genetics
Date 2023 Jan 19
PMID 36656906
Authors
Affiliations
Soon will be listed here.
Abstract

We introduce a fast, new algorithm for inferring from allele count data the FST parameters describing genetic distances among a set of populations and/or unrelated diploid individuals, and a tree with branch lengths corresponding to FST values. The tree can reflect historical processes of splitting and divergence, but seeks to represent the actual genetic variance as accurately as possible with a tree structure. We generalise two major approaches to defining FST, via correlations and mismatch probabilities of sampled allele pairs, which measure shared and non-shared components of genetic variance. A diploid individual can be treated as a population of two gametes, which allows inference of coancestry coefficients for individuals as well as for populations, or a combination of the two. A simulation study illustrates that our fast method-of-moments estimation of FST values, simultaneously for multiple populations/individuals, gains statistical efficiency over pairwise approaches when the population structure is close to tree-like. We apply our approach to genome-wide genotypes from the 26 worldwide human populations of the 1000 Genomes Project. We first analyse at the population level, then a subset of individuals and in a final analysis we pool individuals from the more homogeneous populations. This flexible analysis approach gives advantages over traditional approaches to population structure/coancestry, including visual and quantitative assessments of long-standing questions about the relative magnitudes of within- and between-population genetic differences.

Citing Articles

Genetic and Phenotypic Evaluation of European Maize Landraces as a Tool for Conservation and Valorization of Agrobiodiversity.

Balconi C, Galaretto A, Malvar R, Nicolas S, Redaelli R, Andjelkovic V Biology (Basel). 2024; 13(6).

PMID: 38927334 PMC: 11201045. DOI: 10.3390/biology13060454.


An allele-sharing, moment-based estimator of global, population-specific and population-pair FST under a general model of population structure.

Goudet J, Weir B PLoS Genet. 2023; 19(11):e1010871.

PMID: 38011288 PMC: 10703327. DOI: 10.1371/journal.pgen.1010871.

References
1.
Abecasis G, Altshuler D, Auton A, Brooks L, Durbin R, Gibbs R . A map of human genome variation from population-scale sequencing. Nature. 2010; 467(7319):1061-73. PMC: 3042601. DOI: 10.1038/nature09534. View

2.
Ochoa A, Storey J . Estimating FST and kinship for arbitrary population structures. PLoS Genet. 2021; 17(1):e1009241. PMC: 7846127. DOI: 10.1371/journal.pgen.1009241. View

3.
Saitou N, Nei M . The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987; 4(4):406-25. DOI: 10.1093/oxfordjournals.molbev.a040454. View

4.
Bhatia G, Patterson N, Sankararaman S, Price A . Estimating and interpreting FST: the impact of rare variants. Genome Res. 2013; 23(9):1514-21. PMC: 3759727. DOI: 10.1101/gr.154831.113. View

5.
Weir B, Goudet J . A Unified Characterization of Population Structure and Relatedness. Genetics. 2017; 206(4):2085-2103. PMC: 5560808. DOI: 10.1534/genetics.116.198424. View