
Estimating and Interpreting FST: the Impact of Rare Variants

Overview
Journal: Genome Res
Specialty: Genetics
Date: 2013 Jul 18
PMID: 23861382
Citations: 267
Abstract

In a pair of seminal papers, Sewall Wright and Gustave Malécot introduced FST as a measure of structure in natural populations. In the decades that followed, a number of papers provided differing definitions, estimation methods, and interpretations beyond Wright's. While this diversity in methods has enabled many studies in genetics, it has also introduced confusion regarding how to estimate FST from available data. Considering this confusion, wide variation in published estimates of FST for pairs of HapMap populations is a cause for concern. These estimates changed, in some cases more than twofold, when estimates from genotyping arrays were compared with those from sequence data. Indeed, changes in FST from sequencing data might be expected due to population genetic factors affecting rare variants. While rare variants do influence the result, we show that this is largely through differences in estimation methods. Correcting for this yields estimates of FST that are much more concordant between sequence and genotype data. These differences relate to three specific issues: (1) estimating FST for a single SNP, (2) combining estimates of FST across multiple SNPs, and (3) selecting the set of SNPs used in the computation. Changes in each of these aspects of estimation may result in FST estimates that are highly divergent from one another. Here, we clarify these issues and propose solutions.
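Issues (1) and (2) can be illustrated with a small sketch. The abstract does not say which single-SNP estimator or combination rule it proposes, so the choices here are assumptions made for illustration: a Hudson-style estimator for a single SNP, and two ways of combining SNPs (a ratio of averages versus an average of ratios). The function names and the allele frequencies are hypothetical, not data from the paper.

```python
# Illustrative sketch only: the estimator and the example data are assumptions.

def hudson_components(p1, n1, p2, n2):
    """Numerator and denominator of a Hudson-style FST estimate for one SNP.

    p1, p2: sample allele frequencies in populations 1 and 2.
    n1, n2: number of sampled chromosomes in each population (must be > 1).
    """
    num = (p1 - p2) ** 2 - p1 * (1 - p1) / (n1 - 1) - p2 * (1 - p2) / (n2 - 1)
    den = p1 * (1 - p2) + p2 * (1 - p1)
    return num, den


def fst_ratio_of_averages(snps):
    """Combine SNPs by summing numerators and denominators, then dividing."""
    comps = [hudson_components(*snp) for snp in snps]
    total_num = sum(num for num, _ in comps)
    total_den = sum(den for _, den in comps)
    return total_num / total_den


def fst_average_of_ratios(snps):
    """Combine SNPs by averaging per-SNP ratios (SNPs with a zero denominator are skipped)."""
    comps = [hudson_components(*snp) for snp in snps]
    ratios = [num / den for num, den in comps if den > 0]
    return sum(ratios) / len(ratios)


# Hypothetical data: one common SNP and one rare SNP, as (p1, n1, p2, n2).
example_snps = [
    (0.50, 200, 0.30, 200),
    (0.01, 200, 0.00, 200),
]

if __name__ == "__main__":
    print("ratio of averages:", fst_ratio_of_averages(example_snps))
    print("average of ratios:", fst_average_of_ratios(example_snps))
```

In this toy input the rare SNP has a small per-SNP ratio, so the average of ratios is pulled down, whereas the ratio of averages is dominated by the common SNP. This shows how the combination rule and the set of SNPs included, issues (2) and (3), can shift the reported value.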

Citing Articles

Chromosome-level reference genome assembly of the gyrfalcon (Falco rusticolus) and population genomics offer insights into the falcon population in Mongolia.

Al-Ajli F, Formenti G, Fedrigo O, Tracey A, Sims Y, Howe K Sci Rep. 2025; 15(1):4154.

PMID: 39900672 PMC: 11790892. DOI: 10.1038/s41598-025-88216-9.


Subcontinental Genetic Diversity in the Research Program: Implications for Biomedical Research.

Gouveia M, Meeks K, Borda V, Leal T, Kehdy F, Mogire R bioRxiv. 2025.

PMID: 39829860 PMC: 11741438. DOI: 10.1101/2025.01.09.632250.


Characterizing substructure via mixture modeling in large-scale genetic summary statistics.

Stoneman H, Price A, Trout N, Lamont R, Tifour S, Pozdeyev N Am J Hum Genet. 2025; 112(2):235-253.

PMID: 39824191 PMC: 11866976. DOI: 10.1016/j.ajhg.2024.12.007.


Genomic Insights Into Red Squirrels in Scotland Reveal Loss of Heterozygosity Associated With Extreme Founder Effects.

Marr M, Humble E, Lurz P, Wilson L, Milne E, Beckmann K Evol Appl. 2025; 18(1):e70072.

PMID: 39822659 PMC: 11735740. DOI: 10.1111/eva.70072.


No evidence for sex-differential transcriptomes driving genome-wide sex-differential natural selection.

Ming M, Cheng C, Kirkpatrick M, Harpak A Am J Hum Genet. 2025; 112(2):254-260.

PMID: 39814022 PMC: 11866945. DOI: 10.1016/j.ajhg.2024.12.016.

