Population Structure and Eigenanalysis

Overview
Journal: PLoS Genet
Specialty: Genetics
Date: 2006 Dec 30
PMID: 17194218
Citations: 2555
Abstract

Current methods for inferring population structure from genetic data do not provide formal significance tests for population differentiation. We discuss an approach to studying population structure (principal components analysis) that was first applied to genetic data by Cavalli-Sforza and colleagues. We place the method on a solid statistical footing, using results from modern statistics to develop formal significance tests. We also uncover a general "phase change" phenomenon about the ability to detect structure in genetic data, which emerges from the statistical theory we use, and has an important implication for the ability to discover structure in genetic data: for a fixed but large dataset size, divergence between two populations (as measured, for example, by a statistic like FST) below a threshold is essentially undetectable, but a little above threshold, detection will be easy. This means that we can predict the dataset size needed to detect structure.
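
The detectability threshold described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' implementation. It assumes a Balding-Nichols model for two equally sized populations, a per-marker normalization by sqrt(2p(1-p)), and illustrative dataset sizes. The function names (`simulate_genotypes`, `pca_eigen`, `summarize`) are hypothetical. It reports the ratio of the largest to the second-largest eigenvalue and how well the sign of the first principal component recovers the population labels.

```python
import numpy as np


def simulate_genotypes(fst, n_markers, n_per_pop, rng):
    """Simulate two populations (assumption: Balding-Nichols model)."""
    # Ancestral allele frequencies, kept away from 0 and 1.
    p_anc = rng.uniform(0.1, 0.9, size=n_markers)
    a = p_anc * (1 - fst) / fst
    b = (1 - p_anc) * (1 - fst) / fst
    # Independent population-specific frequencies, shape (2, n_markers).
    p_pop = rng.beta(a, b, size=(2, n_markers))
    # Diploid genotypes coded as 0, 1, or 2 copies of the allele.
    geno = np.vstack([
        rng.binomial(2, p_pop[k], size=(n_per_pop, n_markers))
        for k in range(2)
    ])
    labels = np.repeat([0, 1], n_per_pop)
    return geno.astype(float), labels


def pca_eigen(geno):
    """Eigen-decompose the individuals-by-individuals covariance matrix."""
    p_hat = geno.mean(axis=0) / 2
    keep = (p_hat > 0) & (p_hat < 1)  # drop monomorphic markers
    g, p_hat = geno[:, keep], p_hat[keep]
    # Center each marker and scale to roughly unit variance (assumption).
    m = (g - 2 * p_hat) / np.sqrt(2 * p_hat * (1 - p_hat))
    x = m @ m.T / m.shape[1]
    vals, vecs = np.linalg.eigh(x)  # eigenvalues in ascending order
    return vals[::-1], vecs[:, ::-1]


def summarize(fst, n_markers=2000, n_per_pop=50, seed=0):
    """Return (top eigenvalue ratio, label accuracy of the sign of PC1)."""
    rng = np.random.default_rng(seed)
    geno, labels = simulate_genotypes(fst, n_markers, n_per_pop, rng)
    vals, vecs = pca_eigen(geno)
    split = (vecs[:, 0] > 0).astype(int)
    # The sign of an eigenvector is arbitrary, so score both orientations.
    acc = max((split == labels).mean(), (split != labels).mean())
    return vals[0] / vals[1], acc


if __name__ == "__main__":
    for fst in (0.0001, 0.05):
        ratio, acc = summarize(fst)
        print(f"FST={fst}: eigenvalue ratio={ratio:.2f}, PC1 accuracy={acc:.2f}")
```

With these illustrative sizes, the larger divergence value should give a leading eigenvalue that stands well clear of the rest, while the smaller value should leave it close to the bulk. This only illustrates the qualitative behavior; the formal significance test developed in the paper is not reproduced here.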

Citing Articles

High continuity of forager ancestry in the Neolithic period of the eastern Maghreb.

Lipson M, Ringbauer H, Lucarini G, Aouadi N, Aoudia L, Belhouchet L Nature. 2025; .

PMID: 40074896 DOI: 10.1038/s41586-025-08699-4.


Genome-Wide Association Study Identifying a Novel Gene Related to a History of Febrile Convulsions in Patients With Focal Epilepsy.

Kim J, Lee H, Park H, Lee J, Kim W J Clin Neurol. 2025; 21(2):123-130.

PMID: 40065453 PMC: 11896740. DOI: 10.3988/jcn.2024.0296.


Determining population structure from k-mer frequencies.

Hrytsenko Y, Daniels N, Schwartz R PeerJ. 2025; 13:e18939.

PMID: 40061228 PMC: 11890038. DOI: 10.7717/peerj.18939.


Convergent evolution of complex adaptive traits modulates angiogenesis in high-altitude Andean and Himalayan human populations.

Ferraretti G, Rill A, Abondio P, Smith K, Ojeda-Granados C, De Fanti S Commun Biol. 2025; 8(1):377.

PMID: 40050470 PMC: 11885840. DOI: 10.1038/s42003-025-07813-6.


Ancestral origins and post-admixture adaptive evolution of highland Tajiks.

Wen J, Liu J, Feng Q, Lu Y, Yuan K, Zhang X Natl Sci Rev. 2025; 11(9):nwae284.

PMID: 40040643 PMC: 11879426. DOI: 10.1093/nsr/nwae284.

