
CLUMPP: a Cluster Matching and Permutation Program for Dealing with Label Switching and Multimodality in Analysis of Population Structure

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2007 May 9
PMID: 17485429
Citations: 1877
Abstract

Motivation: Clustering of individuals into populations on the basis of multilocus genotypes is informative in a variety of settings. In population-genetic clustering algorithms, such as BAPS, STRUCTURE and TESS, individual multilocus genotypes are partitioned over a set of clusters, often using unsupervised approaches that involve stochastic simulation. As a result, replicate cluster analyses of the same data may produce several distinct solutions for estimated cluster membership coefficients, even though the same initial conditions were used. Major differences among clustering solutions have two main sources: (1) 'label switching' of clusters across replicates, caused by the arbitrary way in which clusters in an unsupervised analysis are labeled, and (2) 'genuine multimodality,' truly distinct solutions across replicates.

Results: To facilitate the interpretation of population-genetic clustering results, we describe three algorithms for aligning multiple replicate analyses of the same data set. We have implemented these algorithms in the computer program CLUMPP (CLUster Matching and Permutation Program). We illustrate the use of CLUMPP by aligning the cluster membership coefficients from 100 replicate cluster analyses of 600 chickens from 20 different breeds.

Availability: CLUMPP is freely available at http://rosenberglab.bioinformatics.med.umich.edu/clumpp.html.
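
The label-switching problem described above can be illustrated with a small sketch. This is not CLUMPP's implementation and not one of the three algorithms the abstract refers to: it is a minimal example that assumes only two replicates, a small number of clusters, and a sum-of-squared-differences cost, all of which are choices made here for illustration. The function name `align_replicate` and the toy matrices are hypothetical. Each matrix holds cluster membership coefficients, with one row per individual and one column per cluster.

```python
from itertools import permutations


def align_replicate(reference, replicate):
    """Relabel the clusters (columns) of `replicate` to best match `reference`.

    Tries every permutation of the cluster labels and keeps the one with the
    smallest sum of squared differences. Exhaustive search is only practical
    for small numbers of clusters; the cost function is an assumption made
    for this sketch.
    """
    n_clusters = len(reference[0])
    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(n_clusters)):
        # Column perm[k] of the replicate is compared with column k of the reference.
        cost = sum(
            (ref_row[k] - rep_row[perm[k]]) ** 2
            for ref_row, rep_row in zip(reference, replicate)
            for k in range(n_clusters)
        )
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    # Reorder the replicate's columns according to the best permutation found.
    aligned = [[row[j] for j in best_perm] for row in replicate]
    return best_perm, aligned


# Toy data (hypothetical): three individuals, three clusters.
q_ref = [
    [0.9, 0.1, 0.0],
    [0.1, 0.8, 0.1],
    [0.0, 0.2, 0.8],
]
# Same solution as q_ref, but with the cluster labels shuffled.
q_rep = [
    [0.0, 0.9, 0.1],
    [0.1, 0.1, 0.8],
    [0.8, 0.0, 0.2],
]

perm, q_aligned = align_replicate(q_ref, q_rep)
print(perm)
print(q_aligned)
```

In this toy case the two replicates differ only by label switching, so after relabeling they agree exactly. Under genuine multimodality, no permutation would bring the replicates into close agreement, and the remaining cost would stay large.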

Citing Articles

Population panmixia of the pelagic shrimp Lucensosergia lucens between Japanese and Taiwanese waters in the western North Pacific.

Hirai J, Hsiao S, Yeh H, Nishikawa J. Sci Rep. 2025; 15(1):7040.

PMID: 40044689 PMC: 11882780. DOI: 10.1038/s41598-025-91208-4.


Migration of wheat stripe rust from the primary oversummering region to neighboring regions in China.

Li Y, Zhang S, Liu D, Zhang T, Zhang Z, Zhao J. Commun Biol. 2025; 8(1):350.

PMID: 40033097 PMC: 11876435. DOI: 10.1038/s42003-025-07789-3.


Characterization of Indian waxy and non-waxy maize germplasm for genetic differentiation through SNP genotyping.

Venadan S, Das A, Dixit S, Arora A, Kumar B, Hossain F. Mol Genet Genomics. 2025; 300(1):27.

PMID: 40011230 DOI: 10.1007/s00438-024-02222-6.


Genetic structure and designing a preliminary core collection of in China based on 12 microsatellites markers.

Lei X, Su X, Zhou C, Jiang S, Yuan X, Zhao Y. PeerJ. 2025; 13:e18909.

PMID: 39995990 PMC: 11849519. DOI: 10.7717/peerj.18909.


Negative frequency-dependent selection through variations in seedling fitness due to genetic differentiation of parents' pair in a tropical rainforest tree, (Dipterocarpaceae).

Tani N, Ng C, Lee S, Lee C, Muhammad N, Kondo T. Front Genet. 2025; 16:1552024.

PMID: 39981260 PMC: 11839620. DOI: 10.3389/fgene.2025.1552024.