» Articles » PMID: 20932336

Misty Mountain Clustering: Application to Fast Unsupervised Flow Cytometry Gating

Overview
Publisher Biomed Central
Specialty Biology
Date 2010 Oct 12
PMID 20932336
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Background: There are many important clustering questions in computational biology for which no satisfactory method exists. Automated clustering algorithms, when applied to large, multidimensional datasets, such as flow cytometry data, prove unsatisfactory in terms of speed, problems with local minima or cluster shape bias. Model-based approaches are restricted by the assumptions of the fitting functions. Furthermore, model based clustering requires serial clustering for all cluster numbers within a user defined interval. The final cluster number is then selected by various criteria. These supervised serial clustering methods are time consuming and frequently different criteria result in different optimal cluster numbers. Various unsupervised heuristic approaches that have been developed such as affinity propagation are too expensive to be applied to datasets on the order of 106 points that are often generated by high throughput experiments.

Results: To circumvent these limitations, we developed a new, unsupervised density contour clustering algorithm, called Misty Mountain, that is based on percolation theory and that efficiently analyzes large data sets. The approach can be envisioned as a progressive top-down removal of clouds covering a data histogram relief map to identify clusters by the appearance of statistically distinct peaks and ridges. This is a parallel clustering method that finds every cluster after analyzing only once the cross sections of the histogram. The overall run time for the composite steps of the algorithm increases linearly by the number of data points. The clustering of 106 data points in 2D data space takes place within about 15 seconds on a standard laptop PC. Comparison of the performance of this algorithm with other state of the art automated flow cytometry gating methods indicate that Misty Mountain provides substantial improvements in both run time and in the accuracy of cluster assignment.

Conclusions: Misty Mountain is fast, unbiased for cluster shape, identifies stable clusters and is robust to noise. It provides a useful, general solution for multidimensional clustering problems. We demonstrate its suitability for automated gating of flow cytometry data.

Citing Articles

[Automatic clustering method of flow cytometry data based on -distributed stochastic neighbor embedding].

Meng X, Wang Y, Zhu L Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2018; 35(5):697-704.

PMID: 30370707 PMC: 9935266. DOI: 10.7507/1001-5515.201802037.


[Cell data clustering method in flow cytometry based on kernel principal component analysis].

Ma S, Dong M, Zhang F, Pan Z, Zhu L Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2018; 34(1):115-22.

PMID: 29717598 PMC: 9935356.


Toward deterministic and semiautomated SPADE analysis.

Qiu P Cytometry A. 2017; 91(3):281-289.

PMID: 28234411 PMC: 5410769. DOI: 10.1002/cyto.a.23068.


Unfold High-Dimensional Clouds for Exhaustive Gating of Flow Cytometry Data.

Qiu P IEEE/ACM Trans Comput Biol Bioinform. 2015; 11(6):1045-51.

PMID: 26357042 PMC: 4866872. DOI: 10.1109/TCBB.2014.2321403.


Methods for discovery and characterization of cell subsets in high dimensional mass cytometry data.

Diggins K, Ferrell Jr P, Irish J Methods. 2015; 82:55-63.

PMID: 25979346 PMC: 4468028. DOI: 10.1016/j.ymeth.2015.05.008.


References
1.
Wilkins M, Hardy S, Boddy L, Morris C . Comparison of five clustering algorithms to classify phytoplankton from flow cytometry data. Cytometry. 2001; 44(3):210-7. DOI: 10.1002/1097-0320(20010701)44:3<210::aid-cyto1113>3.0.co;2-y. View

2.
Frey B, Dueck D . Clustering by passing messages between data points. Science. 2007; 315(5814):972-6. DOI: 10.1126/science.1136800. View

3.
Kirkpatrick S, Gelatt Jr C, Vecchi M . Optimization by simulated annealing. Science. 1983; 220(4598):671-80. DOI: 10.1126/science.220.4598.671. View

4.
Lizard G . Flow cytometry analyses and bioinformatics: interest in new softwares to optimize novel technologies and to favor the emergence of innovative concepts in cell research. Cytometry A. 2007; 71(9):646-7. DOI: 10.1002/cyto.a.20444. View

5.
Beckman R, Salzman G, Stewart C . Classification and regression trees for bone marrow immunophenotyping. Cytometry. 1995; 20(3):210-7. DOI: 10.1002/cyto.990200304. View