Dynamic Clustering of Gene Expression
Overview
Authors
Affiliations
It is well accepted that genes are simultaneously involved in multiple biological processes and that genes are coordinated over the duration of such events. Unfortunately, clustering methodologies that group genes for the purpose of novel gene discovery fail to acknowledge the dynamic nature of biological processes and provide static clusters, even when the expression of genes is assessed across time or developmental stages. By taking advantage of techniques and theories from time frequency analysis, periodic gene expression profiles are dynamically clustered based on the assumption that different spectral frequencies characterize different biological processes. A two-step cluster validation approach is proposed to statistically estimate both the optimal number of clusters and to distinguish significant clusters from noise. The resulting clusters reveal coordinated coexpressed genes. This novel dynamic clustering approach has broad applicability to a vast range of sequential data scenarios where the order of the series is of interest.
Constantino C, Carvalho A, Vinga S BioData Min. 2021; 14(1):25.
PMID: 33853663 PMC: 8048345. DOI: 10.1186/s13040-021-00257-8.
A multiobjective multi-view cluster ensemble technique: Application in patient subclassification.
Mitra S, Saha S PLoS One. 2019; 14(5):e0216904.
PMID: 31120942 PMC: 6533037. DOI: 10.1371/journal.pone.0216904.
A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data.
Shirkhorshidi A, Aghabozorgi S, Wah T PLoS One. 2015; 10(12):e0144059.
PMID: 26658987 PMC: 4686108. DOI: 10.1371/journal.pone.0144059.