
Clustering of Time-course Gene Expression Data Using a Mixed-effects Model with B-splines

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2003 Mar 4
PMID: 12611802
Citations: 80
Abstract

Motivation: Time-course gene expression data are often measured to study dynamic biological systems and gene regulatory networks. To account for the time dependency of gene expression measurements and the noisy nature of microarray data, a mixed-effects model using B-splines was introduced. This paper further explores that mixed-effects model for analyzing time-course gene expression data and for clustering genes in a mixture model framework.
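To make the B-spline component concrete, the following is a minimal sketch of how a B-spline basis can be evaluated at a set of sampling times using the standard Cox-de Boor recursion. It is an illustration only, not the authors' Matlab code; the function names, knot vector, and time points are hypothetical choices made for this example.

```python
def bspline_basis(i, degree, knots, t):
    """Value at t of the i-th B-spline basis function (Cox-de Boor recursion)."""
    if degree == 0:
        # Half-open intervals: a point equal to the last knot gets 0 here.
        return 1.0 if knots[i] <= t < knots[i + 1] else 0.0
    left = right = 0.0
    span_left = knots[i + degree] - knots[i]
    if span_left > 0:  # skip terms with zero-width spans (repeated knots)
        left = (t - knots[i]) / span_left * bspline_basis(i, degree - 1, knots, t)
    span_right = knots[i + degree + 1] - knots[i + 1]
    if span_right > 0:
        right = ((knots[i + degree + 1] - t) / span_right
                 * bspline_basis(i + 1, degree - 1, knots, t))
    return left + right


def design_matrix(times, knots, degree=3):
    """Rows are time points, columns are basis functions."""
    n_basis = len(knots) - degree - 1
    return [[bspline_basis(i, degree, knots, t) for i in range(n_basis)]
            for t in times]


# Hypothetical setup: cubic basis on [0, 1) with one interior knot at 0.5.
knots = [0, 0, 0, 0, 0.5, 1, 1, 1, 1]
times = [0.0, 0.1, 0.25, 0.5, 0.75, 0.9]
B = design_matrix(times, knots)
```

Each row of `B` holds the basis values at one time point. A smooth curve is then a linear combination of the columns, so fitting a curve reduces to estimating one coefficient per basis function.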

Results: After fitting the mixture model in the framework of the mixed-effects model using an EM algorithm, we obtained the smooth mean gene expression curve for each cluster. For each gene, we obtained the best linear unbiased smooth estimate of its gene expression trajectory over time, combining data from that gene and other genes in the same cluster. Simulated data indicate that the methods can effectively group noisy curves into clusters that differ in either the shapes of the curves or the times to their peaks. We further demonstrate the proposed method by clustering yeast genes based on their cell cycle gene expression data and human genes based on the temporal transcriptional response of fibroblasts to serum. Clear periodic patterns and varying times to peaks are observed for different clusters of the cell-cycle regulated genes. Analysis of the human fibroblast data shows seven distinct transcriptional response profiles with biological relevance.
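For orientation, a mixture of mixed-effects spline models of the kind described here is commonly written as below. The notation is assumed for illustration and is not taken from the paper: $y_i$ is the expression vector of gene $i$, $B_i$ the B-spline basis evaluated at that gene's time points, $b(t)$ the basis evaluated at a single time $t$, $\beta_k$ the mean-curve coefficients of cluster $k$, $\gamma_i$ the gene-specific random coefficients, $\pi_k$ the mixing proportions, and $\phi$ the multivariate normal density.

```latex
\begin{align*}
% Model for gene i, given that it belongs to cluster k
y_i &= B_i \beta_k + B_i \gamma_i + \varepsilon_i,
  \qquad \gamma_i \sim N(0, \Gamma_k), \quad \varepsilon_i \sim N(0, \sigma^2 I) \\
% Marginal covariance of y_i within cluster k
V_{ik} &= B_i \Gamma_k B_i^{\top} + \sigma^2 I \\
% E-step: posterior probability that gene i belongs to cluster k
\tau_{ik} &= \frac{\pi_k \, \phi(y_i;\, B_i \beta_k,\, V_{ik})}
                  {\sum_{l} \pi_l \, \phi(y_i;\, B_i \beta_l,\, V_{il})} \\
% Best linear unbiased predictor of the random coefficients and the smoothed trajectory
\hat{\gamma}_i &= \Gamma_k B_i^{\top} V_{ik}^{-1} (y_i - B_i \beta_k),
  \qquad \hat{y}_i(t) = b(t)^{\top} (\hat{\beta}_k + \hat{\gamma}_i)
\end{align*}
```

Under this formulation the predictor $\hat{\gamma}_i$ shrinks each gene's curve toward its cluster mean, which is how the estimate for one gene borrows information from the other genes in the same cluster.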

Availability: Matlab programs are available on request from the authors.

Citing Articles

clusterMLD: An Efficient Hierarchical Clustering Method for Multivariate Longitudinal Data.

Zhou J, Zhang Y, Tu W. J Comput Graph Stat. 2023; 32(3):1131-1144.

PMID: 37859643 PMC: 10584088. DOI: 10.1080/10618600.2022.2149540.


Robust clustering of COVID-19 cases across U.S. counties using mixtures of asymmetric time series models with time varying and freely indexed covariates.

Maleki M, Bidram H, Wraith D. J Appl Stat. 2023; 50(11-12):2648-2662.

PMID: 37529575 PMC: 10388823. DOI: 10.1080/02664763.2021.2019688.


Functional mixed effects clustering with application to longitudinal urologic chronic pelvic pain syndrome symptom data.

Guo W, You M, Yi J, Pontari M, Landis J. J Am Stat Assoc. 2023; 117(540):1631-1641.

PMID: 36845296 PMC: 9949755. DOI: 10.1080/01621459.2022.2066536.


ALOHA: Aggregated local extrema splines for high-throughput dose-response analysis.

Davidson S, Wheeler M, Auerbach S, Sivaganesan S, Medvedovic M. Comput Toxicol. 2022; 21.

PMID: 35083394 PMC: 8785973. DOI: 10.1016/j.comtox.2021.100196.


Parameter Estimation and Variable Selection for Big Systems of Linear Ordinary Differential Equations: A Matrix-Based Approach.

Wu L, Qiu X, Yuan Y, Wu H. J Am Stat Assoc. 2021; 114(526):657-667.

PMID: 34385718 PMC: 8357247. DOI: 10.1080/01621459.2017.1423074.