» Articles » PMID: 12935332

Continuous Representations of Time-series Gene Expression Data

Overview
Journal J Comput Biol
Date 2003 Aug 26
PMID 12935332
Citations 83
Authors
Affiliations
Soon will be listed here.
Abstract

We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.

Citing Articles

Integrating Gene Expression Data into Single-Step Method (ssBLUP) Improves Genomic Prediction Accuracy for Complex Traits of Duroc × Erhualian F Pig Population.

Xu F, Che Z, Qiao J, Han P, Miao N, Dai X Curr Issues Mol Biol. 2024; 46(12):13713-13724.

PMID: 39727947 PMC: 11727526. DOI: 10.3390/cimb46120819.


Integrating patients in time series clinical transcriptomics data.

Hasanaj E, Mathur S, Bar-Joseph Z Bioinformatics. 2024; 40(Suppl 1):i151-i159.

PMID: 38940139 PMC: 11256926. DOI: 10.1093/bioinformatics/btae241.


Cell-specific imputation of drug connectivity mapping with incomplete data.

Sapashnik D, Newman R, Pietras C, Zhou D, Devkota K, Qu F PLoS One. 2023; 18(2):e0278289.

PMID: 36795645 PMC: 9934325. DOI: 10.1371/journal.pone.0278289.


A machine-learning approach for long-term prediction of experimental cardiac action potential time series using an autoencoder and echo state networks.

Shahi S, Fenton F, Cherry E Chaos. 2022; 32(6):063117.

PMID: 35778132 PMC: 9188460. DOI: 10.1063/5.0087812.


Prediction of chaotic time series using recurrent neural networks and reservoir computing techniques: A comparative study.

Shahi S, Fenton F, Cherry E Mach Learn Appl. 2022; 8.

PMID: 35755176 PMC: 9230140. DOI: 10.1016/j.mlwa.2022.100300.