Hello. Sign in to personalize your visit. New user? Register now.  
Journal of Computational Biology
Continuous Representations of Time-Series Gene Expression Data

To cite this article:
Ziv Bar-Joseph, Georg K. Gerber, David K. Gifford, Tommi S. Jaakkola, Itamar Simon. Journal of Computational Biology. June 2003, 10(3-4): 341-356. doi:10.1089/10665270360688057.

Published in Volume: 10 Issue 3-4: July 5, 2004

Full Text: • PDF for printing (297.2 KB) • PDF w/ links (284.3 KB)


Ziv Bar-Joseph
MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139
Georg K. Gerber
MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139
David K. Gifford
MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139
Tommi S. Jaakkola
MIT Artificial Intelligence Laboratory, 200 Technology Square, Cambridge, MA 02139
Itamar Simon
Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142

We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.

Free first page

This paper was cited by:

Clustered alignments of gene-expression time series data
A. A. Smith, A. Vollrath, C. A. Bradfield, M. Craven
Bioinformatics. Jul 2009, Vol. 25, No. 12: i119-i1127
CrossRef
Functional maximum-likelihood estimation of ARH(p) models
M. D. Ruiz-Medina, R. Salmerón
Stochastic Environmental Research and Risk Assessment. Mar 2009
CrossRef
Timing of Gene Expression Responses to Environmental Changes
Gal Chechik, Daphne Koller
Journal of Computational Biology. Feb 2009, Vol. 16, No. 2: 279-290
Abstract | Full Text PDF | Supplementary Material | Reprints & Permissions
Clustering of Gene Expression Data Based on Shape Similarity
Travis J. Hestilow, Yufei Huang
EURASIP Journal on Bioinformatics and Systems Biology. Feb 2009, Vol. 2009: 1-13
CrossRef
Expression profiling analysis for genes related to meat quality and carcass traits during postnatal development of backfat in two pig breeds
MingZhou Li, Li Zhu, XueWei Li, SuRong Shuai, XiaoKun Teng, HuaSheng Xiao, Qiang Li, Lei Chen, YuJiao Guo, JinYong Wang
Science in China Series C: Life Sciences. Sep 2008, Vol. 51, No. 8: 718-733
CrossRef
Discovering Statistically Significant Periodic Gene Expression
Jie Chen, Kuang-Chao Chang
International Statistical Review. Sep 2008, Vol. 76, No. 2: 228-246
CrossRef
Gene profiling for determining pluripotent genes in a time course microarray experiment
J. Tuke, G. F. V. Glonek, P. J. Solomon
Biostatistics. Jun 2008, Vol. 10, No. 1: 80-93
CrossRef
Influence of mRNA decay rates on the computational prediction of transcription rate profiles from gene expression profiles
Chi-Fang Chin, Arthur Chun-Chieh Shih, Kuo-Chin Fan
Journal of Biosciences. Jan 2008, Vol. 32, No. S3: 1251-1262
CrossRef
Significance analysis of time-series transcriptomic data: A methodology that enables the identification and further exploration of the differentially expressed genes at each time-point
Bhaskar Dutta, Robert Snyder, Maria I. Klapa
Biotechnology and Bioengineering. Nov 2007, Vol. 98, No. 3: 668-678
CrossRef
Bayesian State Space Models for Inferring and Predicting Temporal Gene Expression Profiles
Yulan Liang, Arpad Kelemen
Biometrical Journal. Jul 2007, Vol. 49, No. 6: 801-814
CrossRef
Reconstructing dynamic regulatory maps
Jason Ernst, Oded Vainas, Christopher T Harbison, Itamar Simon, Ziv Bar-Joseph
Molecular Systems Biology. Feb 2007, Vol. 3
CrossRef
Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives
S. Déjean, P. G. P. Martin, A. Baccini, P. Besse
EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-11
CrossRef
Analysis of Gene Coexpression by B-Spline Based CoD Estimation
Huai Li, Yu Sun, Ming Zhan
EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-11
CrossRef
The Wavelet-Based Cluster Analysis for Temporal Gene Expression Data
J. Z. Song, K. M. Duan, T. Ware, M. Surette
EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-8
CrossRef
A Bayesian Mixture Model for Partitioning Gene Expression Data
Chuan Zhou, Jon Wakefield
Biometrics. Jul 2006, Vol. 62, No. 2: 515-525
CrossRef
Combined static and dynamic analysis for determining the quality of time-series expression profiles
Itamar Simon, Zahava Siegfried, Jason Ernst, Ziv Bar-Joseph
Nature Biotechnology. Jan 2006, Vol. 23, No. 12: 1503-1508
CrossRef
All articles
Previous Next