|
Journal of Computational Biology
Continuous Representations of Time-Series Gene Expression Data
To cite this article:
Ziv Bar-Joseph, Georg K. Gerber, David K. Gifford, Tommi S. Jaakkola, Itamar Simon.
Journal of Computational Biology.
June 2003,
10(3-4): 341-356.
doi:10.1089/10665270360688057.
Published in Volume: 10 Issue 3-4: July 5, 2004
Ziv Bar-Joseph MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139 Georg K. Gerber MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139 David K. Gifford MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139 Tommi S. Jaakkola MIT Artificial Intelligence Laboratory, 200 Technology Square, Cambridge, MA 02139 Itamar Simon Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142 We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.  This paper was cited by:Clustered alignments of gene-expression time series data A. A. Smith, A. Vollrath, C. A. Bradfield, M. Craven Bioinformatics. Jul 2009, Vol. 25, No. 12: i119-i1127 CrossRef Functional maximum-likelihood estimation of ARH(p) models M. D. Ruiz-Medina, R. Salmerón Stochastic Environmental Research and Risk Assessment. Mar 2009 CrossRef Timing of Gene Expression Responses to Environmental Changes Gal Chechik, Daphne Koller Journal of Computational Biology. Feb 2009, Vol. 16, No. 2: 279-290 Abstract | Full Text PDF
|
Supplementary Material | Reprints & PermissionsClustering of Gene Expression Data Based on Shape Similarity Travis J. Hestilow, Yufei Huang EURASIP Journal on Bioinformatics and Systems Biology. Feb 2009, Vol. 2009: 1-13 CrossRef Expression profiling analysis for genes related to meat quality and carcass traits during postnatal development of backfat in two pig breeds MingZhou Li, Li Zhu, XueWei Li, SuRong Shuai, XiaoKun Teng, HuaSheng Xiao, Qiang Li, Lei Chen, YuJiao Guo, JinYong Wang Science in China Series C: Life Sciences. Sep 2008, Vol. 51, No. 8: 718-733 CrossRef Discovering Statistically Significant Periodic Gene Expression Jie Chen, Kuang-Chao Chang International Statistical Review. Sep 2008, Vol. 76, No. 2: 228-246 CrossRef Gene profiling for determining pluripotent genes in a time course microarray experiment J. Tuke, G. F. V. Glonek, P. J. Solomon Biostatistics. Jun 2008, Vol. 10, No. 1: 80-93 CrossRef Influence of mRNA decay rates on the computational prediction of transcription rate profiles from gene expression profiles Chi-Fang Chin, Arthur Chun-Chieh Shih, Kuo-Chin Fan Journal of Biosciences. Jan 2008, Vol. 32, No. S3: 1251-1262 CrossRef Significance analysis of time-series transcriptomic data: A methodology that enables the identification and further exploration of the differentially expressed genes at each time-point Bhaskar Dutta, Robert Snyder, Maria I. Klapa Biotechnology and Bioengineering. Nov 2007, Vol. 98, No. 3: 668-678 CrossRef Bayesian State Space Models for Inferring and Predicting Temporal Gene Expression Profiles Yulan Liang, Arpad Kelemen Biometrical Journal. Jul 2007, Vol. 49, No. 6: 801-814 CrossRef Reconstructing dynamic regulatory maps Jason Ernst, Oded Vainas, Christopher T Harbison, Itamar Simon, Ziv Bar-Joseph Molecular Systems Biology. Feb 2007, Vol. 3 CrossRef Clustering Time-Series Gene Expression Data Using Smoothing Spline Derivatives S. Déjean, P. G. P. Martin, A. Baccini, P. Besse EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-11 CrossRef Analysis of Gene Coexpression by B-Spline Based CoD Estimation Huai Li, Yu Sun, Ming Zhan EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-11 CrossRef The Wavelet-Based Cluster Analysis for Temporal Gene Expression Data J. Z. Song, K. M. Duan, T. Ware, M. Surette EURASIP Journal on Bioinformatics and Systems Biology. Feb 2007, Vol. 2007: 1-8 CrossRef A Bayesian Mixture Model for Partitioning Gene Expression Data Chuan Zhou, Jon Wakefield Biometrics. Jul 2006, Vol. 62, No. 2: 515-525 CrossRef Combined static and dynamic analysis for determining the quality of time-series expression profiles Itamar Simon, Zahava Siegfried, Jason Ernst, Ziv Bar-Joseph Nature Biotechnology. Jan 2006, Vol. 23, No. 12: 1503-1508 CrossRef
|
|