|
Journal of Computational Biology
Learning Gene Functional Classifications from Multiple Data Types
To cite this article:
Paul Pavlidis, Jason Weston, Jinsong Cai, William Stafford Noble.
Journal of Computational Biology.
April 2002,
9(2): 401-411.
doi:10.1089/10665270252935539.
Paul Pavlidis Columbia Genome Center, Columbia University, New York, NY 10027 Jason Weston BIOwulf Technologies LLC, 305 Broadway, 9th Floor, New York, NY 10007 Jinsong Cai Department of Medical Informatics, Columbia University, New York, NY, 10027 William Stafford Noble Department of Computer Science, Columbia University, New York, NY, 10027 In our attempts to understand cellular function at the molecular level, we must be able to synthesize information from disparate types of genomic data. We consider the problem of inferring gene functional classifications from a heterogeneous data set consisting of DNA microarray expression measurements and phylogenetic profiles from whole-genome sequence comparisons. We demonstrate the application of the support vector machine (SVM) learning algorithm to this functional inference task. Our results suggest the importance of exploiting prior information about the heterogeneity of the data. In particular, we propose an SVM kernel function that is explicitly heterogeneous. In addition, we describe feature scaling methods for further exploiting prior knowledge of heterogeneity by giving each data type different weights.  This paper was cited by:Physical protein-protein interactions predicted from microarrays T.-t. Soong, K. O. Wrzeszczynski, B. Rost Bioinformatics. Oct 2008, Vol. 24, No. 22: 2608-2614 CrossRef A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics B.-J. M. Webb-Robertson, W. R. Cannon, C. S. Oehmen, A. R. Shah, V. Gurumoorthi, M. S. Lipton, K. M. Waters Bioinformatics. Jun 2008, Vol. 24, No. 13: 1503-1509 CrossRef Gene Expression Alterations in the Sphingolipid Metabolism Pathways during Progression of Dementia and Alzheimer’s Disease: A Shift Toward Ceramide Accumulation at the Earliest Recognizable Stages of Alzheimer’s Disease? Pavel Katsel, Celeste Li, Vahram Haroutunian Neurochemical Research. Apr 2007, Vol. 32, No. 4-5: 845-856 CrossRef Computational prediction of cancer-gene function Pingzhao Hu, Gary Bader, Dennis A. Wigle, Andrew Emili Nature Reviews Cancer. Feb 2007, Vol. 7, No. 1: 23-34 CrossRef CCR2 modulates inflammatory and metabolic effects of high-fat feeding Stuart P. Weisberg, Deborah Hunter, Reid Huber, Jacob Lemieux, Sarah Slaymaker, Kris Vaddi, Israel Charo, Rudolph L. Leibel, Anthony W. Ferrante Jr. Journal of Clinical Investigation. Feb 2006, Vol. 116, No. 1: 115-124 CrossRef Microarray data analysis: from disarray to consolidation and consensus David B. Allison, Xiangqin Cui, Grier P. Page, Mahyar Sabripour Nature Reviews Genetics. Feb 2006, Vol. 7, No. 1: 55-65 CrossRef
|
|