Hello. Sign in to personalize your visit. New user? Register now.  
Journal of Computational Biology
Learning Gene Functional Classifications from Multiple Data Types

To cite this article:
Paul Pavlidis, Jason Weston, Jinsong Cai, William Stafford Noble. Journal of Computational Biology. April 2002, 9(2): 401-411. doi:10.1089/10665270252935539.

Full Text: • PDF for printing (275.9 KB) • PDF w/ links (308.2 KB)


Paul Pavlidis
Columbia Genome Center, Columbia University, New York, NY 10027
Jason Weston
BIOwulf Technologies LLC, 305 Broadway, 9th Floor, New York, NY 10007
Jinsong Cai
Department of Medical Informatics, Columbia University, New York, NY, 10027
William Stafford Noble
Department of Computer Science, Columbia University, New York, NY, 10027

In our attempts to understand cellular function at the molecular level, we must be able to synthesize information from disparate types of genomic data. We consider the problem of inferring gene functional classifications from a heterogeneous data set consisting of DNA microarray expression measurements and phylogenetic profiles from whole-genome sequence comparisons. We demonstrate the application of the support vector machine (SVM) learning algorithm to this functional inference task. Our results suggest the importance of exploiting prior information about the heterogeneity of the data. In particular, we propose an SVM kernel function that is explicitly heterogeneous. In addition, we describe feature scaling methods for further exploiting prior knowledge of heterogeneity by giving each data type different weights.

Free first page

This paper was cited by:

Physical protein-protein interactions predicted from microarrays
T.-t. Soong, K. O. Wrzeszczynski, B. Rost
Bioinformatics. Oct 2008, Vol. 24, No. 22: 2608-2614
CrossRef
A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics
B.-J. M. Webb-Robertson, W. R. Cannon, C. S. Oehmen, A. R. Shah, V. Gurumoorthi, M. S. Lipton, K. M. Waters
Bioinformatics. Jun 2008, Vol. 24, No. 13: 1503-1509
CrossRef
Gene Expression Alterations in the Sphingolipid Metabolism Pathways during Progression of Dementia and Alzheimer’s Disease: A Shift Toward Ceramide Accumulation at the Earliest Recognizable Stages of Alzheimer’s Disease?
Pavel Katsel, Celeste Li, Vahram Haroutunian
Neurochemical Research. Apr 2007, Vol. 32, No. 4-5: 845-856
CrossRef
Computational prediction of cancer-gene function
Pingzhao Hu, Gary Bader, Dennis A. Wigle, Andrew Emili
Nature Reviews Cancer. Feb 2007, Vol. 7, No. 1: 23-34
CrossRef
CCR2 modulates inflammatory and metabolic effects of high-fat feeding
Stuart P. Weisberg, Deborah Hunter, Reid Huber, Jacob Lemieux, Sarah Slaymaker, Kris Vaddi, Israel Charo, Rudolph L. Leibel, Anthony W. Ferrante Jr.
Journal of Clinical Investigation. Feb 2006, Vol. 116, No. 1: 115-124
CrossRef
Microarray data analysis: from disarray to consolidation and consensus
David B. Allison, Xiangqin Cui, Grier P. Page, Mahyar Sabripour
Nature Reviews Genetics. Feb 2006, Vol. 7, No. 1: 55-65
CrossRef
All articles
Previous Next