|
Journal of Computational Biology
Comparing Graph Representations of Protein Structure for Mining Family-Specific Residue-Based Packing Motifs
To cite this article:
Jun Huan, Deepak Bandyopadhyay, Wei Wang, Jack Snoeyink, Jan Prins, Alexander Tropsha.
Journal of Computational Biology.
July/August 2005,
12(6): 657-671.
doi:10.1089/cmb.2005.12.657.
Published in Volume: 12 Issue 6: August 18, 2005
Jun Huan Department of Computer Science, University of North Carolina, Chapel Hill, NC. Deepak Bandyopadhyay Department of Computer Science, University of North Carolina, Chapel Hill, NC. Wei Wang Department of Computer Science, University of North Carolina, Chapel Hill, NC. Jack Snoeyink Department of Computer Science, University of North Carolina, Chapel Hill, NC. Jan Prins Department of Computer Science, University of North Carolina, Chapel Hill, NC. Alexander Tropsha The Laboratory for Molecular Modeling, Division of Medicinal Chemistry and Natural Products, School of Pharmacy, University of North Carolina, Chapel Hill, NC. We find recurring amino-acid residue packing patterns, or spatial motifs, that are characteristic of protein structural families, by applying a novel frequent subgraph mining algorithm to graph representations of protein three-dimensional structure. Graph nodes represent amino acids, and edges are chosen in one of three ways: first, using a threshold for contact distance between residues; second, using Delaunay tessellation; and third, using the recently developed almost-Delaunay edges. For a set of graphs representing a protein family from the Structural Classification of Proteins (SCOP) database, subgraph mining typically identi- fies several hundred common subgraphs corresponding to spatial motifs that are frequently found in proteins in the family but rarely found outside of it. We find that some of the large motifs map onto known functional regions in two protein families explored in this study, i.e., serine proteases and kinases. We find that graphs based on almost-Delaunay edges significantly reduce the number of edges in the graph representation and hence present computational advantage, yet the patterns extracted from such graphs have a biological interpretation approximately equivalent to that of those extracted from distance based graphs.  This paper was cited by:Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: I. Method development Deepak Bandyopadhyay, Jun Huan, Jan Prins, Jack Snoeyink, Wei Wang, Alexander Tropsha Journal of Computer-Aided Molecular Design. Jul 2009 CrossRef Prediction of topological contacts in proteins using learning classifier systems Michael Stout, Jaume Bacardit, Jonathan D. Hirst, Robert E. Smith, Natalio Krasnogor Soft Computing. Mar 2009, Vol. 13, No. 3: 245-258 CrossRef Structure-based function inference using protein family-specific fingerprints Deepak Bandyopadhyay, Jun Huan, Jinze Liu, Jan Prins, Jack Snoeyink, Wei Wang, Alexander Tropsha Protein Science. Jul 2006, Vol. 15, No. 6: 1537-1543 CrossRef
|
|