Hello. Sign in to personalize your visit. New user? Register now.  
Journal of Computational Biology
A Random Graph Approach to NMR Sequential Assignment

To cite this article:
Chris Bailey-Kellogg, Sheetal Chainraj, Gopal Pandurangan. Journal of Computational Biology. July/August 2005, 12(6): 569-583. doi:10.1089/cmb.2005.12.569.

Published in Volume: 12 Issue 6: August 18, 2005

Full Text: • PDF for printing (176.6 KB) • PDF w/ links (193.3 KB)


Chris Bailey-Kellogg
Department of Computer Science, Dartmouth College, Hanover, NH 03755.
Sheetal Chainraj
Department of Computer Science, Purdue University, West Lafayette, IN 47907.
Gopal Pandurangan
Department of Computer Science, Purdue University, West Lafayette, IN 47907.

Nuclear magnetic resonance (NMR) spectroscopy allows scientists to study protein structure, dynamics and interactions in solution. A necessary first step for such applications is determining the resonance assignment, mapping spectral data to atoms and residues in the primary sequence. Automated resonance assignment algorithms rely on information regarding connectivity (e.g., through-bond atomic interactions) and amino acid type, typically using the former to determine strings of connected residues and the latter to map those strings to positions in the primary sequence. Significant ambiguity exists in both connectivity and amino acid type information. This paper focuses on the information content available in connectivity alone and develops a novel random-graph theoretic framework and algorithm for connectivity-driven NMR sequential assignment. Our random graph model captures the structure of chemical shift degeneracy, a key source of connectivity ambiguity. We then give a simple and natural randomized algorithm for finding optimal assignments as sets of connected fragments in NMR graphs. The algorithm naturally and efficiently reuses substrings while exploring connectivity choices; it overcomes local ambiguity by enforcing global consistency of all choices. By analyzing our algorithm under our random graph model, we show that it can provably tolerate relatively large ambiguity while still giving expected optimal performance in polynomial time. We present results from practical applications of the algorithm to experimental datasets from a variety of proteins and experimental set-ups. We demonstrate that our approach is able to overcome significant noise and local ambiguity in identifying significant fragments of sequential assignments.

Free first page

This paper was cited by:

Automated structure determination from NMR spectra
Peter Güntert
European Biophysics Journal. Mar 2009, Vol. 38, No. 2: 129-143
CrossRef
Graphical interpretation of Boolean operators for protein NMR assignments
Dries Verdegem, Klaas Dijkstra, Xavier Hanoulle, Guy Lippens
Journal of Biomolecular NMR. Oct 2008, Vol. 42, No. 1: 11-21
CrossRef
Contact replacement for NMR resonance assignment
F. Xiong, G. Pandurangan, C. Bailey-Kellogg
Bioinformatics. Jun 2008, Vol. 24, No. 13: i205-i213
CrossRef
All articles
Previous Next