Hello. Sign in to personalize your visit. New user? Register now.  
Journal of Computational Biology
Transition Priors for Protein Hidden Markov Models: An Empirical Study towards Maximum Discrimination

To cite this article:
Markus Wistrand, Erik L. L. Sonnhammer. Journal of Computational Biology. January 2004, 11(1): 181-193. doi:10.1089/106652704773416957.

Full Text: • PDF for printing (433.4 KB) • PDF w/ links (453.6 KB)


Markus Wistrand
Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden
Erik L. L. Sonnhammer
Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden

Insertions and deletions in a profile hidden Markov model (HMM) are modeled by transition probabilities between insert, delete and match states. These are estimated by combining observed data and prior probabilities. The transition prior probabilities can be defined either ad hoc or by maximum likelihood (ML) estimation. We show that the choice of transition prior greatly affects the HMM's ability to discriminate between true and false hits. HMM discrimination was measured using the HMMER 2.2 package applied to 373 families from Pfam. We measured the discrimination between true members and noise sequences employing various ML transition priors and also systematically scanned the parameter space of ad hoc transition priors. Our results indicate that ML priors produce far from optimal discrimination, and we present an empirically derived prior that considerably decreases the number of misclassifications compared to ML. Most of the difference stems from the probabilities for exiting a delete state. The ML prior, which is unaware of noise sequences, estimates a delete-to-delete probability that is relatively high and does not penalize noise sequences enough for optimal discrimination.

Free first page
All articles
Previous Next