|
Journal of Computational Biology
Combining Phylogenetic and Hidden Markov Models in Biosequence Analysis
To cite this article:
Adam Siepel, David Haussler.
Journal of Computational Biology.
March 2004,
11(2-3): 413-428.
doi:10.1089/1066527041410472.
Adam Siepel Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064 David Haussler Howard Hughes Medical Institute, University of California, Santa Cruz, CA 95064 A few models have appeared in recent years that consider not only the way substitutions occur through evolutionary history at each site of a genome, but also the way the process changes from one site to the next. These models combine phylogenetic models of molecular evolution, which apply to individual sites, and hidden Markov models, which allow for changes from site to site. Besides improving the realism of ordinary phylogenetic models, they are potentially very powerful tools for inference and prediction—for example, for gene finding or prediction of secondary structure. In this paper, we review progress on combined phylogenetic and hidden Markov models and present some extensions to previous work. Our main result is a simple and efficient method for accommodating higher-order states in the HMM, which allows for context-dependent models of substitution—that is, models that consider the effects of neighboring bases on the pattern of substitution. We present experimental results indicating that higher-order states, autocorrelated rates, and multiple functional categories all lead to significant improvements in the fit of a combined phylogenetic and hidden Markov model, with the effect of higher-order states being particularly pronounced.  This paper was cited by:Segmenting bacterial and viral DNA sequence alignments with a trans-dimensional phylogenetic factorial hidden Markov model Wolfgang P. Lehrach, Dirk Husmeier Journal of the Royal Statistical Society: Series C (Applied Statistics). Aug 2009, Vol. 58, No. 3: 307-327 CrossRef Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models M. Anisimova, C. Kosiol Molecular Biology and Evolution. Mar 2009, Vol. 26, No. 2: 255-271 CrossRef Complexity reduction in context-dependent DNA substitution models W. H. Majoros, U. Ohler Bioinformatics. Dec 2008, Vol. 25, No. 2: 175-182 CrossRef MotifMap: a human genome-wide map of candidate regulatory motif sites X. Xie, P. Rigor, P. Baldi Bioinformatics. Dec 2008, Vol. 25, No. 2: 167-174 CrossRef Evaluation of cis-regulatory function in zebrafish E. E. Pashos, E. Kague, S. Fisher Briefings in Functional Genomics and Proteomics. Dec 2008, Vol. 7, No. 6: 465-473 CrossRef Patterns of dioxin-altered mRNA expression in livers of dioxin-sensitive versus dioxin-resistant rats Monique A. Franc, Ivy D. Moffat, Paul C. Boutros, Jouni T. Tuomisto, Jouko Tuomisto, Raimo Pohjanvirta, Allan B. Okey Archives of Toxicology. Dec 2008, Vol. 82, No. 11: 809-830 CrossRef Models of coding sequence evolution W. Delport, K. Scheffler, C. Seoighe Briefings in Bioinformatics. Nov 2008, Vol. 10, No. 1: 97-109 CrossRef xREI: a phylo-grammar visualization webserver L. Barquist, I. Holmes Nucleic Acids Research. Jun 2008, Vol. 36, No. Web Server: W65-W69 CrossRef Delineating Slowly and Rapidly Evolving Fractions of the Drosophila Genome Jonathan M. Keith, Peter Adams, Stuart Stephen, John S. Mattick Journal of Computational Biology. May 2008, Vol. 15, No. 4: 407-430 Abstract | Full Text PDF | Reprints & PermissionsExact and Heuristic Algorithms for the Indel Maximum Likelihood Problem Abdoulaye Banire Diallo, Vladimir Makarenkov, Mathieu Blanchette Journal of Computational Biology. May 2007, Vol. 14, No. 4: 446-461 Abstract | Full Text PDF | Reprints & PermissionsComputational Biology: Toward Deciphering Gene Regulatory Information in Mammalian Genomes Hongkai Ji, Wing Hung Wong Biometrics. Oct 2006, Vol. 62, No. 3: 645-663 CrossRef Genetic determinants of normal variation in coagulation factor (F) IX levels: genome-wide scan and examination of the FIX structural gene M. KHACHIDZE, A. BUIL, K. R. VIEL, S. PORTER, D. WARREN, D. K. MACHIAH, J. M. SORIA, J. C. SOUTO, A. AMERI, M. LATHROP, J. BLANGERO, J. FONTCUBERTA, S. T. WARREN, L. ALMASY, T. E. HOWARD Journal of Thrombosis and Haemostasis. Aug 2006, Vol. 4, No. 7: 1537-1545 CrossRef Genome-wide analysis of mammalian promoter architecture and evolution Piero Carninci, Albin Sandelin, Boris Lenhard, Shintaro Katayama, Kazuro Shimokawa, Jasmina Ponjavic, Colin A M Semple, Martin S Taylor, Pär G Engström, Martin C Frith, Alistair R R Forrest, Wynand B Alkema, Sin Lam Tan, Charles Plessy, Rimantas Kodzius, Timothy Ravasi, Takeya Kasukawa, Shiro Fukuda, Mutsumi Kanamori-Katayama, Yayoi Kitazume, Hideya Kawaji, Chikatoshi Kai, Mari Nakamura, Hideaki Konno, Kenji Nakano, Salim Mottagui-Tabar, Peter Arner, Alessandra Chesi, Stefano Gustincich, Francesca Persichetti, Harukazu Suzuki, Sean M Grimmond, Christine A Wells, Valerio Orlando, Claes Wahlestedt, Edison T Liu, Matthias Harbers, Jun Kawai, Vladimir B Bajic, David A Hume, Yoshihide Hayashizaki Nature Genetics. Jul 2006, Vol. 38, No. 6: 626-635 CrossRef Distant regulatory elements in a Sox10-βGEO BAC transgene are required for expression of
Sox10
in the enteric nervous system and other neural crest-derived tissues Karen K. Deal, V. Ashley Cantrell, Ronald L. Chandler, Thomas L. Saunders, Douglas P. Mortlock, E. Michelle Southard-Smith Developmental Dynamics. Jun 2006, Vol. 235, No. 5: 1413-1432 CrossRef Association analysis of the chromosome 4p-located G protein-coupled receptor 78 (GPR78) gene in bipolar affective disorder and schizophrenia S L Underwood, A Christoforou, P A Thomson, N R Wray, A Tenesa, J Whittaker, R A Adams, S Le Hellard, S W Morris, D H R Blackwood, W J Muir, D J Porteous, K L Evans Molecular Psychiatry. May 2006, Vol. 11, No. 4: 384-394 CrossRef Identification of hundreds of conserved and nonconserved human microRNAs Isaac Bentwich, Amir Avniel, Yael Karov, Ranit Aharonov, Shlomit Gilad, Omer Barad, Adi Barzilai, Paz Einat, Uri Einav, Eti Meiri, Eilon Sharon, Yael Spector, Zvi Bentwich Nature Genetics. Aug 2005, Vol. 37, No. 7: 766-770 CrossRef
|
|