Hello. Sign in to personalize your visit. New user? Register now.  
Journal of Computational Biology
A Pseudo-Boolean Framework for Computing Rearrangement Distances between Genomes with Duplicates

To cite this article:
Sébastien Angibaud, Guillaume Fertin, Irena Rusu, Stéphane Vialette. Journal of Computational Biology. May 2007, 14(4): 379-393. doi:10.1089/cmb.2007.A001.

Full Text: • PDF for printing (252.5 KB) • PDF w/ links (222.1 KB)


Sébastien Angibaud
Laboratoire d'Informatique de Nantes-Atlantique (LINA), FRE CNRS 2729, Université de Nantes, 2 rue de la Houssinière, 44322 Nantes Cedex 3, France. Sebastien.Angibaud@univ-nantes.fr
Guillaume Fertin
Laboratoire d'Informatique de Nantes-Atlantique (LINA), FRE CNRS 2729, Université de Nantes, Nantes, France.
Irena Rusu
Laboratoire d'Informatique de Nantes-Atlantique (LINA), FRE CNRS 2729, Université de Nantes, Nantes, France.
Stéphane Vialette
Laboratoire de Recherche en Informatique (LRI), UMR CNRS 8623, Faculté des Sciences d'Orsay, Université Paris-Sud, Orsay, France.

Computing genomic distances between whole genomes is a fundamental problem in comparative genomics. Recent researches have resulted in different genomic distance definitions, for example, number of breakpoints, number of common intervals, number of conserved intervals, and Maximum Adjacency Disruption number. Unfortunately, it turns out that, in presence of duplications, most problems are NP–hard, and hence several heuristics have been recently proposed. However, while it is relatively easy to compare heuristics between them, until now very little is known about the absolute accuracy of these heuristics. Therefore, there is a great need for algorithmic approaches that compute exact solutions for these genomic distances. In this paper, we present a novel generic pseudo-boolean approach for computing the exact genomic distance between two whole genomes in presence of duplications, and put strong emphasis on common intervals under the maximum matching model. Of particular importance, we show three heuristics which provide very good results on a well-known public dataset of γ-Proteobacteria.

Free first page

This paper was cited by:

Efficient Tools for Computing the Number of Breakpoints and the Number of Adjacencies between Two Genomes with Duplicate Genes
Sébastien Angibaud, Guillaume Fertin, Irena Rusu, Annelyse Thévenin, Stéphane Vialette
Journal of Computational Biology. Oct 2008, Vol. 15, No. 8: 1093-1115
Abstract | Full Text PDF | Reprints & Permissions
All articles
Previous Next