|
|
||||||||
Dipartimento di Matematica e Informatica, University of Udine, 33100 Udine, Italy
We introduce an exact algorithm, based on integer linear programming (ILP), for the parsimony haplotyping problem (PHP). The PHP uses molecular data and is aimed at the determination of a smallest set of haplotypes that explain a given set of genotypes. Our approach is based on a set-covering formulation of the problem, solved by branch and bound with both column and row generation. Existing ILP methods for the PHP suffer from the large size of the solution space, when the genotypes are long and with many heterozygous sites. Our approach, on the other hand, is based on an effective implicit representation of the solution space, and allows the solution of both real data and simulated instances, which are very hard to solve for other ILPs.
Dipartimento di Matematica e Informatica, University of Udine, 33100 Udine, Italy
lancia{at}dimi.uniud.it
serafini{at}dimi.uniud.it
Key words: integer programming; computational biology; haplotyping; branch and price; branch and cut; set covering
History: received November 2007;
accepted April 2008.
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |