INFORMS Journal on Computing
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


INFORMS JOURNAL ON COMPUTING
Vol. 16, No. 4, Fall 2004, pp. 419-429
DOI: 10.1287/ijoc.1040.0090
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Meneses, C. N.
Right arrow Articles by Pardalos, P. M.
Right arrow Search for Related Content

Optimal Solutions for the Closest-String Problem via Integer Programming

Cláudio N. Meneses, Zhaosong Lu, Carlos A. S. Oliveira, Panos M. Pardalos

Department of Industrial and Systems Engineering, University of Florida, 303 Weil Hall, Gainesville, Florida 32611, USA
School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332-0205, USA
Department of Industrial and Systems Engineering, University of Florida, 303 Weil Hall, Gainesville, Florida 32611, USA
Department of Industrial and Systems Engineering, University of Florida, 303 Weil Hall, Gainesville, Florida 32611, USA

claudio{at}ufl.edu
zhaosong{at}isye.gatech.edu
oliveira{at}ufl.edu
pardalos{at}ufl.edu

In this paper we study the closest-string problem (CSP), which can be defined as follows: Given a finite set S = {s1, s2, ..., sn} of strings, each string with length m, find a center string t of length m minimizing d, such that for every string si isin S, dH(t, si) ≤ d. By dH(t, si) we mean the Hamming distance between t and si. This is an NP-hard problem, with applications in molecular biology and coding theory. Even though there are good approximation algorithms for this problem, and exact algorithms for instances with d constant, there are no studies trying to solve it exactly for the general case. In this paper we propose three integer-programming (IP) formulations and a heuristic, which is used to provide upper bounds on the value of an optimal solution. We report computational results of a branch-and-bound algorithm based on one of the IP formulations, and of the heuristic, executed over randomly generated instances. These results show that it is possible to solve CSP instances of moderate size to optimality.

Key words: closest-string problem; computational biology; mathematical programming; branch-and-bound algorithms
History: received February 2004; revised March 2004; accepted April 2004.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2004 by INFORMS.