INFORMS Journal on Computing
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


INFORMS JOURNAL ON COMPUTING
Vol. 19, No. 2, Spring 2007, pp. 185-200
DOI: 10.1287/ijoc.1050.0153
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Adomavicius, G.
Right arrow Articles by Tuzhilin, A.
Right arrow Search for Related Content

Validation Sequence Optimization: A Theoretical Approach

Gediminas Adomavicius, Alexander Tuzhilin

Department of Information and Decision Sciences, Carlson School of Management, University of Minnesota, 321 19th Avenue South, Minneapolis, Minnesota 55455, USA
Department of Information, Operations, and Management Sciences, Stern School of Business, New York University, 44 West Fourth Street, New York, New York 10012, USA

gedas{at}umn.edu
atuzhili{at}stern.nyu.edu

The need to validate large amounts of data with the help of the domain expert arises naturally in many data-intensive applications, including data mining, data stream, and database-related applications. This paper presents a general validation approach that generalizes different expert-driven validation methods developed for specialized validation problems. In particular, we model the validation process as a sequence of validation operators, explore various properties of such sequences, and present theoretical results that provide for better understanding of the validation process. We also address the problem of selecting the best validation sequence among the class of equivalent sequence permutations. We demonstrate that this optimization problem is NP-hard and present two heuristic algorithms for improving validation sequences.

Key words: validation; validation operators; validation sequences; sequence optimization; computational complexity; heuristic algorithms; dynamic programming; data mining
History: received June 2003; revised October 2004; accepted May 2005.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2007 by INFORMS.