[BibTeX] [RIS]
A Comparison of Strategies for Handling Missing Values in Rule Learning
Type of publication: Techreport
Citation: jf:TUD-KE-2009-03
Number: TUD-KE-2009-03
Year: 2009
Institution: TU Darmstadt, Knowledge Engineering Group
URL: http://www.ke.tu-darmstadt.de/publications/reports/tud-ke-2009-03.pdf
Abstract: In this paper, we review possible strategies for handling missing values in separate-and-conquer rule learning algorithms, and compare them experimentally on a large number of datasets. In particular through a careful study with data with controlled levels of missing values we get additional insights on the strategies’ different biases w.r.t. attributes with missing values. Somewhat surprisingly, a strategy that implements a strong bias against the use of attributes with missing values, exhibits the best average performance on 24 datasets from the UCI repository.
Keywords:
Authors Wohlrab, Lars
FĂĽrnkranz, Johannes
Attachments
  • tud-ke-2009-03.pdf
Topics