[BibTeX] [RIS]
From Local Patterns to Global Models: The {LeGo} Approach to Data Mining
Type of publication: Techreport
Citation: jf:TUD-KE-2007-06
Number: TUD-KE-2007-06
Year: 2007
Institution: TU Darmstadt, Knowledge Engineering Group
URL: http://www.ke.informatik.tu-darmstadt.de/publications/reports/tud-ke-2007-06.pdf
Abstract: In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of which is formulated in terms of different formal constraints. It starts with a phase of mining patterns that are individually promising. Later phases establish the context given by the global data mining task by selecting groups of diverse and highly informative patterns, which are finally combined to one or more global models that address the overall data mining task(s). The paper discusses the connection to various learning techniques, and illustrates that our framework is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams, multi-view learning, and several other popular algorithms. The Safarii learning toolbox serves as a proof-of-concept of its high potential for practical data mining applications. Finally, we point out several challenging open research questions that naturally emerge in a constraint-based local-to-global pattern mining, selection, and combination framework.
Authors Crémilleux, Bruno
Fürnkranz, Johannes
Knobbe, Arno J.
Scholz, Martin
  • tud-ke-2007-06.pdf