Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Using trees to mine multirelational databases
Jiménez A., Berzal F., Cubero J. Data Mining and Knowledge Discovery24 (1):1-39,2012.Type:Article
Date Reviewed: Oct 11 2012

Data mining algorithms look for patterns in data, traditionally represented in a single set. However, when mining for properties of compound objects (for example, logs and financial or scientific data), more adequate representation of these structures is needed. As the source of the data is often a relational database, it is important to extend previous mining methods to deal with objects stored in multiple related tables. This is the target of the multirelational data mining discipline.

Several approaches have been developed in recent years to achieve the tight coupling of data mining techniques to relational database management system (RDBMS) technology. These approaches include solutions provided by both commercial companies and academic research groups. The first approaches simply mapped multiple tables into one before analysis; the results were redundant and sometimes hid the structures of the compound objects.

The authors of this paper extracted the tree-like structures of objects stored in a database in different related tables, allowing existing tree-mining techniques to be applied in RDBMS. (The issue of how to deal with tables embodying M:N relationships was left open in the paper.) They show two representations, and the essence of the trees is extracted in two ways. The resulting 2x2 variants are compared both informally and formally.

The authors show how to determine frequent patterns within the set of trees, and how to mine for association rules among them, filtered by three kinds of rule-mining constraints. Finally, experimental results are shown for different datasets, using the authors’ tree-mining system.

I recommend this paper to interested data mining researchers.

Reviewer:  K. Balogh Review #: CR140587 (1302-0128)
Bookmark and Share
 
Data Mining (H.2.8 ... )
 
 
Relational Databases (H.2.4 ... )
 
 
Trees (G.2.2 ... )
 
 
Graph Theory (G.2.2 )
 
Would you recommend this review?
yes
no
Other reviews under "Data Mining": Date
Feature selection and effective classifiers
Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article
May 1 1999
Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article
Jul 1 1998
Predictive data mining
Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)
Feb 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy