Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science49 (5):435-454,1998.Type:Article
Date Reviewed: Jul 1 1998

Wu describes a heuristic, attribute-based, noise-tolerant data mining program based on the extension matrix approach. Given n negative examples and p positive examples, where each example has a attributes, one can define the negative example matrix NEM to be the matrix whose rows are the negative examples and whose columns give the attribute values. For each positive example P, one constructs an extension matrix whose elements are * (a so-called dead element) when P has the same value as the corresponding attribute of the negative example, and whose elements have the value of the attribute of the negative example otherwise. If one can find a set of n non-dead elements, one in each row, this defines a conjunctive formula that separates the given example P from the negative example. The algorithm HCV uses heuristics to find, in polynomial time ( O ( p n a3 + p2 n a ) ), a disjunctive formula separating all the positive examples from the negative examples whenever at least one such formula exists.

The paper contains a brief discussion of how the algorithm can be modified to cope with noisy data. It concludes with the results of experimental comparisons of HCV with other data mining algorithms, including C4.5. The author observes that HCV is much faster than C4.5 on some sets and slower on other sets.

Reviewer:  J. P. E. Hodgson Review #: CR121757 (9807-0535)
Bookmark and Share
  Featured Reviewer  
 
Data Mining (H.2.8 ... )
 
 
Algorithm Design And Analysis (G.4 ... )
 
 
Heuristic Methods (I.2.8 ... )
 
 
Induction (I.2.6 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Data Mining": Date
Feature selection and effective classifiers
Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article
May 1 1999
Predictive data mining
Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)
Feb 1 1999
Data mining solutions
Westphal C., Blaxton T., John Wiley & Sons, Inc., New York, NY, 1998. Type: Book (9780471253846)
Jun 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy