Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A survey on data preprocessing for data stream mining
Ramírez-Gallego S., Krawczyk B., García S., Woźniak M., Herrera F. Neurocomputing239  39-57,2017.Type:Article
Date Reviewed: Dec 28 2017

Data stream mining has become an important phenomenon with new technologies ranging from patient tracking to stock market investing. Data streams contain data items in temporal order and are potentially endless. Efficient and effective mining of them is important because mining takes place online. This paper is a survey of preprocessing for data stream mining.

In the paper, the authors first present fundamental concepts such as concept drift related to data streams. They emphasize principles of proper experiment design in this machine learning domain. Then, they present the important preprocessing aspects of data mining: data reduction in terms of dimensionality reduction, like elimination of redundant features; instance reduction, like reducing the number of training instances; and feature space simplification, like discretization. They analyze the leading algorithms in terms of their predictive, reduction, time, and memory performance. The experiments contain 20 datasets, of which 13 are real.

The survey is comprehensive and the future research pointers are good. The study is well done and will be useful both to practitioners and researchers. It is a noticeable addition to the literature of a research area that is in its infancy.

Reviewer:  F. Can Review #: CR145735 (1802-0098)
Bookmark and Share
  Reviewer Selected
 
 
Data Mining (H.2.8 ... )
 
 
Learning (I.2.6 )
 
Would you recommend this review?
yes
no
Other reviews under "Data Mining": Date
Feature selection and effective classifiers
Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article
May 1 1999
Rule induction with extension matrices
Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article
Jul 1 1998
Predictive data mining
Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)
Feb 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy