Computing Reviews, the leading online review service for computing literature.

Search

A survey on data preprocessing for data stream mining
Ramírez-Gallego S., Krawczyk B., García S., Woźniak M., Herrera F. Neurocomputing239 39-57,2017.Type:Article

Date Reviewed: Dec 28 2017

Data stream mining has become an important phenomenon with new technologies ranging from patient tracking to stock market investing. Data streams contain data items in temporal order and are potentially endless. Efficient and effective mining of them is important because mining takes place online. This paper is a survey of preprocessing for data stream mining. In the paper, the authors first present fundamental concepts such as concept drift related to data streams. They emphasize principles of proper experiment design in this machine learning domain. Then, they present the important preprocessing aspects of data mining: data reduction in terms of dimensionality reduction, like elimination of redundant features; instance reduction, like reducing the number of training instances; and feature space simplification, like discretization. They analyze the leading algorithms in terms of their predictive, reduction, time, and memory performance. The experiments contain 20 datasets, of which 13 are real. The survey is comprehensive and the future research pointers are good. The study is well done and will be useful both to practitioners and researchers. It is a noticeable addition to the literature of a research area that is in its infancy.

Reviewer: F. Can	Review #: CR145735 (1802-0098)

Data Mining (H.2.8 ... )

Learning (I.2.6 )

Would you recommend this review?

yes

Other reviews under "Data Mining":	Date

Feature selection and effective classifiers Deogun J. (ed), Choubey S., Raghavan V. (ed), Sever H. (ed) Journal of the American Society for Information Science 49(5): 423-434, 1998. Type: Article	May 1 1999

Rule induction with extension matrices Wu X. (ed) Journal of the American Society for Information Science 49(5): 435-454, 1998. Type: Article	Jul 1 1998

Predictive data mining Weiss S., Indurkhya N., Morgan Kaufmann Publishers Inc., San Francisco, CA, 1998. Type: Book (9781558604032)	Feb 1 1999

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy