Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A survey on ensemble learning for data stream classification
Gomes H., Barddal J., Enembreck F., Bifet A. ACM Computing Surveys50 (2):1-36,2017.Type:Article
Date Reviewed: Jun 16 2017

The automation of several processes, such as business transactions, smartphones, and various types of sensors, has severely increased the number of data stream generators. In data stream classification, data items are represented by a vector of features. Data items come very frequently in temporal order and are practically endless. Memory- and time-efficient correct online decisions are required for labeling. Only a finite number of recent instances are accessible to learn from. Ensemble learners combine decisions coming from different classifiers. These classifiers are called base learners. Base learners can be heterogeneous or homogeneous. They use different parameters or different biases or may learn from different data items. It is expected that base members complement each other and so an ensemble classifier provides a decision better than those of its members. Such ensembles are referred to as diverse.

In ensembles, there can be base learners with frequent incorrect decisions or some of them may have no contributions since their decisions may be mostly similar to other decisions. Such learners should be pruned and chance should be given to new ones. During the process of classification, new labels may appear or some existing labels may disappear. Input vectors that should be assigned their current labels change, or for certain input vector patterns that remain the same, labels assigned to them change. Some input vector patterns that do not appear for some time may reappear again and, remembering how decisions were made for them, may improve efficiency and effectiveness. Certain input vector patterns may show temporal interdependencies in terms of their labels.

This paper is a comprehensive survey on ensemble learning for data stream classification. It is easy to read. The provided taxonomy is a good summary of possibilities based on 65 classifiers. It emphasizes their similar and dissimilar aspects. The included list of open-source software resources is useful. Future research problems provide good pointers.

The paper will be useful to anybody interested in ensemble learning, with or without data streams. Another good survey on the same topic, published recently, shows the hotness of the area [1]. I expect to see some significant breakthroughs on this topic in the near future.

Reviewer:  F. Can Review #: CR145356 (1708-0564)
1) Krawczyk, B.; Minku, L. L.; Gama, J.; Stefanowski, J.; Wozniak, M. Ensemble learning for data stream analysis: a survey. Information Fusion 37 (2017), 132–156.
Bookmark and Share
  Reviewer Selected
 
 
Classifier Design And Evaluation (I.5.2 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Classifier Design And Evaluation": Date
Linear discrimination with symmetrical models
Bobrowski L. Pattern Recognition 19(1): 101-109, 1986. Type: Article
Feb 1 1988
An application of a graph distance measure to the classification of muscle tissue patterns
Sanfeliu A. (ed), Fu K., Prewitt J. International Journal of Pattern Recognition and Artificial Intelligence 1(1): 17-42, 1987. Type: Article
Dec 1 1989
Selective networks and recognition automata
George N. J., Edelman G.  Computer culture: the scientific, intellectual, and social impact of the computer (, New York,2011984. Type: Proceedings
May 1 1987
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy