High-dimensional datasets suffer from an inherent problem popularly known as the “curse of dimensionality”: clustering algorithms perform poorly as the number of attributes grows. In addition, when the dataset is large, computing the clusters becomes computationally expensive.
In this paper, the authors attempt to solve both of these problems using the k-nearest neighbors clustering algorithm. They propose a memory-efficient data structure (the KD tree) within the MapReduce framework. MapReduce is convenient in many ways: as the data and the number of computing nodes scale, managing that growth requires almost no effort.
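The parallel scheme the review describes can be illustrated with a toy MapReduce-style k-NN: each partition (computing node) maps to its local top-k candidates, and a reduce step merges them into a global top-k. This is a minimal sketch of the general pattern, not the authors' implementation; the function and variable names are illustrative.

```python
import heapq

def knn_mapreduce(partitions, query, k):
    """Toy MapReduce-style k-nearest neighbors (illustrative sketch).
    Map: each partition computes its local k closest points.
    Reduce: merge per-partition candidates into a global top-k."""

    def dist2(p):
        # Squared Euclidean distance to the query point.
        return sum((a - b) ** 2 for a, b in zip(p, query))

    # Map step: local top-k per partition (runs on each node in parallel).
    local_topk = (heapq.nsmallest(k, part, key=dist2) for part in partitions)

    # Reduce step: merge all candidate lists into the global top-k.
    merged = [p for cand in local_topk for p in cand]
    return heapq.nsmallest(k, merged, key=dist2)

# Example: three partitions, as if the data were split across three nodes.
parts = [[(0, 0), (5, 5)], [(1, 1), (9, 9)], [(2, 2), (7, 7)]]
print(knn_mapreduce(parts, (0, 0), 2))  # → [(0, 0), (1, 1)]
```

Because the reduce step only sees k candidates per partition rather than every point, the merge stays cheap even as partitions multiply.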
The KD tree is constructed as a balanced tree (using singular value decomposition (SVD) to choose the splits), and only the leaf nodes store data points. This keeps the tree memory efficient even for large datasets, so duplicating the structure across different computing nodes is feasible, enabling parallel computation. The k-nearest neighbors algorithm is then executed in parallel across these nodes.
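A leaf-storing balanced KD tree of this kind can be sketched as follows. Internal nodes keep only a split axis and value; all data points live in the leaves. The paper picks split directions via SVD; to keep this sketch dependency-free, it instead splits on the coordinate axis with the largest spread, a common stand-in. All names are illustrative, not from the paper.

```python
def build_tree(points, leaf_size=2):
    """Balanced KD-tree sketch: internal nodes hold only (axis, value);
    data points are stored exclusively in the leaves."""
    if len(points) <= leaf_size:
        return {"leaf": True, "points": list(points)}
    dims = len(points[0])
    # Axis with the widest spread (stand-in for the SVD principal direction).
    axis = max(range(dims),
               key=lambda d: max(p[d] for p in points) - min(p[d] for p in points))
    pts = sorted(points, key=lambda p: p[axis])
    mid = len(pts) // 2  # median split keeps the tree balanced
    return {"leaf": False, "axis": axis, "value": pts[mid][axis],
            "left": build_tree(pts[:mid], leaf_size),
            "right": build_tree(pts[mid:], leaf_size)}

def leaf_points(node):
    """Collect all stored points; they exist only in the leaves."""
    if node["leaf"]:
        return list(node["points"])
    return leaf_points(node["left"]) + leaf_points(node["right"])

pts = [(0, 0), (1, 5), (2, 1), (3, 4), (4, 2), (5, 3)]
tree = build_tree(pts)
print(sorted(leaf_points(tree)) == sorted(pts))  # → True
```

Because internal nodes carry no data, the serialized tree is small, which is what makes replicating it to every computing node cheap.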
Using the distributed index for matching multimedia objects (DIMO) system, the authors demonstrate a 20 percent improvement in precision over existing systems in the literature.