Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
DIMO: distributed index for matching multimedia objects using MapReduce
Abdelsadek A., Hefeeda M.  MMSys 2014 (Proceedings of the 5th ACM Multimedia Systems Conference, Singapore, Mar 19, 2014)115-126.2014.Type:Proceedings
Date Reviewed: Jun 17 2014

Datasets with high dimensions have an inherent problem popularly known as the “curse of dimensionality.” As such, clustering algorithms perform poorly when the number of attributes increases. When the dataset is large, the task of computing the clusters becomes computationally expensive.

In this paper, the authors attempt to solve both of these problems, while using the k-nearest neighbors clustering algorithm. They propose the use of a memory-efficient data structure (KD tree) in the MapReduce computing paradigm framework. The MapReduce framework is convenient in many ways; as the data and the computing nodes scale, managing this change is almost effortless.

The KD tree is constructed as a balanced tree (using singular value decomposition (SVD)) and only the leaf nodes store the data points. This makes the tree memory efficient even for large datasets. Due to this, duplicating this data structure across different computing nodes is feasible, enabling parallel computations. The k-nearest neighbors algorithm is then executed in parallel across these computing nodes.

Using the distributed index for matching multimedia objects (DIMO) system, the authors are able to demonstrate a 20 percent improvement in precision over the systems that currently exist in the literature.

Reviewer:  Kalyan Sasidhar Pathapati Subbu Review #: CR142404 (1409-0781)
Bookmark and Share
  Reviewer Selected
 
 
Distributed Systems (H.3.4 ... )
 
 
Multimedia Databases (H.2.4 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Distributed Systems": Date
High-dimensional nearest neighbor search with remote data centers
Wang C., Wang X. Knowledge and Information Systems 4(4): 440-465, 2002. Type: Article
May 30 2003
Document replication and distribution in extensible geographically distributed Web servers
Zhuo L., Wang C., Lau F. Journal of Parallel and Distributed Computing 63(10): 927-944, 2003. Type: Article
Jun 18 2004
Collectives and the design of complex systems
Tumer K., Wolpert D., Turner K., Springer-Verlag, London, UK, 2004.  321, Type: Book (9780387401652)
Jul 27 2004
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy