Computing Reviews, the leading online review service for computing literature.

Search

Progressive skyline computation in database systems
Papadias D., Tao Y., Fu G., Seeger B. ACM Transactions on Database Systems30 (1):41-82,2005.Type:Article

Date Reviewed: Jan 24 2006

Searching to retrieve specific records from databases can be slowed by complicating factors. One source of complication addressed in this paper is the query selection criteria when multiple dimensions are involved, a complication made worse when the dimensions use nonkey data. A classic multiple dimension search often used in the literature, and used in this paper, is for a two-dimension retrieval, from a database about hotels, of the set of hotels that are closest to the beach and lowest in price: a so-called skyline retrieval. This paper starts by reporting, based on a survey of the literature, an evaluation and comparison of prominent algorithms for doing skyline searches. The six algorithms reported on are the divide-and-conquer, the block nested loop, the sort first skyline, the bit map, the index, and the nearest neighbor. The comparison emphasizes progressiveness, absence of false misses, absence of false hits, fairness, incorporation of retrieval references, and universality. Building on the nearest neighbor algorithm, the paper proposes an algorithm it claims is better than the six compared algorithms, one it terms the branch-and-bound algorithm. The paper explains the branch-and-bound algorithm, and offers an extensive formal proof of its correctness. The paper then briefly examines the effect of additions to and deletions from the searched databases. Following that, the paper identifies six “novel variations” on skyline search to document the superiority of the branch-and-bound algorithm: constrained skylines, ranked skylines, group-by skylines, dynamic skylines, enumerating and K-dominating queries, and skybands. Only the branch-and-bound is compatible with all six variations, and, on top of that, the paper introduces a new variation, approximate skybands, on which it also shines. The paper then provides an experimental evaluation of the efficiency of the branch-and-bound algorithm compared to the nearest neighbor algorithm (generally, the best of the six prominent algorithms for most skyline searches). For a non-novel search, the branch-and-bound algorithm makes a better showing on each of three rating factors: dimensionality, cardinality, and progressive behavior. On each of the six novel variations, the paper’s branch-and-bound algorithm also winds up working from about as well as to much better than the nearest neighbor algorithm. As the paper points out, saving skylines relevant for frequently asked types of queries could offer important savings for active database users. To me, the branch-and-bound algorithm’s claimed desirable properties suggest that data administrators should investigate the algorithm’s applicability for their own databases.

Reviewer: Ned Chapin	Review #: CR132347 (0608-0859)

Database Management (H.2 )

Information Search And Retrieval (H.3.3 )

Would you recommend this review?

yes

Other reviews under "Database Management":	Date

Raghu Ramakrishnan speaks out on deductive databases, what lies beyond scalability, how he burned through $20M briskly, why we should reach out to policymakers, and more Winslett M. ACM SIGMOD Record 35(2): 77-85, 2006. Type: Article	Nov 23 2006

Beginning PHP 5 and MySQL 5: from novice to professional (2nd ed.) Gilmore W., APress, LP, Berkeley, CA, 2005. 952, Type: Book (9781590595527)	Nov 30 2006

Information systems reengineering and integration Fong J., Springer-Verlag New York, Inc., Secaucus, NJ, 2006. 368, Type: Book (9781846283826)	Dec 13 2006

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy