Computing Reviews, the leading online review service for computing literature.

Search

External memory algorithms for string problems
Roh K., Crochemore M., Iliopoulos C., Park K. Fundamenta Informaticae84 (1):17-32,2008.Type:Article

Date Reviewed: Apr 15 2009

Modern systems typically use a cheaper external memory with a high storage capacity. Since access to this memory is relatively slow compared to the internal memory, applications that process huge amounts of data need to minimize communication across such a memory hierarchy. Thus, it becomes necessary to develop algorithms that minimize the input/output (I/O) operations between the internal and external memory. Roh et al. provide such algorithms for string problems, especially for the problem of finding the maximum suffix of a string. Although the paper mentions other string problems, the algorithms presented are shown to be easily derivable from an algorithm that can solve the maximum suffix problem. Among these other problems, an interesting one is the problem of string matching with a popular Knuth-Morris-Pratt (KMP) algorithm. Whereas the KMP algorithm uses a scan and shift method, the suggested improvement uses maximum suffix decomposition during the shift step. Roh et al. have previously published most of this work [1]. Although the two papers share a lot of material, the earlier version has some additional examples that illustrate why merely using a maximum suffix algorithm written for an internal memory model may cause inefficiencies when applied to systems that have external memory. In both versions of the paper, the theory is well presented. Adequate background material is provided for the benefit of the readers. First, the presentation considers an external memory model with a single disk, and later, it extends the concepts to models with multiple disks. However, for practitioners interested in implementing these techniques, this paper does not offer any experimental evidence. It would be interesting to compare the performance of the internal memory maximum suffix algorithm to the algorithms suggested here, on datasets with large strings, using an industrial system. The algorithms suggested add significant complexity to the implementation, but, on the bright side, they do advance the existing theory.

Reviewer: Paparao Kavalipati	Review #: CR136690 (0911-1056)

1)	Roh, K., Crochemore, M., Iliopoulos, C. S., Park, K. External memory algorithms for string problems. In Proceedings of the 17th Australasian Workshop on Combinatorial Algorithms. 2006.

Pattern Matching (F.2.2 ... )

Biology And Genetics (J.3 ... )

Combinatorial Algorithms (G.2.1 ... )

Sequencing And Scheduling (F.2.2 ... )

Combinatorics (G.2.1 )

Nonnumerical Algorithms And Problems (F.2.2 )

Would you recommend this review?

yes

Other reviews under "Pattern Matching":	Date

Deterministic sampling Vishkin U. SIAM Journal on Computing 20(1): 22-40, 1991. Type: Article	Jun 1 1992

Two-way string-matching Crochemore M., Perrin D. (ed) Journal of the ACM 38(3): 650-674, 1991. Type: Article	Sep 1 1992

On the exact complexity of string matching: lower bounds Galil Z., Giancarlo R. SIAM Journal on Computing 20(6): 1008-1020, 1991. Type: Article	Jan 1 1993

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy