Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
External memory algorithms for string problems
Roh K., Crochemore M., Iliopoulos C., Park K. Fundamenta Informaticae84 (1):17-32,2008.Type:Article
Date Reviewed: Apr 15 2009

Modern systems typically use a cheaper external memory with a high storage capacity. Since access to this memory is relatively slow compared to the internal memory, applications that process huge amounts of data need to minimize communication across such a memory hierarchy.

Thus, it becomes necessary to develop algorithms that minimize the input/output (I/O) operations between the internal and external memory. Roh et al. provide such algorithms for string problems, especially for the problem of finding the maximum suffix of a string. Although the paper mentions other string problems, the algorithms presented are shown to be easily derivable from an algorithm that can solve the maximum suffix problem. Among these other problems, an interesting one is the problem of string matching with a popular Knuth-Morris-Pratt (KMP) algorithm. Whereas the KMP algorithm uses a scan and shift method, the suggested improvement uses maximum suffix decomposition during the shift step.

Roh et al. have previously published most of this work [1]. Although the two papers share a lot of material, the earlier version has some additional examples that illustrate why merely using a maximum suffix algorithm written for an internal memory model may cause inefficiencies when applied to systems that have external memory. In both versions of the paper, the theory is well presented.

Adequate background material is provided for the benefit of the readers. First, the presentation considers an external memory model with a single disk, and later, it extends the concepts to models with multiple disks.

However, for practitioners interested in implementing these techniques, this paper does not offer any experimental evidence. It would be interesting to compare the performance of the internal memory maximum suffix algorithm to the algorithms suggested here, on datasets with large strings, using an industrial system.

The algorithms suggested add significant complexity to the implementation, but, on the bright side, they do advance the existing theory.

Reviewer:  Paparao Kavalipati Review #: CR136690 (0911-1056)
1) Roh, K., Crochemore, M., Iliopoulos, C. S., Park, K. External memory algorithms for string problems. In Proceedings of the 17th Australasian Workshop on Combinatorial Algorithms. 2006.
Bookmark and Share
  Featured Reviewer  
 
Pattern Matching (F.2.2 ... )
 
 
Biology And Genetics (J.3 ... )
 
 
Combinatorial Algorithms (G.2.1 ... )
 
 
Sequencing And Scheduling (F.2.2 ... )
 
 
Combinatorics (G.2.1 )
 
 
Nonnumerical Algorithms And Problems (F.2.2 )
 
  more  
Would you recommend this review?
yes
no
Other reviews under "Pattern Matching": Date
Deterministic sampling
Vishkin U. SIAM Journal on Computing 20(1): 22-40, 1991. Type: Article
Jun 1 1992
Two-way string-matching
Crochemore M., Perrin D. (ed) Journal of the ACM 38(3): 650-674, 1991. Type: Article
Sep 1 1992
On the exact complexity of string matching: lower bounds
Galil Z., Giancarlo R. SIAM Journal on Computing 20(6): 1008-1020, 1991. Type: Article
Jan 1 1993
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy