Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Parameter-Free Geometric Document Layout Analysis
Lee S., Ryu D. IEEE Transactions on Pattern Analysis and Machine Intelligence23 (11):1240-1256,2001.Type:Article
Date Reviewed: Jul 26 2002

There are a large number of documents for which electronic form is desirable, but for which no electronic representation is available. While optical character recognition systems have been available for many years, they are generally restricted to documents that have a highly regular text form. This paper describes an approach to segmenting a printed document (scanned into image format) so that it is partioned into areas that are recognized as text, image, table or separating lines, thus allowing individual document analyses of these contributing types.

The algorithm presented is a “parameter-free geometric document analysis method,” which uses a periodicity measure over a pyramidal quad-tree representation of the image. Advantages claimed for the method are that it can resolve touching or overlapping regions, paragraphs that begin with a large image-format character, and single text lines used as headings. Some comparisons with other methods and commercial software are given, over which the proposed method compares quite favorably.

The paper is reasonably well written, although at times the authors’ non-English background is apparent. This is most evident in the very first paragraph, in which the opening three sentences contain the stem “increase/decrease” five times. The style improves from there on, however, and the rest of the paper is easy to follow.

Reviewer:  John Hurst Review #: CR126296 (0210-0601)
Bookmark and Share
 
Document Analysis (I.7.5 ... )
 
 
Document Preparation (I.7.2 )
 
Would you recommend this review?
yes
no
Other reviews under "Document Analysis": Date
Generating indicative-informative summaries with sumUM: a 3D dynamic virtual shop
Saggion H., Lapalme G. Computational Linguistics 28(4): 497-526, 2002. Type: Article
Jun 20 2003
A hierarchical neural network document classifier with linguistic feature selection
Chen C., Lee H., Hwang C. Applied Intelligence 23(3): 277-294, 2005. Type: Article
Aug 2 2006
Digital document processing: major directions and recent advances (Advances in Pattern Recognition)
Chaudhuri B., Springer-Verlag New York, Inc., Secaucus, NJ, 2006.  468, Type: Book (9781846285011)
Aug 13 2007
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy