Computing Reviews, the leading online review service for computing literature.

Parameter-Free Geometric Document Layout Analysis
Lee S., Ryu D. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(11): 1240-1256, 2001. Type: Article

Date Reviewed: Jul 26 2002

Many documents for which an electronic form is desirable have no electronic representation available. While optical character recognition systems have been available for many years, they are generally restricted to documents that have a highly regular text form. This paper describes an approach to segmenting a printed document (scanned into image format) so that it is partitioned into areas recognized as text, image, table, or separating lines, allowing each of these region types to be analyzed separately. The algorithm presented is a “parameter-free geometric document analysis method,” which uses a periodicity measure over a pyramidal quad-tree representation of the image. Advantages claimed for the method are that it can handle touching or overlapping regions, paragraphs that begin with a large image-format character, and single text lines used as headings. Some comparisons with other methods and with commercial software are given, against which the proposed method compares quite favorably.

The paper is reasonably well written, although at times the authors’ non-English background is apparent. This is most evident in the very first paragraph, in which the opening three sentences contain the stem “increase/decrease” five times. The style improves from there on, however, and the rest of the paper is easy to follow.
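The two ideas named in the review, a pyramidal quad-tree over the page image and a periodicity measure, can be illustrated with a small sketch. This is not the authors' algorithm: the function names (`build_pyramid`, `row_profile`, `periodicity`), the OR-reduction of 2x2 blocks, and the use of a normalized autocorrelation of the row projection profile are all assumptions chosen for illustration. The intuition is that text regions tend to show regularly spaced lines, so their profiles are strongly periodic, while a solid image block is not.

```python
from typing import List, Tuple

Image = List[List[int]]  # binary image: 1 = ink, 0 = background


def build_pyramid(img: Image) -> List[Image]:
    """Return pyramid levels, finest first.

    Assumption: each coarser pixel is the logical OR of a 2x2 block,
    and the image sides are powers of two (odd remainders are dropped).
    """
    levels = [img]
    while len(levels[-1]) > 1 and len(levels[-1][0]) > 1:
        prev = levels[-1]
        h, w = len(prev) // 2, len(prev[0]) // 2
        nxt = [
            [
                1 if (prev[2 * r][2 * c] or prev[2 * r][2 * c + 1]
                      or prev[2 * r + 1][2 * c] or prev[2 * r + 1][2 * c + 1]) else 0
                for c in range(w)
            ]
            for r in range(h)
        ]
        levels.append(nxt)
    return levels


def row_profile(img: Image, top: int, left: int, height: int, width: int) -> List[int]:
    """Count ink pixels in each row of a rectangular region."""
    return [sum(img[r][left:left + width]) for r in range(top, top + height)]


def periodicity(profile: List[int]) -> Tuple[float, int]:
    """Return (score, lag) of the strongest normalized autocorrelation peak.

    The score is 0.0 for a flat profile; lags run from 1 to half the length.
    """
    n = len(profile)
    mean = sum(profile) / n
    centered = [p - mean for p in profile]
    energy = sum(c * c for c in centered)
    if energy == 0:
        return 0.0, 0
    best_score, best_lag = 0.0, 0
    for lag in range(1, n // 2 + 1):
        corr = sum(centered[i] * centered[i + lag] for i in range(n - lag)) / energy
        if corr > best_score:
            best_score, best_lag = corr, lag
    return best_score, best_lag


# Hypothetical 16x16 "text" region: two inked rows, then two blank rows, repeated.
text_like: Image = [[1] * 16 if r % 4 < 2 else [0] * 16 for r in range(16)]
# Hypothetical 16x16 "image" region: solid ink, so no line structure.
solid: Image = [[1] * 16 for _ in range(16)]

text_score, text_lag = periodicity(row_profile(text_like, 0, 0, 16, 16))
solid_score, solid_lag = periodicity(row_profile(solid, 0, 0, 16, 16))
pyramid = build_pyramid(text_like)
```

In this toy setup the text-like region yields a clear peak at the line pitch, while the solid block scores zero. A real segmenter would evaluate such a measure on quad-tree nodes at several pyramid levels and would need to deal with skew, noise, and mixed regions, none of which this sketch attempts.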

Reviewer: John Hurst	Review #: CR126296 (0210-0601)

Document Analysis (I.7.5 ... )

Document Preparation (I.7.2)

