Computing Reviews
Word division in Spanish
Mañas J. Communications of the ACM 30(7): 612-616, 1987. Type: Article
Date Reviewed: Jul 1 1989

The author describes a method for the insertion of hyphenation procedures for Spanish into text processing systems, most of which were designed for use with English. He studies the specific problems that arise from the syllabic structures of Spanish as well as from the typographical conventions that Hispanic writers have traditionally adopted.

The method is based primarily on the study of Spanish orthographic and syllabic structures, which provide the first set of breaking rules. A second layer of rules encodes the traditional conventions of word partitioning. The third component is an aesthetic parameter: an adjustable threshold that allows the user to balance the percentage of hyphenations against the volume of spaces for a given document. This threshold makes the application of the rules pleasantly flexible. The set of algorithms has been implemented in C and tested with the lexical analyzer generator lex, and the author gives some performance data.
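To make the three layers concrete, here is a minimal Python sketch. It is not the author's C and lex implementation, and every name in it (`break_points`, `hyphenate`, `min_side`, `should_break`) is hypothetical. It assumes a simplified rule set: breaks fall only inside consonant runs between vowels, the consonant pairs in `ONSETS` are never split, vowel sequences are never split, and no break may leave fewer than `min_side` letters on either side. The threshold test is likewise only one plausible reading of "balancing hyphenations against spaces".

```python
# Illustrative sketch only; the rules are simplified assumptions,
# not the rule set of the reviewed article.

VOWELS = set("aeiouáéíóúü")

# Consonant pairs assumed inseparable: digraphs and stop/f + liquid clusters.
ONSETS = {"ch", "ll", "rr", "pr", "br", "tr", "dr", "cr", "gr", "fr",
          "pl", "bl", "cl", "gl", "fl"}


def break_points(word):
    """Layer 1: candidate break indices derived from syllable structure."""
    w = word.lower()
    n = len(w)
    points = []
    i = 0
    # Leading consonants belong to the first syllable.
    while i < n and w[i] not in VOWELS:
        i += 1
    while i < n:
        # Skip a run of vowels; vowel sequences are never split here.
        while i < n and w[i] in VOWELS:
            i += 1
        start = i
        # Collect the consonant run that follows.
        while i < n and w[i] not in VOWELS:
            i += 1
        if i == n:
            break  # trailing consonants stay with the last syllable
        run = w[start:i]
        # The last consonant (or inseparable pair) opens the next syllable.
        keep = 2 if len(run) >= 2 and run[-2:] in ONSETS else 1
        points.append(i - keep)
    return points


def hyphenate(word, min_side=2):
    """Layer 2: drop breaks that would strand fewer than min_side letters."""
    allowed = [p for p in break_points(word)
               if min_side <= p <= len(word) - min_side]
    pieces, prev = [], 0
    for p in allowed:
        pieces.append(word[prev:p])
        prev = p
    pieces.append(word[prev:])
    return "-".join(pieces)


def should_break(slack_spaces, line_width, threshold=0.1):
    """Layer 3: hyphenate only if leftover space exceeds a fraction of the line."""
    return slack_spaces / line_width > threshold
```

Under these assumptions, `hyphenate("instante")` yields `ins-tan-te` and `hyphenate("amigo")` yields `ami-go`, since the second layer rejects the break that would isolate the initial `a`. Raising `threshold` trades fewer hyphenations for looser lines.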

This work will interest people working on text processing; in a wider context, it makes a useful contribution to current research on the retrieval of document structures [1]. Many Spanish speakers should also appreciate this work, as Spanish is a major human language and the algorithms are linguistically rooted.

Reviewer: M. Borillo
Review #: CR112155

1) Virbel, J. The contribution of linguistic knowledge to the interpretation of text structures. In Structured documents, J. André, V. Quint, and R. Furuta (Eds.), Cambridge Univ. Press, Cambridge, UK, 1988.
Text Analysis (I.2.7 ... )
Linguistics (J.5 ... )
Text Processing (I.5.4 ... )
Word Processing (H.4.1 ... )
General (I.7.0 )
Miscellaneous (H.1.m )
