Computing Reviews, the leading online review service for computing literature.

Search

A platform for language independent summarization
Cabral L., Lins R., Mello R., Freitas F., Ávila B., Simske S., Riss M. DocEng 2014 (Proceedings of the 2014 ACM Symposium on Document Engineering, Fort Collins, CO, Sep 16-19, 2014)203-206.2014.Type:Proceedings

Date Reviewed: Nov 4 2014

This paper describes a language independent, automatic text summarization system, the platform for language independent summarization (PLIS). Traditional text summarization systems mainly focus on the processing of English text. PLIS is, however, able to work with multiple languages by translating non-English text to English, before applying a set of features to score and rank sentences with the input text. It is an extractive summarization system, thus the top-ranked sentences are then selected to compose the final summary. The paper explains PLIS well. It is appropriately structured and the language is simple and easily comprehensible. It is a short paper, however, and as a result misses out on several details. For example, while we know that PLIS makes use of three features, namely word frequency, sentence length, and sentence position, the paper neglects to explain how these are combined to obtain a final score for each sentence. It also does not explain if PLIS requires training to find an optimal combination of these three features, or if they are aggregated together heuristically in an unsupervised fashion. While the work on PLIS is notable and very important, I would have hoped for more extensive experimentation. PLIS is described as a multi-language summarization system. However, the experiments only focused on text in English, Spanish, and Portuguese. In the introduction, the authors explicitly note this as a failing in several related papers. As a result, I expected experiments over a wide gamut of European Union (EU) languages. Further, the paper does not mention the multilingual summarization pilot held at the 2011 Text Analysis Conference (TAC), which would have been very relevant [1]. All in all, this is a good read that is easy to follow. The technique employed by PLIS shows promising results, and this would be an interesting paper for researchers interested in multilingual text summarization.

Reviewer: Jun-Ping Ng	Review #: CR142892 (1502-0180)

1)	Giannakopoulos, G.; El-Haj, M.; Favre, B.; Litvak, M.; Steinberger, J.; Varma, V. TAC 2011 MultiLing pilot overview. In Proc. of the 2011 Text Analysis Conference. NIST, 2011, 1–17. http://www.nist.gov/tac/publications/2011/additional.papers/Summarization2011_MultiLing_overview.proceedings.pdf

Text Analysis (I.2.7 ... )

Machine Translation (I.2.7 ... )

Would you recommend this review?

yes

Other reviews under "Text Analysis":	Date

Some issues in the semantics and pragmatics of definite reference in the context of natural language database access Berry-Rogghe G. Circuits, Systems, and Signal Processing 3(1): 47-54, 1984. Type: Article	Jun 1 1985

Word division in Spanish Mañas J. Communications of the ACM 30(7): 612-616, 1987. Type: Article	Jul 1 1989

Schemata for understanding of argumentation in newspaper texts Roesner D. Progress in artificial intelligence (, Orsay, France,3111985. Type: Proceedings	Apr 1 1986

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy