Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Surveying stylometry techniques and applications
Neal T., Sundararajan K., Fatima A., Yan Y., Xiang Y., Woodard D. ACM Computing Surveys50 (6):1-36,2018.Type:Article
Date Reviewed: Apr 19 2018

Stylometry is analysis of textual data to find hidden patterns. This paper provides a comprehensive survey on this topic.

The paper contains three main parts. In the first part, the authors consider five stylometry problems:

  • Authorship attribution: finding the author of a text in a closed/predefined (or open) set of authors;
  • Authorship verification: determining whether given documents were written by the same author;
  • Authorship profiling: obtaining author demographics such as gender;
  • Stylochronometry: assigning time to text; and
  • Adversarial stylometry: altering style for evasion of authorship.

In the second part of the paper, they provide performance analysis for large-scale authorship analysis using 14 algorithms. They show that with several authors, this important task is challenging and invite new researchers to the field. In the third part, they define open problems.

The authors describe several open-source and commercial systems. The reference list is rich, with some omissions, and covers studies in different languages. It contains tables for feature categories, publicly available resources in terms of datasets and software, and so on. The tables make the material more accessible.

I like the emphasis of the large-scale authorship analysis that involves more than 1,000 authors and limited data. I see this aspect of stylometry as important in today’s world, with several possible dangers on the web and social media. The paper will be useful for practitioners and researchers. I enjoyed it.

Reviewer:  F. Can Review #: CR145987 (1807-0399)
Bookmark and Share
 
Natural Language Processing (I.2.7 )
 
 
Document Analysis (I.7.5 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Natural Language Processing": Date
Current research in natural language generation
Dale R. (ed), Mellish C. (ed), Zock M., Academic Press Prof., Inc., San Diego, CA, 1990. Type: Book (9780122007354)
Nov 1 1992
Incremental interpretation
Pereira F., Pollack M. Artificial Intelligence 50(1): 37-82, 1991. Type: Article
Aug 1 1992
Natural language and computational linguistics
Beardon C., Lumsden D., Holmes G., Ellis Horwood, Upper Saddle River, NJ, 1991. Type: Book (9780136128137)
Jul 1 1992
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy