Computing Reviews
Information search and retrieval in microblogs
Efron M. Journal of the American Society for Information Science and Technology 62(6): 996-1008, 2011. Type: Article
Date Reviewed: Dec 1 2011

Information retrieval (IR) has been a useful computing tool for more than three decades. This 13-page journal paper describes some of the changes that arise when IR is applied to microblogs, such as the tweets generated by users of the social media application Twitter.

This paper is not a comprehensive report, but an early one covering some key problems encountered in microblog IR. After an introduction to IR, the paper surveys seven observed microblog IR problems: sentiment analysis, opinion mining, entity search, user-generated metadata, authority, influence, and temporal issues. The paper closes with brief coverage of six outstanding problems: geographic data, data needs, data queries, search relevance, search recency, and search corpus abundance.

Fortunately, IR access to microblog documents is quick and easy, and millions of such documents are readily accessible. Although each document is short (fewer than 150 characters), the documents often have distinctive attributes. Some are explicit, such as hashtags. Prime among the implicit attributes is the speed with which microblogs respond to the events they cover. For example, The Wall Street Journal reported that a microblog document announcing the August 2011 earthquake in Virginia reached New York City before the tremors from that earthquake were felt there [1].

Unfortunately, some attributes commonly desired for IR purposes are not always readily ascertainable from microblog documents. Two common examples are the identity of a document's author and the identity of its intended recipient.

The paper leaves some IR matters, such as geography and security, for future evaluation. Another such matter is content timing: IR-relevant content is sometimes covered in many microblogs by many authors, and sometimes in only a few by a few. The Haiti earthquake is one example. Also, IR access to microblogs can be used deliberately as a tool, as in the Brazilian work on monitoring dengue fever outbreaks in South America [2].

This paper supplies some helpful pointers for the effective use of IR from microblogs. Even though the paper is an early report, the timing attribute could have been introduced earlier and covered more fully. Overall, the paper provides stimulating reading about a modern use of IR.

Reviewer: Ned Chapin
Review #: CR139627 (1204-0397)
[1] Hotz, R. L. Decoding our chatter. The Wall Street Journal, October 1, 2011, http://online.wsj.com/article/SB10001424052970204138204576598942105167646.html.
[2] Corbyn, Z. Twitter to track dengue fever outbreaks in Brazil. New Scientist, July 18, 2011, http://www.newscientist.com/article/mg21128215.600-twitter-to-track-dengue-fever-outbreaks-in-brazil.html.
Information Search And Retrieval (H.3.3)
Other reviews under "Information Search And Retrieval": Date
Nested transactions in a combined IRS-DBMS architecture
Schek H. (ed) Research and development in information retrieval (King’s College, Cambridge), 70, 1984. Type: Proceedings
Nov 1 1985
An integrated fact/document information system for office automation
Ozkarahan E., Can F. (ed) Information Technology Research Development Applications 3(3): 142-156, 1984. Type: Article
Oct 1 1985
Access methods for text
Faloutsos C. ACM Computing Surveys 17(1): 49-74, 1985. Type: Article
Jan 1 1986