Computing Reviews, the leading online review service for computing literature.

Search

Information search and retrieval in microblogs
Efron M. Journal of the American Society for Information Science and Technology62 (6):996-1008,2011.Type:Article

Date Reviewed: Dec 1 2011

Information retrieval (IR) has been a useful computing tool for more than three decades. This 13-page journal paper describes some of the changes in IR associated with using IR from microblogs, such as, for instance, from the tweets generated by the users of the social media application Twitter. This paper is not a comprehensive report, but an early report covering some key problems encountered in the IR microblog experience. After an introduction to IR, the paper surveys seven observed microblog IR problems: sentiment analysis, opinion mining, entity search, user-generated metadata, authority, influence, and temporal issues. The paper closes with brief coverage of six outstanding problems: geographic data, data needs, data queries, search relevance, search recency, and search corpus abundance. Fortunately, IR access to microblog documents is very quick and easy. Millions of such documents are readily accessible for IR. While individually the contents of the documents are short--less than 150 characters per document--the documents often have some distinctive attributes. Some are explicit, such as hashtags. Prime among the implicit attributes are fast responses about the contents that the microblogs cover. For example, The Wall Street Journal reported that a microblog document announcing the August 2011 earthquake in Virginia was received in New York City before the tremors from that earthquake were felt in New York City [1]. Unfortunately, some commonly desired attributes (for IR purposes) of documents are not always readily ascertainable from microblog documents. Two common examples are the identification of the author of a document, and the identification of the intended receiver of a document. The paper leaves for future evaluation some IR matters, such as geography and security. Another example is content timing, when IR-relevant content is sometimes covered in many microblogs by many authors, or only in a few by a few. An example is the Haiti earthquake. Also, IR access to microblogs can be deliberately used as a tool, such as in the Brazilian work on monitoring dengue fever outbreaks in South America [2]. This paper supplies some helpful pointers for the effective use of IR from microblogs. Even though the paper is an early report, the timing attribute could have been introduced earlier and covered more fully. Overall, the paper provides stimulating reading about a modern use of IR.

Reviewer: Ned Chapin	Review #: CR139627 (1204-0397)

1)	Hotz, R. L. Decoding our chatter. The Wall Street Journal, October 1, 2011, http://online.wsj.com/article/SB10001424052970204138204576598942105167646.html.

2)	Corbyn, Z. Twitter to track dengue fever outbreaks in Brazil. New Scientist, July 18, 2011, http://www.newscientist.com/article/mg21128215.600-twitter-to-track-dengue-fever-outbreaks-in-brazil.html.

Information Search And Retrieval (H.3.3 )

Would you recommend this review?

yes

Other reviews under "Information Search And Retrieval":	Date

Nested transactions in a combined IRS-DBMS architecture Schek H. (ed) Research and development in information retrieval (, King’s College, Cambridge,701984. Type: Proceedings	Nov 1 1985

An integrated fact/document information system for office automation Ozkarahan E., Can F. (ed) Information Technology Research Development Applications 3(3): 142-156, 1984. Type: Article	Oct 1 1985

Access methods for text Faloutsos C. ACM Computing Surveys 17(1): 49-74, 1985. Type: Article	Jan 1 1986

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy