Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Properties of extended Boolean models in information retrieval
Lee J.  Research and development in information retrieval (, Dublin, Ireland,1901994.Type:Proceedings
Date Reviewed: Dec 1 1995

Ranked document retrieval outputs in decreasing order of query-document similarities are obviously useful to control the size of a retrieved document set. Conventional Boolean retrieval systems, however, do not provide such ranked output because they cannot compute similarity coefficients between queries and documents. The author of this paper analyzes several extended Boolean models in order to determine which one is the most suitable for achieving high retrieval effectiveness.

Although extended Boolean models use document term weights to calculate query-document similarities, ranking is often not satisfactory. The author demonstrates with clear examples that some models (fuzzy set models) can generate incorrectly ranked output that does not agree with human behavior. Positively compensatory operators and binary soft Boolean operators in other models (Waller-Kraft, Paice, P-Norm [1], and Infinite-One) are shown to overcome this problem. The author continues to demonstrate with a new set of clear examples that these models (except for P-Norm) still violate the usual assumption that all the terms given in a query are equally important. Lee concludes that, since P-Norm is the only model that solves both the deficiency of fuzzy models and the unequal importance problem, it is more effective than any of the other extended Boolean models.

The concluding section is devoted to an analysis of the meaning of query weights. The analysis concludes that P-Norm is superior, since it uses relative query weights, found to be easier for users to write than absolute query weights. The author provides clear examples and presents the analysis in a very readable and convincing form supported by well-written mathematical proofs.

Reviewer:  D. B. Lange Review #: CR118922 (9512-0990)
1) Smith, M. E. Aspects of the P-norm model of information retrieval: syntactic query generation, efficiency, and theoretical properties. Ph.D. thesis, Cornell University, Ithaca, NY, 1990.
Bookmark and Share
 
Retrieval Models (H.3.3 ... )
 
 
Indexing Methods (H.3.1 ... )
 
 
Query Formulation (H.3.3 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Retrieval Models": Date
Evaluation of an inference network-based retrieval model
Turtle H., Croft W. (ed) ACM Transactions on Information Systems 9(3): 187-222, 1991. Type: Article
May 1 1993
On a model of distributed information retrieval systems based on thesauri
Mazur Z. Information Processing and Management: an International Journal 20(4): 499-505, 1984. Type: Article
Sep 1 1985
Information processing in linear vector space
Kunz M. Information Processing and Management: an International Journal 20(4): 519-525, 1984. Type: Article
Mar 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy