Computing Reviews, the leading online review service for computing literature.

Search

Properties of extended Boolean models in information retrieval
Lee J. Research and development in information retrieval (, Dublin, Ireland,1901994.Type:Proceedings

Date Reviewed: Dec 1 1995

Ranked document retrieval outputs in decreasing order of query-document similarities are obviously useful to control the size of a retrieved document set. Conventional Boolean retrieval systems, however, do not provide such ranked output because they cannot compute similarity coefficients between queries and documents. The author of this paper analyzes several extended Boolean models in order to determine which one is the most suitable for achieving high retrieval effectiveness. Although extended Boolean models use document term weights to calculate query-document similarities, ranking is often not satisfactory. The author demonstrates with clear examples that some models (fuzzy set models) can generate incorrectly ranked output that does not agree with human behavior. Positively compensatory operators and binary soft Boolean operators in other models (Waller-Kraft, Paice, P-Norm [1], and Infinite-One) are shown to overcome this problem. The author continues to demonstrate with a new set of clear examples that these models (except for P-Norm) still violate the usual assumption that all the terms given in a query are equally important. Lee concludes that, since P-Norm is the only model that solves both the deficiency of fuzzy models and the unequal importance problem, it is more effective than any of the other extended Boolean models. The concluding section is devoted to an analysis of the meaning of query weights. The analysis concludes that P-Norm is superior, since it uses relative query weights, found to be easier for users to write than absolute query weights. The author provides clear examples and presents the analysis in a very readable and convincing form supported by well-written mathematical proofs.

Reviewer: D. B. Lange	Review #: CR118922 (9512-0990)

1)	Smith, M. E. Aspects of the P-norm model of information retrieval: syntactic query generation, efficiency, and theoretical properties. Ph.D. thesis, Cornell University, Ithaca, NY, 1990.

Retrieval Models (H.3.3 ... )

Indexing Methods (H.3.1 ... )

Query Formulation (H.3.3 ... )

Would you recommend this review?

yes

Other reviews under "Retrieval Models":	Date

Evaluation of an inference network-based retrieval model Turtle H., Croft W. (ed) ACM Transactions on Information Systems 9(3): 187-222, 1991. Type: Article	May 1 1993

On a model of distributed information retrieval systems based on thesauri Mazur Z. Information Processing and Management: an International Journal 20(4): 499-505, 1984. Type: Article	Sep 1 1985

Information processing in linear vector space Kunz M. Information Processing and Management: an International Journal 20(4): 519-525, 1984. Type: Article	Mar 1 1985

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy