Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Database performance evaluation in an indexed file environment
Fedorowicz J. ACM Transactions on Database Systems12 (1):85-110,1987.Type:Article
Date Reviewed: Aug 1 1987

This paper develops and validates two analytical prediction models: one for access time and another for response time in an indexed file context. While the validity of the models proves to be excellent, their generalizability is not as broad as is implied by the title and abstract of the paper. This is because the models rely on Zipfian assumptions about the distribution of access keys, and these assumptions are valid only for files containing natural language text or keywords. The author constructed and tested the prediction models in a bibliographic searching environment, so it should not be assumed that the results are valid in the broader domain of general purpose database management systems that are implemented using indexed files. The author clearly notes these limitations in the text, but they should also have been reflected in the title and abstract. A more appropriate title might have been “Performance Prediction Models for Indexed Bibliographic Databases.”

Within the bibliographic searching realm, the prediction models are quite accurate. Testing the prediction equations against actual searches in the MEDLINE database, an R-square value of 0.95 was obtained for predictions of access time, and 0.66 for response time predictions. These R-square values reflect the proportion of overall variance in the criterion variable that was explained by the analytical prediction model using regression analysis.

A possible limitation of the results is that the models were validated using data collected in 1979 in an IBM mainframe environment. The parameters for the response time prediction model that were valid in 1979 may not be accurate in 1987. This would be due not only to absolute improvements in CPU speed since then, but to enhancement of operating system task scheduling and priority assignment techniques. The paper does not discuss the operating system environment in which the data were collected.

Important applications for accurate models of response time in bibliographic systems are discussed. The design of optimum index structures, determination of the amount of information that should be indexed for each citation, overall database size, limits on the number of subscribers, and pricing structures can all be efficiently investigated using the validated models. The author has made a very significant contribution toward the development of more generalizable prediction models.

Reviewer:  Geoffry S. Howard Review #: CR111572
Bookmark and Share
 
File Organization (D.4.3 ... )
 
 
Access Methods (H.2.2 ... )
 
 
File Organization (H.3.2 ... )
 
 
Retrieval Models (H.3.3 ... )
 
Would you recommend this review?
yes
no
Other reviews under "File Organization": Date
File structures: a conceptual toolkit
Folk M., Zoellick B., Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1987. Type: Book (9789780201120035)
May 1 1988
Disk file allocation based on the buddy system
Koch P. ACM Transactions on Computer Systems 5(4): 352-370, 1987. Type: Article
Jun 1 1988
A fast file system for UNIX
McKusick M., Joy W., Leffler S., Fabry R. ACM Transactions on Computer Systems 2(3): 181-197, 1984. Type: Article
Nov 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy