Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Building the data warehouse
Inmon W., John Wiley & Sons, Inc., New York, NY, 2005. 543 pp. Type: Book (9780764599446)
Date Reviewed: Jun 20 2006

The world of information technology is often full of hyperbole, so the use of the words “father,” “Bible,” and “classic” in the same paragraph may seem excessive. However, they are appropriate in the context of this book. Data warehousing is now a very important part of information technology, and credit for being the father of the field is an appropriate designation for the author. This standard data warehousing reference book has long been a comprehensive introduction to the core concepts and methods of data warehousing. Therefore, the terms “classic” and “Bible” apply to it.

As befits a fourth edition, the book builds upon its previous base with discussions of newer topics, such as compliance and multidimensional database design. The book can serve as a reference guide to remind experienced data warehousing professionals of the first principles of data warehousing. However, info!rmation technology professionals who are new to data warehousing can use the book to immerse themselves in understanding the fundamentals of data warehousing. The topics in the book are remarkably easy to follow and easy to absorb. Numerous simple, clear figures illuminate the discussion.

Most of the first half of the book delves into the aspects of environment, design, and technology that are necessary for the data warehouse. Fundamental to understanding the data warehouse environment is knowing that the system development life cycle of the data warehouse is essentially opposite to the standard waterfall system development life cycle. A data warehouse follows a “build it and they will come” methodology, where requirements are specified after the data warehouse is built. The key characteristics of a data warehouse are that it is subject-oriented, integrated, nonvolatile, and time-oriented. The purpose of a data warehouse is to support decision making, and t!his means that it has to be able to support requirements that have not yet been defined.

The chapter on data warehouse design goes on to describe the data model that is necessary for a data warehouse, especially since the life cycle methodology says that a data warehouse has to be built iteratively. A key decision for a data warehouse is the level of granularity of the data. If the level of detail is too voluminous, the performance of the data warehouse can be seriously affected. This does not mean that the data warehouse will not have to manage a large volume of data. The technology chapter discusses managing a large data warehouse efficiently. The chapter also discusses multidimensional database management system processing (also known as online analytical processing).

The book then goes into a number of architectural issues, including distributed data warehouses, integrating external data into the data warehouse, how to work with the Web, and how to deal with !a really large data warehouse. The book covers the relationship of a data warehouse with data marts and operational data stores. The business issues of cost justification and return on investment, as well as corporate information governance, each receive chapter-length treatment. The book concludes with an extensive data warehouse design review checklist.

Some key topics, such as granularity, are covered in more than one place in the book. Some readers may not like this redundancy, but the repetition should help with the retention and reinforcement of key points. Finally, the book contains a useful glossary and an extensive list of references. Overall, the book covers the subject of data warehousing thoroughly.

The world of information technology is often full of hyperbole, so the use of the words “father,” “Bible,” and “classic” in the same paragraph may seem excessive. However, they are appropriate in the context of this book!. Data warehousing is now a very important part of information technology, and credit for being the father of the field is an appropriate designation for the author. This standard data warehousing reference book has long been a comprehensive introduction to the core concepts and methods of data warehousing. Therefore, the terms “classic” and “Bible” apply to it.

As befits a fourth edition, the book builds upon its previous base with discussions of newer topics, such as compliance and multidimensional database design. The book can serve as a reference guide to remind experienced data warehousing professionals of the first principles of data warehousing. However, information technology professionals who are new to data warehousing can use the book to immerse themselves in understanding the fundamentals of data warehousing. The topics in the book are remarkably easy to follow and easy to absorb. Numerous simple, clear figures illuminate the discussion.

Most of the first half of the book delves into the aspects of environment, design, and technology that are necessary for the data warehouse. Fundamental to understanding the data warehouse environment is knowing that the system development life cycle of the data warehouse is essentially opposite to the standard waterfall system development life cycle. A data warehouse follows a “build it and they will come” methodology, where requirements are specified after the data warehouse is built. The key characteristics of a data warehouse are that it is subject-oriented, integrated, nonvolatile, and time-oriented. The purpose of a data warehouse is to support decision making, and this means that it has to be able to support requirements that have not yet been defined.

The chapter on data warehouse design goes on to describe the data model that is necessary for a data warehouse, especially since the life cycle methodology says that a data warehouse has to be built iteratively. A key decision for a data warehouse is the level of granularity of the data. If the level of detail is too voluminous, the performance of the data warehouse can be seriously affected. This does not mean that the data warehouse will not have to manage a large volume of data. The technology chapter discusses managing a large data warehouse efficiently. The chapter also discusses multidimensional database management system processing (also known as online analytical processing).

The book then goes into a number of architectural issues, including distributed data warehouses, integrating external data into the data warehouse, how to work with the Web, and how to deal with a really large data warehouse. The book covers the relationship of a data warehouse with data marts and operational data stores. The business issues of cost justification and return on investment, as well as corporate information governance, each receive chapter-length treatment. The book concludes with an extensive data warehouse design review checklist.

Some key topics, such as granularity, are covered in more than one place in the book. Some readers may not like this redundancy, but the repetition should help with the retention and reinforcement of key points. Finally, the book contains a useful glossary and an extensive list of references. Overall, the book covers the subject of data warehousing thoroughly.

Reviewer:  David G. Hill Review #: CR132935 (0705-0438)
Bookmark and Share
  Reviewer Selected
Featured Reviewer
 
 
Data Warehouse And Repository (H.2.7 ... )
 
 
Data Mining (H.2.8 ... )
 
 
Logging And Recovery (H.2.7 ... )
 
 
Database Administration (H.2.7 )
 
 
Database Applications (H.2.8 )
 
Would you recommend this review?
yes
no
Other reviews under "Data Warehouse And Repository": Date
The IBM data warehouse architecture
Bontempo C., Zagelow G. Communications of the ACM 41(9): 38-48, 1998. Type: Article
Jan 1 1999
Building the data warehouse
Gardner S. Communications of the ACM 41(9): 52-60, 1998. Type: Article
May 1 1999
Interactive data warehousing
Singh H., Prentice Hall PTR, Upper Saddle River, NJ, 1998. Type: Book (9780130803719)
Aug 1 1999
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy