Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A normal form for XML documents
Arenas M., Libkin L.  Principles of database systems (Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium, Madison, Wisconsin, Jun 3-5, 2002)85-96.2002.Type:Proceedings
Date Reviewed: Dec 19 2003

Arenas and Libkin define the background and requirements of a normal form of Extensible Markup Language (XML), with the goal of converting arbitrary XML into well-formed XML. Since my own work is with Extensible Business Reporting Language (XBRL) processor implementation, this background, and their algorithm for normalizing XML documents, is of high interest to me. Recent database implementations for storing normal-form XML data--both those implemented specifically for XML, such as Apache.org’s XIndice, and by XML features added to popular commercial heavyweight relational databases--make this technology timely and of wide interest.

The authors begin by demonstrating the need to normalize XML, based on XML data structures that have redundancies of data in different areas of the structure. They relate that to relational functional dependency. Extending from relational to XML functional dependencies, the authors then represent XML tree structured data into tuples (XBRL also does this). Functional dependencies are defined using tree tuples. Finally, an XML normal form (XNF) is defined.

Anticipating massive Web databases of poorly organized data, the authors’ goal is to provide principles for good design, and algorithms for producing that design, sort of an XML schema reengineering. My take on this process is different: I see the need for this technology in the opposite direction, not for the rescue of Web “stuff,” but for the upcoming new generations of databases and software that will all be based on XML technologies. Mainstream business processing now considers XML and XML derivatives for automated interchange between systems, and as core infrastructural technology.

Physically, this paper seems to be about the right length (12 pages). The presentation is consistent with the authors’ ideas, and the text was easy to read. The latest reference is from one year before publication.

This paper demonstrates that XML structure can be converted to a normal form, and that data can be losslessly converted into the resulting structure. In my work with XBRL, there are more expressed semantics than just the structure of data. Source fact values expressed in XML are given context, measure, role restrictions, and semantics, defined in XML linking language (XLink) linkbases. Applications of the authors’ theories, if extendable, would be very useful.

I would recommend this paper to researchers and developers of XML and XML-derived data structuring and data storing technology.

Reviewer:  Herman Fischer Review #: CR128796 (0405-0675)
Bookmark and Share
 
Markup Languages (I.7.2 ... )
 
 
XML (I.7.2 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Markup Languages": Date
XML: a manager’s guide
Dick K., Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 2002.  336, Type: Book (9780201770063)
Jan 13 2003
Localization in XML: identifying multiple criteria that drive effective VE system design
Savourel Y. Markup Languages 3(4): 387-393, 2001. Type: Article
Jul 23 2003
Learning XSLT
Fitzgerald M., O’Reilly & Associates, Inc., Sebastopol, CA, 2003. Type: Book (9780596003272)
Dec 20 2004
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy