Computing Reviews, the leading online review service for computing literature.

Search

Binary queries for document trees
Berlea A., Seidl H. Nordic Journal of Computing11 (1):41-71,2004.Type:Article

Date Reviewed: Dec 21 2004

In an Extensible Markup Language (XML) application, it may be necessary to simultaneously locate a k-tuple of nodes in an input (XML) tree that satisfies some given relationship. The difficulty here is that it is necessary to maintain context information about the initial part of any potential match while searching for the complete match. This paper describes an approach, based on the use of efficient push down tree automata, that gives an effective algorithm for finding k-ary matches. A language, fxgrep, is introduced that is a practical query language for unary and binary queries. The algorithm for finding binary matches uses two passes through the document tree. The first, which is a top-down left-to-right pass, labels the nodes of the tree according to the states of the automaton at the node during the run. A second pass, also top-down, but this time right-to-left, can collect the matches because the labels provide the needed context information. For binary queries, the algorithm is, at worst, of order n squared, where n is the number of nodes in the tree, but this happens only when the number of matching nodes is itself of order n, so that, in general, the order will be closer to being linear in n. Although the authors have implemented the algorithm, and the language fxgrep, they do not present any experimental results. It would be interesting to have a repository of problems on which one could test k-ary matching algorithms.

Reviewer: J. P. E. Hodgson	Review #: CR130559 (0506-0694)

Sorting And Searching (F.2.2 ... )

Automata (F.1.1 ... )

Languages And Systems (I.7.2 ... )

Search Process (H.3.3 ... )

XML (I.7.2 ... )

Information Search And Retrieval (H.3.3 )

Would you recommend this review?

yes

Other reviews under "Sorting And Searching":	Date

Binary search trees of almost optimal height Anderson A., Icking C., Klein R., Ottmann T. (ed) Acta Informatica 28(2): 165-178, 1990. Type: Article	May 1 1992

More nearly optimal algorithms for unbounded searching, part II Reingold E., Shen X. SIAM Journal on Computing 20(1): 184-208, 1991. Type: Article	Apr 1 1992

A simple linear-time algorithm for in situ merging Mannila H., Ukkonen E. Information Processing Letters 18(4): 203-208, 1984. Type: Article	Feb 1 1985

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy