Computing Reviews, the leading online review service for computing literature.

Search

A New Distance for High Level RNA Secondary Structure Comparison
Allali J., Sagot M. IEEE/ACM Transactions on Computational Biology and Bioinformatics2 (1):3-14,2005.Type:Article

Date Reviewed: Jan 12 2006

When I read the title of this paper, I expected an interesting contribution to the important problem of measuring the similarity between RNA molecules. Regrettably, this paper presents a questionable solution to a doubtful problem. After giving some background information, I will explain why this is the case. The RNA secondary structure is the structure induced by the ordered nucleotide base pairs and the unpaired bases in an RNA molecule. The RNA primary structure is the sequence of nucleotides in an RNA molecule. Because different sequences can produce similar secondary structures, RNA secondary structures are used for comparing RNA molecules. RNA secondary structures can be represented as ordered labeled trees, where “ordered” means that sibling nodes are ordered, so that the tree is an accurate representation of the RNA structure. There are multiple ways to define such trees. Several edit operations, such as node relabeling, node insertion, and node deletion, are defined on these trees, and a cost is associated with these operations. Comparing two RNA structures then reduces to finding the edit distance between the trees representing the structures, that is, to finding the sequence of edit operations with minimum cost that map one tree to the other. The authors claim that these classical edit operations are inadequate, because they can lead to tree mappings that do not accurately reflect the similarity between structures. In fact, the authors’ claim is based on using inappropriate tree representations for the RNA structures, and on using an incomplete definition of a map between trees (their map does not specify preserving structural properties such as sibling and ancestor order). Based on this claim, the authors propose new edit operations, called edge fusion and node fusion, define a new edit distance, and describe an algorithm to determine this distance. Unfortunately, the proposed algorithm has exponential time and space complexity, thus it is inefficient for nontrivial RNA sequences. In addition to its questionable scientific merit, this paper is marred by its hard-to-read figures and clumsy notation. An example of poor notation is in Section 4.3: “a path is list of of pairs (e or u, v).” The editors of the journal should have reviewed this paper more carefully before publication.

Reviewer: Gabriel Mateescu	Review #: CR132293 (0608-0844)

Trees (E.1 ... )

Biology And Genetics (J.3 ... )

Would you recommend this review?

yes

Other reviews under "Trees":	Date

The hB-tree: a multiattribute indexing method with good guaranteed performance Lomet D., Salzberg B. ACM Transactions on Database Systems 15(3): 625-658, 1990. Type: Article	Jun 1 1991

Multidimensional trees Baldwin W., Strawn G. Theoretical Computer Science 84(2): 293-311, 1991. Type: Article	Oct 1 1992

Hash trees versus b-trees Bell D., Deen S. The Computer Journal 27(3): 218-224, 1984. Type: Article	Feb 1 1985

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy