Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A New Distance for High Level RNA Secondary Structure Comparison
Allali J., Sagot M. IEEE/ACM Transactions on Computational Biology and Bioinformatics2 (1):3-14,2005.Type:Article
Date Reviewed: Jan 12 2006

When I read the title of this paper, I expected an interesting contribution to the important problem of measuring the similarity between RNA molecules. Regrettably, this paper presents a questionable solution to a doubtful problem. After giving some background information, I will explain why this is the case.

The RNA secondary structure is the structure induced by the ordered nucleotide base pairs and the unpaired bases in an RNA molecule. The RNA primary structure is the sequence of nucleotides in an RNA molecule. Because different sequences can produce similar secondary structures, RNA secondary structures are used for comparing RNA molecules. RNA secondary structures can be represented as ordered labeled trees, where “ordered” means that sibling nodes are ordered, so that the tree is an accurate representation of the RNA structure.

There are multiple ways to define such trees. Several edit operations, such as node relabeling, node insertion, and node deletion, are defined on these trees, and a cost is associated with these operations. Comparing two RNA structures then reduces to finding the edit distance between the trees representing the structures, that is, to finding the sequence of edit operations with minimum cost that map one tree to the other.

The authors claim that these classical edit operations are inadequate, because they can lead to tree mappings that do not accurately reflect the similarity between structures. In fact, the authors’ claim is based on using inappropriate tree representations for the RNA structures, and on using an incomplete definition of a map between trees (their map does not specify preserving structural properties such as sibling and ancestor order). Based on this claim, the authors propose new edit operations, called edge fusion and node fusion, define a new edit distance, and describe an algorithm to determine this distance. Unfortunately, the proposed algorithm has exponential time and space complexity, thus it is inefficient for nontrivial RNA sequences.

In addition to its questionable scientific merit, this paper is marred by its hard-to-read figures and clumsy notation. An example of poor notation is in Section 4.3: “a path is list of of pairs (e or u, v).” The editors of the journal should have reviewed this paper more carefully before publication.

Reviewer:  Gabriel Mateescu Review #: CR132293 (0608-0844)
Bookmark and Share
  Featured Reviewer  
 
Trees (E.1 ... )
 
 
Biology And Genetics (J.3 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Trees": Date
The hB-tree: a multiattribute indexing method with good guaranteed performance
Lomet D., Salzberg B. ACM Transactions on Database Systems 15(3): 625-658, 1990. Type: Article
Jun 1 1991
Multidimensional trees
Baldwin W., Strawn G. Theoretical Computer Science 84(2): 293-311, 1991. Type: Article
Oct 1 1992
Hash trees versus b-trees
Bell D., Deen S. The Computer Journal 27(3): 218-224, 1984. Type: Article
Feb 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy