Computing Reviews
Aligning and clustering patterns to reveal the protein functionality of sequences
Wong A., Lee E. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11(3): 548-560, 2014. Type: Article
Date Reviewed: Nov 10 2014

Protein sequence analysis aims to discover the structures and functions of proteins in living organisms by comparing sequences, identifying intrinsic features, examining sequence differences and variations, and studying molecular structures; it also helps reveal evolution and genetic diversity. The authors present a computationally efficient method that reveals the protein functionality of sequences by aligning and clustering patterns.

The method, the aligned pattern (AP) synthesis process, consists of three steps: pattern discovery, AP clustering, and AP cluster refinement. Step one finds nonredundant, statistically significant associations of amino acids. It uses a fast and space-efficient algorithm that applies statistical conditions as confidence thresholds, so that only statistically significant, nonredundant patterns are retained. Step two groups and aligns sequence patterns, and synthesizes them into AP clusters. The authors employ the global Needleman-Wunsch alignment algorithm and the local Smith-Waterman alignment algorithm to merge two AP clusters into one during hierarchical clustering. Finally, step three refines the AP clusters into weak AP clusters and then into conserved AP clusters, which improves sequence coverage while maintaining cluster entropy.
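To make the two alignment algorithms named above concrete, here is a minimal Python sketch of their scoring recurrences. It is not the authors' implementation: the function names, the unit match/mismatch/gap scores, and the example strings are assumptions chosen for illustration, and the paper's actual scoring scheme and its use inside hierarchical clustering are not described in this review.

```python
def needleman_wunsch_score(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment score (Needleman-Wunsch) with a linear gap penalty.

    The scoring values are illustrative assumptions, not taken from the paper.
    """
    # Row 0: aligning an empty prefix of `a` against prefixes of `b` costs gaps only.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # column 0: prefix of `a` against an empty prefix of `b`
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def smith_waterman_score(a, b, match=1, mismatch=-1, gap=-1):
    """Best local alignment score (Smith-Waterman) with a linear gap penalty."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Clamping at 0 lets a local alignment restart anywhere.
            cell = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(cell)
            best = max(best, cell)
        prev = curr
    return best


if __name__ == "__main__":
    # Hypothetical pattern strings, not data from the paper.
    print(needleman_wunsch_score("ACGT", "ACT"))
    print(smith_waterman_score("XXCHGCYY", "AACHGCBB"))
```

The global score penalizes every unaligned position, whereas the local score rewards only the best-matching shared segment; that difference is presumably why both are useful when deciding whether two pattern clusters should be merged.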

The authors conduct in silico tests using three biological datasets: cytochrome c, ubiquitin, and triosephosphate isomerase (TIM). For consistency, they use “the minimum occurrence for each pattern set as half of the total number of sequences multiplied by the percentage of identity and coverage.” The cytochrome c results show AP clusters correlating “to binding sites that richly represent ... binding segments as patterns and ... binding residues as aligned columns”; the ubiquitin results illustrate hierarchical clustering performance and AP cluster quality; and the TIM results illustrate biological significance.
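The quoted minimum-occurrence rule can be read in more than one way. The sketch below assumes one reading: half the number of sequences, multiplied by an identity fraction and a coverage fraction, rounded up to a whole count. The function name, the rounding choice, and the treatment of identity and coverage as two separate factors are all assumptions; the paper should be consulted for the exact definition.

```python
import math


def min_occurrence(n_sequences, identity, coverage):
    """Hypothetical reading of the quoted threshold.

    identity and coverage are fractions in [0, 1]; rounding up is an assumption.
    """
    if n_sequences < 0:
        raise ValueError("n_sequences must be non-negative")
    if not (0.0 <= identity <= 1.0 and 0.0 <= coverage <= 1.0):
        raise ValueError("identity and coverage must be fractions in [0, 1]")
    return math.ceil(0.5 * n_sequences * identity * coverage)


if __name__ == "__main__":
    # Illustrative numbers only; they are not taken from the paper's datasets.
    print(min_occurrence(200, 0.5, 0.5))
```

Under this reading, a pattern must occur in fewer sequences when the dataset is more divergent (lower identity) or the patterns cover less of each sequence, which keeps the threshold comparable across datasets.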

This is an interesting read about protein analysis; it clearly profiles the nature of the protein functionality domain. The authors claim to have developed a unique and novel AP synthesis process.

Reviewer:  Lalit Saxena Review #: CR142916 (1502-0185)
Clustering (I.5.3)
Biology And Genetics (J.3 ...)
Sequencing And Scheduling (F.2.2 ...)
Nonnumerical Algorithms And Problems (F.2.2)
Life And Medical Sciences (J.3)
