Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Connected spoken word recognition algorithms by constant time delay DP, O (n) DP and augmented continuous DP matching
Nakagawa S. Information Sciences33 (1-2):63-85,1984.Type:Article
Date Reviewed: Jun 1 1985

Three new Dynamic Programming (DP) algorithms have been proposed for connected-word recognition: constant time delay, O(n) DP, and augmented continuous DP matching algorithms.

The constant time delay DP algorithm is the same as the two level DP algorithm [1], except that local distance computations are reduced by a factor by executing computations after a constant time delay (every W frames) and by increased storage for local distances.

The O(n) DP algorithm is the same as the Bridle algorithm [2] for variable length connected-word recognition. The algorithm provides one pass recognition of connected words and is computationally the least expensive of all three algorithms proposed in the paper. However, the O(n) DP algorithm cannot perform syntax driven recognition while the other two can. The O(n) DP algorithm can be modified to provide unconstrained endpoint recognition by skipping reference frames, or test frames, or both. Another possible modification for the O(n) DP algorithm is the use of frame weights to emphasize certain dissimilar segments of the reference patterns.

The augmented continuous DP matching algorithm is derived from word spotting [3]. This algorithm produces only a suboptimal solution, but is considerably faster than the constant time delay DP algorithm. The algorithm is also well suited for decoding word sequences from a given syntax.

A few comments would be appropriate at this time. The author has made the paper as mathematically complete as possible. Careful reading gives a good insight into the algorithms involved. However, to a more casual reader the algorithms would be difficult to understand without going into the mathematical details. Some insight into the algorithms without having to understand the equations would have been useful.

The author implies that syntactical constraints cannot be applied to O(n) DP programming algorithm. However, Bridle [2] has shown that syntactic constraints can be applied easily. This is evident if one separate copy of the reference is maintained for each place in the syntax where the reference is associated.

Reviewer:  V. Gupta Review #: CR108952
1) Sakoe, H.Two level DP-matching--a dynamic programming based pattern matching algorithm for connected word recognition, IEEE Trans. Acoust. Speech Signal Process. ASSP-27 (1979), 588–595.
2) Bridle, J. S.; Brown, M. D.; and Chamberlain, R. M.An algorithm for connected word recognition, in Automatic speech analysis and recognition, Proc. of the NATO advanced study institute (Bonas, France, June 29-July 10, 1981), Reidel, Dordrecht, The Netherlands, 1982.
3) Christiansen, R. W.; and Rushforth, C. K.Detecting and locating key words in continuous speech using linear predictive coding, IEEE Trans. Acoust. Speech Signal Process. ASSP-25 (1977), 361–367.-
Bookmark and Share
 
Speech Recognition And Synthesis (I.2.7 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Speech Recognition And Synthesis": Date
On-line recognition of spoken words from a large vocabulary
Kohonen T. (ed), Riittinen H., Reuhkala E., Haltsonen S. Information Sciences 33(1-2): 3-30, 1984. Type: Article
Oct 1 1985
The phonetic basis for computer speech processing
Ladefoged P., Prentice Hall International (UK) Ltd., Hertfordshire, UK, 1985. Type: Book (9789780131638419)
Dec 1 1987
Frequency-domain analysis of speech
Fallside F. (ed), Prentice Hall International (UK) Ltd., Hertfordshire, UK, 1985. Type: Book (9789780131638419)
Dec 1 1987
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy