Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
On-line recognition of spoken words from a large vocabulary
Kohonen T. (ed), Riittinen H., Reuhkala E., Haltsonen S. Information Sciences33 (1-2):3-30,1984.Type:Article
Date Reviewed: Oct 1 1985

This paper introduces a very interesting technique for the online recognition of large-vocabulary words spoken in isolation. The technique involves a two-stage recognition scheme in which the speech signal is first converted into phoneme-based strings (“test strings”) using a modified spectral recognition method, and then a quick comparison of these phoneme-based strings with reference transcriptions (“prototypes”) stored in a dictionary is performed. There are several reference transcriptions corresponding to a word, represented in a hash index table. The word recognition decision is based on the distances of the test strings from the prototypes of a word-class.

Statistical pattern-recognition methods, incorporating iterative “learning procedures,” are used for determining the test strings of the spoken words. The word recognition is then tackled by a technique called the “redundant hash addressing” wherein, instead of conducting a symbol-by-symbol comparison of the test string with every prototype string, the features of the test string are compared with a table of all the features that have occurred in the prototype strings.

Since most of the computational activity is centered around the construction of the reference data, this method yields a fast rate of recognition.

Reviewer:  A. K. Menon Review #: CR108951
Bookmark and Share
 
Speech Recognition And Synthesis (I.2.7 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Speech Recognition And Synthesis": Date
Connected spoken word recognition algorithms by constant time delay DP, O (n) DP and augmented continuous DP matching
Nakagawa S. Information Sciences 33(1-2): 63-85, 1984. Type: Article
Jun 1 1985
The phonetic basis for computer speech processing
Ladefoged P., Prentice Hall International (UK) Ltd., Hertfordshire, UK, 1985. Type: Book (9789780131638419)
Dec 1 1987
Frequency-domain analysis of speech
Fallside F. (ed), Prentice Hall International (UK) Ltd., Hertfordshire, UK, 1985. Type: Book (9789780131638419)
Dec 1 1987
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy