This paper introduces a very interesting technique for the online recognition of isolated words from a large vocabulary. The technique is a two-stage recognition scheme: the speech signal is first converted into phoneme-based strings (“test strings”) by a modified spectral recognition method, and these strings are then quickly compared with reference transcriptions (“prototypes”) stored in a dictionary. Each word has several reference transcriptions, represented in a hash index table. The word recognition decision is based on the distances of the test strings from the prototypes of each word class.
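The distance-based decision described above can be sketched as follows. This is a minimal illustration, not the paper's actual procedure: it assumes phoneme strings compared by Levenshtein (edit) distance, and the function names, the toy dictionary, and its transcriptions are hypothetical.

```python
def edit_distance(a, b):
    # Classic dynamic-programming Levenshtein distance between phoneme strings.
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,      # deletion
                          d[i][j - 1] + 1,      # insertion
                          d[i - 1][j - 1] + cost)  # substitution
    return d[m][n]

def recognize(test_string, dictionary):
    # dictionary maps each word class to a list of prototype transcriptions;
    # the word whose nearest prototype is closest to the test string wins.
    return min(dictionary,
               key=lambda word: min(edit_distance(test_string, p)
                                    for p in dictionary[word]))

# Hypothetical toy dictionary with several prototypes per word.
dictionary = {
    "seven": ["sevn", "sevin"],
    "eleven": ["ilevn", "elevn"],
}
print(recognize("sevnn", dictionary))  # -> seven
```

Keeping several prototypes per word lets the decision absorb pronunciation variants: only the best-matching prototype of each word class enters the comparison.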
Statistical pattern-recognition methods, incorporating iterative “learning procedures,” are used to determine the test strings of the spoken words. Word recognition is then handled by a technique called “redundant hash addressing”: instead of comparing the test string symbol by symbol with every prototype string, the features of the test string are looked up in a table of all the features that occur in the prototype strings.
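The table-lookup idea can be sketched as follows. This is only an illustrative sketch, not the paper's implementation: it assumes phoneme trigrams as the string features, and all names and the toy dictionary are hypothetical.

```python
from collections import defaultdict

def trigrams(s):
    # Phoneme trigrams serve as the string features (an assumption here).
    return [s[i:i + 3] for i in range(len(s) - 2)]

def build_feature_table(dictionary):
    # Hash index table: each feature maps to the words whose prototypes contain it.
    table = defaultdict(set)
    for word, prototypes in dictionary.items():
        for proto in prototypes:
            for f in trigrams(proto):
                table[f].add(word)
    return table

def hash_address(test_string, table):
    # One table lookup per test-string feature; votes accumulate per word,
    # so no prototype string is ever scanned symbol by symbol.
    votes = defaultdict(int)
    for f in trigrams(test_string):
        for word in table.get(f, ()):
            votes[word] += 1
    return max(votes, key=votes.get) if votes else None

dictionary = {"seven": ["sevn", "sevin"], "eleven": ["ilevn", "elevn"]}
table = build_feature_table(dictionary)
print(hash_address("sevin", table))  # -> seven
```

Because the work per test string is proportional to its number of features rather than to the dictionary size, the lookup stays fast even for large vocabularies; the voting also tolerates a few erroneous phonemes, which is the redundancy the method's name refers to.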
Since most of the computational effort is concentrated in the offline construction of the reference data, recognition itself proceeds at a fast rate.