Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Architectural optimizations for low-power, real-time speech recognition
Krishna R., Mahlke S., Austin T.  Compilers, architectures and synthesis for embedded systems (Proceedings of the international conference, San Jose, California, USA,220-231.2003.Type:Proceedings
Date Reviewed: May 12 2004

Ever wondered how much continuous speech recognition could be done with one AA battery? Eighteen thousand, 180,000, or even 1.8 million words? Given an imputed demand for speech recognition on handheld devices, the dilemma is clear: massive parallel processing algorithms face severely limited AA battery power reservoirs.

The authors argue that a simplified multi-threaded architecture that uses sublanguage information and decentralized controllers to reduce combinatorics in processing speech improves search efficiency, cuts down on the rate of data requests into the memory system, and, consequently, uses less power. The paper briefly introduces the state-of-the-art of speech processing, succinctly presents the authors’ proposal for a system architecture that could effectively be used for handheld devices, and presents a thorough, seven-page performance evaluation, before concluding with a cogent summary of related work and future research directions.

The findings are threefold. First, high-concurrency execution environments with latency tolerance improve speech recognition. Second, reduction of static power dissipation leads to less energy consumption for a given task. Third, the crux in improving performance lies in optimizing the memory system, and reducing heat dissipation during power consumption.

The authors extrapolate a performance of about 95 to 100 words per minute (18,000 words) for three hours of AA battery life. The do-ability is almost certain; the actual usability is a different story. Until speech recognizers go beyond is-this-what-you-mean confirmation prompts, and handle rudimentary dialogue without repetitive user input, battery life takes a back seat to the creature comforts of real-life spoken interaction.

Reviewer:  Klaus K. Obermeier Review #: CR129604 (0411-1346)
Bookmark and Share
  Featured Reviewer  
 
Mobile Processors (C.1.4 ... )
 
 
Real-Time And Embedded Systems (C.3 ... )
 
 
Speech Recognition And Synthesis (I.2.7 ... )
 
 
Special-Purpose And Application-Based Systems (C.3 )
 
Would you recommend this review?
yes
no
Other reviews under "Mobile Processors": Date
Programming wireless devices with the Java 2 platform, micro edition
Riggs R., Huopaniemi J., Taivalsaari A., Patel M., Uotila A., Sun Microsystems, Inc., Mountain View, CA, 2003.  434, Type: Book (9780321197986)
Dec 17 2003
Mobile disruption: the technologies and applications that are driving the mobile Internet
Funk J., Wiley-Interscience, New York, NY, 2003.  211, Type: Book (9780471511229)
May 14 2004
How to do everything with your Treo 600
Ball D., Foster D., McGraw-Hill Osborne Media, 2004. Type: Book (9780072255812)
Mar 2 2005
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy