Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Speech enhancement in the STFT domain
Benesty J., Chen J., Habets E., Springer Publishing Company, Incorporated, New York, NY, 2011. 116 pp. Type: Book (978-3-642232-49-7)
Date Reviewed: May 21 2012

Larger than an exhaustive IEEE tutorial paper, but smaller than an actual book, this “brief” focuses on frequency domain processing for speech enhancement. The style is that of a theorem (or rather statement) proof. Its conciseness and attention to the details in most of the derivations or proofs make it well suited for a crash course in frequency domain speech enhancement, with an exclusive bias toward theoretical developments.

Unfortunately, this theoretical formality is pushed to the limit. For example, in chapter 4 on multichannel speech enhancement including microphone arrays, there is not a single graph. Perhaps the reader is supposed to know the subject well enough, so as not to need graphs and diagrams, but why then would a reader well versed in the subject need to read that chapter at all? In fact, the book contains a total of only five figures, all in the introductory chapter. This is rather strange for a signal processing book, especially when one considers that array processing owes many of its unique features to specific geometries. In other words, this book falls short on the practical side.

If ones writes a thesis, and needs to fill it with impressive derivations, then this is the right book. If one needs to get an intuitive feel for how enhancement in the frequency domain works or hands-on experience by simulating algorithms, it would be better to look for Loizou’s book on speech enhancement [1]. That book has the same mathematical elegance as the current one, with the added bonus of implementation details and simulation results that are reproducible. Loizou’s book lacks a multichannel section, but given the fact that the current book’s multichannel section is overly formal, it is hard to look at this omission as a disadvantage when comparing the two.

Reviewer:  Vladimir Botchev Review #: CR140170 (1209-0905)
1) Loizou, P. C. Speech enhancement: theory and practice. CRC Press, Boca Raton, FL, 2007.
Bookmark and Share
  Reviewer Selected
 
 
Speech Recognition And Synthesis (I.2.7 ... )
 
 
Computation Of Transforms (F.2.1 ... )
 
 
Fast Fourier Transforms (FFT) (G.1.2 ... )
 
 
Approximation (G.1.2 )
 
 
Numerical Algorithms And Problems (F.2.1 )
 
Would you recommend this review?
yes
no
Other reviews under "Speech Recognition And Synthesis": Date
On-line recognition of spoken words from a large vocabulary
Kohonen T. (ed), Riittinen H., Reuhkala E., Haltsonen S. Information Sciences 33(1-2): 3-30, 1984. Type: Article
Oct 1 1985
Connected spoken word recognition algorithms by constant time delay DP, O (n) DP and augmented continuous DP matching
Nakagawa S. Information Sciences 33(1-2): 63-85, 1984. Type: Article
Jun 1 1985
The phonetic basis for computer speech processing
Ladefoged P., Prentice Hall International (UK) Ltd., Hertfordshire, UK, 1985. Type: Book (9789780131638419)
Dec 1 1987
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy