This book describes an interesting topic: how to assess speech synthesis and speech recognition systems, that is, what components to look at and what methodologies to use in assessing their performance. The text is largely unreadable and of little interest.
First, the typography is perfectly awful. Much of the text is spread out over the 41-2--inch lines with only a few words per line. The result is a large amount of empty space on each line. A typical six-word line consists of 35-16- inches of text and 13-16- inches of space between words, with the spacing haphazardly chosen as 6-16-, 5-16-, 2-16-, 3-16-, and 3-16- inches between pairs of words. A normal human being used to conventional typography will find such a presentation most disturbing and will give up in a hurry.
This book represents the initial report of “Esprit Project 1541,” entitled “Speech Assessment Methodologies.” (Esprit is a research program initiated some years ago by the European Economic Community (EEC) to encourage international cooperation between industrial and academic research groups in various countries of the EEC.) Several research projects have been supported under the Esprit umbrella, including this speech assessment work. Each chapter of this book is a report by one of the cooperating organizations and deals with different aspects of speech assessment, including speech recognition, speech synthesis, existing speech databases, phonetic transcription methods, database requirements, and speech workstation technology. The reports were apparently prepared early in 1988.
Even allowing for the fact that the text does not deal with the guts of the problem--the actual state of the art in speech work--but only with some relatively arcane questions of the assessment of speech handling systems, it is disturbing to find that it contains little that is new or interesting. The reader would do much better to read a shorter, more concentrated, and properly focused piece written by a single expert (see, for example, Klatt [1]) instead of this lengthy catalog of systems, components, and possibilities.
A few specialists in speech performance assessment might find something of interest here. The rest of us should, and no doubt will, stay away.