Computing Reviews, the leading online review service for computing literature.

Search

Real-time audio-to-score alignment of music performances containing errors and arbitrary repeats and skips
Nakamura T., Nakamura E., Sagayama S. IEEE/ACM Transactions on Audio, Speech and Language Processing24 (2):329-339,2016.Type:Article

Date Reviewed: Aug 12 2016

The primary contribution of this paper is the development and presentation of two hidden Markov models (HMM) that can follow a monophonic audio performance in real time and recognize arbitrary repeats/skips and other errors in a performance. While musicians or sound performers play or rehearse by following an accompanying score playing on a laptop, for example, the need to revise a series of notes or alter the starting point several times typically occurs when practicing or rehearsing. The problem addressed by the authors is the computational cost of managing these necessary interruptions to the score. Even though following or playing a music score on a laptop (score following) is by no means something new, managing arbitrary interruptions in a score is of practical importance. To address the problem of computational cost and the complexity of altering the score in various and changing locations within the score, the authors have developed two novel HMMs. The first or top model contains an interface that describes arbitrary errors such as skips and repeats and can compute the probability distribution before and after each skip or repeat. The second or bottom model relates to subevents such as sustaining a note or sound, for example. The second section of the paper provides a clear description of the models that includes the equations and figures that describe the logic. I found the equations difficult, but the figures provided were easy to follow and clarified the logic of the HMMs. I found this work in progress promising for score following with convenient management of arbitrary errors and exceptional instrumental sounds. If you are interested in mathematical engineering, Markov models, probabilistic models, and sound, you may find this audio-to-score project engaging and well worthwhile. This is a well-written paper that describes complex mathematical solutions that in my opinion may eventually produce worthwhile results.

Reviewer: Alyx Macfadyen	Review #: CR144685 (1611-0846)

Sound And Music Computing (H.5.5 )

Signal Processing Systems (C.3 ... )

Applications And Expert Systems (I.2.1 )

Special-Purpose And Application-Based Systems (C.3 )

Would you recommend this review?

yes

Other reviews under "Sound And Music Computing":	Date

Music, cognition, and computerized sound Cook P., MIT Press, Cambridge, MA, 1999. Type: Book (9780262032568)	Jul 1 1999

Linux music & sound Phillips D., No Starch Press, San Francisco, CA, 2000. 399, Type: Book (9781886411340)	Aug 1 2001

Machine musicianship Rowe R., MIT Press, Cambridge, MA, 2001. 399, Type: Book (9780262182065)	Aug 1 2001

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy