Computing Reviews, the leading online review service for computing literature.

Search

Supervised dictionary learning for music genre classification
Yeh C., Yang Y. ICMR 2012 (Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, Hong Kong, Jun 5-8, 2012)1-8.2012.Type:Proceedings

Date Reviewed: Aug 15 2012

This paper uses a supervised dictionary learning method to classify music genres. It shows how text-like representation captures relevant music information. The main results prove the high accuracy of the method, with references to other state-of-the-art developments. The method starts with an online dictionary learning (ODL) method, which is a “first-order stochastic gradient descent algorithm” that scans a training set and processes an element by alternating a sparse coding step for computing the codeword decomposition. The main algorithm proposed in the paper, known as the supervised dictionary learning (SDL) algorithm, incorporates ground truth labels in the dictionary learning phase to enlarge the differences between the learned codewords. Using SDL, the GTZAN dataset (“composed of 1,000 30-second clips covering ten genres”) achieved 84.7 percent accuracy for music genre classification and the ISMIR2004Genre dataset (“1,458 full-length songs covering six genres”) achieved 90.8 percent accuracy. Overall, the methodology improves on the bag-of-frames (BOF) model, which represents each song “as a histogram over a dictionary of music ‘codewords’ selected or learned from a music collection,” by applying techniques from dictionary learning and sparse coding to music information retrieval. The main contributions of the paper reside in the results section, where the authors benchmark multiple encoding and construction techniques, proving that the sparsity-enforced dictionary learning method achieves the highest accuracy. Most importantly, the authors note that the entire framework can be easily applied to other multimedia retrieval problems.

Reviewer: George Popescu	Review #: CR140538 (1302-0141)

Methodologies And Techniques (H.5.5 ... )

Systems (H.5.5 ... )

Would you recommend this review?

yes

Other reviews under "Methodologies And Techniques":	Date

Real sound synthesis for interactive applications Cook P., A. K. Peters, Ltd., Natick, MA, 2002. 250, Type: Book (9781568811680)	Jan 23 2004

From music to 3D scenography and back again Lervig M. In Production methods. New York, NY: Springer-Verlag New York, Inc., 2003. Type: Book Chapter	May 21 2004

Music analysis and retrieval systems for audio signals Tzanetakis G., Cook P. Journal of the American Society for Information Science and Technology 55(12): 1077-1083, 2004. Type: Article	Jul 8 2005

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy