Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Supervised dictionary learning for music genre classification
Yeh C., Yang Y.  ICMR 2012 (Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, Hong Kong, Jun 5-8, 2012)1-8.2012.Type:Proceedings
Date Reviewed: Aug 15 2012

This paper uses a supervised dictionary learning method to classify music genres. It shows how text-like representation captures relevant music information. The main results prove the high accuracy of the method, with references to other state-of-the-art developments.

The method starts with an online dictionary learning (ODL) method, which is a “first-order stochastic gradient descent algorithm” that scans a training set and processes an element by alternating a sparse coding step for computing the codeword decomposition. The main algorithm proposed in the paper, known as the supervised dictionary learning (SDL) algorithm, incorporates ground truth labels in the dictionary learning phase to enlarge the differences between the learned codewords.

Using SDL, the GTZAN dataset (“composed of 1,000 30-second clips covering ten genres”) achieved 84.7 percent accuracy for music genre classification and the ISMIR2004Genre dataset (“1,458 full-length songs covering six genres”) achieved 90.8 percent accuracy. Overall, the methodology improves on the bag-of-frames (BOF) model, which represents each song “as a histogram over a dictionary of music ‘codewords’ selected or learned from a music collection,” by applying techniques from dictionary learning and sparse coding to music information retrieval.

The main contributions of the paper reside in the results section, where the authors benchmark multiple encoding and construction techniques, proving that the sparsity-enforced dictionary learning method achieves the highest accuracy. Most importantly, the authors note that the entire framework can be easily applied to other multimedia retrieval problems.

Reviewer:  George Popescu Review #: CR140538 (1302-0141)
Bookmark and Share
  Reviewer Selected
 
 
Methodologies And Techniques (H.5.5 ... )
 
 
Systems (H.5.5 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Methodologies And Techniques": Date
Real sound synthesis for interactive applications
Cook P., A. K. Peters, Ltd., Natick, MA, 2002.  250, Type: Book (9781568811680)
Jan 23 2004
From music to 3D scenography and back again
Lervig M. In Production methods. New York, NY: Springer-Verlag New York, Inc., 2003. Type: Book Chapter
May 21 2004
Music analysis and retrieval systems for audio signals
Tzanetakis G., Cook P. Journal of the American Society for Information Science and Technology 55(12): 1077-1083, 2004. Type: Article
Jul 8 2005
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy