Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection
Koniusz P., Yan F., Mikolajczyk K. Computer Vision and Image Understanding117 (5):479-492,2013.Type:Article
Date Reviewed: Jul 26 2013

The paper makes a systematic and detailed evaluation of two main stages of bag-of-words (BoW) modeling for computer vision. BoW is an accepted technique in computer vision to represent vocabularies of image features. The authors explore the mid-level coding of image descriptors and pooling steps.

The first part of the paper provides a thorough review of four mid-level coding approaches: soft assignment, its extension approximate locality-constrained soft assignment (LcSA), sparse coding (SC), and approximate locality-constrained linear coding (LLC). The authors analyze the accuracy and speed of coding each of these schemes, and suggest several interesting solutions to improve the system’s performance. For instance, minimizing the residual error of approximation of a descriptor vector proves useful for optimally setting the coding parameters. This fact is discussed and expressively illustrated. The authors also propose a fast hierarchical nearest neighbor search based on a compact dictionary of the l-nearest neighbors.

The third section represents an exhaustive exploration of six pooling methods: average, max-pooling, power normalization, theoretical expectation of max-pooling and the probability of at least one particular visual word being present in an image, Lp-norm as a tradeoff between average and max-pooling, and mix-order max-pooling. The paper introduces a new scheme to supplement the max approach, and demonstrates its value by assessing cross vocabulary leakage and descriptor interdependence. The experimental section evaluates the performance of the four mid-level approaches in the framework of the mentioned pooling methods and on a range of datasets. The authors outline the benefits of the suggested improvements for the classification results.

The paper’s sound investigation and proposed solutions represent a significant and valuable contribution. It is appropriate for researchers and designers of image recognition applications.

Reviewer:  Svetlana Segarceanu Review #: CR141398 (1310-0939)
Bookmark and Share
  Featured Reviewer  
 
Feature Evaluation And Selection (I.5.2 ... )
 
 
Feature Representation (I.4.7 ... )
 
 
General (I.4.0 )
 
 
General (I.5.0 )
 
Would you recommend this review?
yes
no
Other reviews under "Feature Evaluation And Selection": Date
Labeled point pattern matching by Delaunay triangulation and maximal cliques
Ogawa H. Pattern Recognition 19(1): 35-40, 1986. Type: Article
Feb 1 1988
Features selection and ‘possibility theory’
Di Gesù V., Maccarone M. Pattern Recognition 19(1): 63-72, 1986. Type: Article
Dec 1 1987
An analytic-to-holistic approach for face recognition based on a single frontal view
Lam K., Yan H. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(7): 673-686, 1998. Type: Article
Oct 1 1998
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy