Computing Reviews, the leading online review service for computing literature.

Comparison of mid-level feature coding approaches and pooling strategies in visual concept detection
Koniusz P., Yan F., Mikolajczyk K. Computer Vision and Image Understanding 117(5): 479-492, 2013. Type: Article

Date Reviewed: Jul 26 2013

The paper presents a systematic and detailed evaluation of two main stages of bag-of-words (BoW) modeling for computer vision. BoW is an established technique in computer vision for representing images over vocabularies of local image features. The authors explore the mid-level coding of image descriptors and the pooling step.

The first part of the paper provides a thorough review of four mid-level coding approaches: soft assignment, its extension approximate locality-constrained soft assignment (LcSA), sparse coding (SC), and approximate locality-constrained linear coding (LLC). The authors analyze the coding accuracy and speed of each of these schemes, and suggest several interesting solutions to improve the system’s performance. For instance, minimizing the residual error of approximation of a descriptor vector proves useful for optimally setting the coding parameters. This finding is discussed and clearly illustrated. The authors also propose a fast hierarchical nearest neighbor search based on a compact dictionary of the l-nearest neighbors.

The third section presents an exhaustive exploration of six pooling methods: average pooling; max-pooling; power normalization; the theoretical expectation of max-pooling and the probability of at least one particular visual word being present in an image; the L_p-norm as a tradeoff between average and max-pooling; and mix-order max-pooling. The paper introduces a new scheme to supplement the max approach, and demonstrates its value by assessing cross-vocabulary leakage and descriptor interdependence.

The experimental section evaluates the performance of the four mid-level coding approaches in combination with the pooling methods mentioned above, on a range of datasets. The authors outline the benefits of the suggested improvements for the classification results. The paper’s sound investigation and proposed solutions represent a significant and valuable contribution. It is appropriate for researchers and designers of image recognition applications.
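For readers unfamiliar with the two stages discussed above, the following is a minimal Python sketch of soft-assignment coding followed by several of the pooling operators named in the review. It is illustrative only and is not taken from the paper: the function names, the Gaussian width `sigma`, the exponent `gamma`, and the use of a generalized mean for the L_p variant are all assumptions, and the authors' exact formulations may differ.

```python
import math


def soft_assign(descriptor, dictionary, sigma=1.0):
    """Code one descriptor as normalized Gaussian memberships over all visual words.

    `sigma` is an assumed smoothing parameter, not a value from the paper.
    """
    # Squared Euclidean distance from the descriptor to each visual word.
    d2 = [sum((x - c) ** 2 for x, c in zip(descriptor, word)) for word in dictionary]
    # Shift by the smallest distance so the exponentials stay numerically stable.
    nearest = min(d2)
    weights = [math.exp(-(d - nearest) / (2.0 * sigma ** 2)) for d in d2]
    total = sum(weights)
    return [w / total for w in weights]


def average_pool(codes):
    """Mean of each visual-word coefficient over all descriptors of an image."""
    n = len(codes)
    return [sum(column) / n for column in zip(*codes)]


def max_pool(codes):
    """Largest coefficient per visual word over all descriptors of an image."""
    return [max(column) for column in zip(*codes)]


def lp_pool(codes, p):
    """Generalized mean of non-negative codes (assumed form of the L_p trade-off).

    p = 1 reproduces average pooling; large p approaches max-pooling.
    """
    n = len(codes)
    return [(sum(v ** p for v in column) / n) ** (1.0 / p) for column in zip(*codes)]


def power_normalize(pooled, gamma=0.5):
    """Element-wise power applied to a non-negative pooled vector (assumed gamma)."""
    return [v ** gamma for v in pooled]


# Hypothetical toy data: three 2-D visual words and three local descriptors.
dictionary = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
descriptors = [[0.1, 0.0], [0.9, 0.1], [0.0, 0.8]]

codes = [soft_assign(d, dictionary, sigma=0.5) for d in descriptors]
image_signature = power_normalize(average_pool(codes))
```

Each pooling function maps the per-descriptor codes of one image to a single vector with one entry per visual word, which is the image-level signature passed to a classifier in a BoW pipeline.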

Reviewer: Svetlana Segarceanu	Review #: CR141398 (1310-0939)

Feature Evaluation And Selection (I.5.2 ... )

Feature Representation (I.4.7 ... )

General (I.4.0 )

General (I.5.0 )
