Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Best of 2016 Recommended by Editor Recommended by Reviewer Recommended by Reader
Search
Globally optimal joint image segmentation and shape matching based on Wasserstein modes
Schmitzer B., Schnörr C. Journal of Mathematical Imaging and Vision52 (3):436-458,2015.Type:Article
Date Reviewed: Aug 19 2015

At times, the task of understanding the contents of an image has used two sequential steps of segmentation and recognition, without interplay between the two. A better, though more complex, approach is to determine object boundaries as part of the process of recognizing those objects, that is, “joint image segmentation and shape matching.” A priori information about the shapes of expected objects is generally used, but these priors can be either overly simplistic or demanding to apply.

Schmitzer and Schnörr describe an approach that achieves a higher level of generality yet can be applied in computationally reasonable ways. The essence is to define a function that can be simultaneously optimized across different poses and variations in segmentation. The set of possible “transports” defines a form of a 2-Wasserstein space, within which poses and segmentations form a manifold. This allows for simultaneous optimization, though there is some low-level approximation (in terms of rotation and scale, for example). Practical use of the technique requires the application of several computational methods, from contour learning to the formulation and solution of a min-cut graph problem to find the optimal transport. Each is discussed in detail and external works are cited.

One interesting section discusses the authors’ strategy for dealing with the following problem: the natural technique considers all variation from the prior as having equal weight, but some object deformations are more likely than others. To deal with this, they describe a family of possible object shapes. This family is subject to statistical analysis to find the principal components of the variation, in the contour representation, which can be related directly to the optimization problem.

The technique is powerful, yet mathematically complex. This paper is not simple to read and digest only because of the mathematical fluency required to understand and to follow the development of the technique. It is very well written; two obviously talented authors have done a great job of making the imposing mathematics involved as clear as possible. Still, not all readers will be able to follow each step, particularly at the first reading. Importantly, the references are comprehensive and especially serve to provide background reading on the mathematics involved. This is a good approach for the solution of a difficult, common, and fundamental problem: finding objects in unsegmented images. A complex solution to a difficult problem is not surprising; the investment of time to understand the mathematics involved is easily justified. This paper provides a thorough and clear description of such a solution, and is recommended to anyone interested in image segmentation and understanding.

Reviewer:  Creed Jones Review #: CR143705 (1512-1077)
Bookmark and Share
  Editor Recommended
 
 
Segmentation (I.4.6 )
 
Would you recommend this review?
yes
no
Other reviews under "Segmentation": Date
Knowledge-based interpretation of outdoor natural color scenes
Ohta Y., Pitman Publishing, Inc., Marshfield, MA, 1985. Type: Book (9789780273086734)
Jun 1 1986
Image segmentation and uncertainty
Wilson R., Spann M., Research Studies Press Ltd., Taunton, UK, 1988. Type: Book (9780863800672)
Feb 1 1990
Surfaces in range image understanding
Besl P., Springer-Verlag New York, Inc., New York, NY, 1988. Type: Book (9789780387967738)
Dec 1 1989
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy