Human viewers use context. A blurry box on the road, in a spot where cars usually are, is probably a car; a blurry, vertical, human-sized shape near a car is probably a person.
Existing automatic image recognition systems already use elements of context-based reasoning. However, these systems are mostly limited to situations where one object has already been recognized; that object is then used to gauge the size and function of nearby objects. Human understanding is more complex: sometimes we recognize an object as a car because a person stands near it, and sometimes we recognize an object as a person because he or she stands near a car.
The paper begins with a convincing example: two objects that are not individually recognizable become easily recognizable when placed together. To understand such images automatically, the authors propose a new integrated technique that simultaneously estimates the three-dimensional (3D) geometry of the scene and recognizes the objects in it. They use a Bayesian approach, with the prior probabilities obtained from a statistical analysis of actual images. They make the additional assumption that all objects rest at ground level. While this assumption may sound too restrictive, ordinary human understanding has the same limitation: we easily notice a person at street level, but spotting a person up in a tree is a harder visual puzzle. The resulting integrated approach works surprisingly well.
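The flavor of this Bayesian fusion can be sketched in a few lines. The sketch below is an illustration only, not the authors' actual model: the camera parameters, the height priors, and the function names are all hypothetical assumptions. Under the ground-level assumption, an object's position and size in the image imply a real-world height, and a class-specific prior over heights can be multiplied with an appearance-based detector score.

```python
import math

# Hypothetical camera parameters (assumptions for illustration):
V_HORIZON = 120.0      # image row of the horizon
CAMERA_HEIGHT = 1.6    # camera height above the ground plane, in metres

# Assumed real-world height priors per class: (mean, std dev) in metres.
HEIGHT_PRIORS = {"person": (1.7, 0.15), "car": (1.5, 0.2)}

def implied_world_height(v_base, pixel_height):
    """Ground-plane geometry: an object whose base sits on the ground at
    image row v_base (below the horizon) and spans pixel_height rows has
    world height h = CAMERA_HEIGHT * pixel_height / (v_base - V_HORIZON)."""
    return CAMERA_HEIGHT * pixel_height / (v_base - V_HORIZON)

def gaussian_pdf(x, mu, sigma):
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def fuse(detector_scores, v_base, pixel_height):
    """Multiply each class's appearance score by the geometric likelihood
    of its implied world height, then renormalize (a Bayesian product)."""
    h = implied_world_height(v_base, pixel_height)
    joint = {c: detector_scores[c] * gaussian_pdf(h, *HEIGHT_PRIORS[c])
             for c in detector_scores}
    z = sum(joint.values())
    return {c: p / z for c, p in joint.items()}
```

For example, if the appearance detector is completely ambiguous between "person" and "car" (score 0.5 each), but the box's base and height imply a world height of 1.7 metres, the geometric prior tips the posterior toward "person". The actual system performs a much richer joint inference, recovering the horizon and camera height themselves rather than assuming them.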