Computing Reviews, the leading online review service for computing literature.

Search

Learning semantic representations of objects and their parts
Mesnil G., Bordes A., Weston J., Chechik G., Bengio Y. Machine Learning94 (2):281-301,2014.Type:Article

Date Reviewed: Jul 3 2014

Digital images are everywhere. To make those images searchable, we laboriously annotate them with a list of contents. Ideally, we want a program that does the annotation for us, but this is still an unsolved problem in image analysis. This paper considers a slightly different problem of extracting parts-owner relationships from an image. Their algorithm can be used to annotate an image and its subregions simultaneously as an object (for example, a car) and its parts (for example, a wheel, headlights, windshield, and so on). The parts-owner relationships provide the semantics of the image, thus the work may be a step toward an automated image understanding system. There are two databases, WordNet and ImageNet, that provide a large number of parts-owner relationships in words and images, respectively. We can use them to train the system. Incorporating the parts-owner relationships may improve the annotation accuracy, since the presence of an object suggests the presence of its parts, and vice versa. These are the motivations behind the work. Training such a system is tricky. Since state-of-the-art object recognition is still not good enough, the algorithm bypasses the visual association of objects and parts altogether. Instead, it learns associations between images and labels provided by ImageNet, as well as parts-owner relationships in labels provided by WordNet. This approach appears clever, but highlights a general issue in artificial intelligence: we often settle for a compromise due to the lack of a reliable artificial vision system. This paper may not be an easy read as it uses algorithms from other works with little explanation. There are some careless errors in the references and notations. Nevertheless, it is entertaining.

Reviewer: T. Kubota	Review #: CR142470 (1410-0887)

Learning (I.2.6 )

Object Recognition (I.4.8 ... )

Image Representation (I.4.10 )

Information Search And Retrieval (H.3.3 )

Would you recommend this review?

yes

Other reviews under "Learning":	Date

Learning in parallel networks: simulating learning in a probabilistic system Hinton G. (ed) BYTE 10(4): 265-273, 1985. Type: Article	Nov 1 1985

Macro-operators: a weak method for learning Korf R. Artificial Intelligence 26(1): 35-77, 1985. Type: Article	Feb 1 1986

Inferring (mal) rules from pupils’ protocols Sleeman D. Progress in artificial intelligence (, Orsay, France,391985. Type: Proceedings	Dec 1 1985

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy