Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Learning semantic representations of objects and their parts
Mesnil G., Bordes A., Weston J., Chechik G., Bengio Y. Machine Learning94 (2):281-301,2014.Type:Article
Date Reviewed: Jul 3 2014

Digital images are everywhere. To make those images searchable, we laboriously annotate them with a list of contents. Ideally, we want a program that does the annotation for us, but this is still an unsolved problem in image analysis. This paper considers a slightly different problem of extracting parts-owner relationships from an image. Their algorithm can be used to annotate an image and its subregions simultaneously as an object (for example, a car) and its parts (for example, a wheel, headlights, windshield, and so on). The parts-owner relationships provide the semantics of the image, thus the work may be a step toward an automated image understanding system.

There are two databases, WordNet and ImageNet, that provide a large number of parts-owner relationships in words and images, respectively. We can use them to train the system. Incorporating the parts-owner relationships may improve the annotation accuracy, since the presence of an object suggests the presence of its parts, and vice versa. These are the motivations behind the work.

Training such a system is tricky. Since state-of-the-art object recognition is still not good enough, the algorithm bypasses the visual association of objects and parts altogether. Instead, it learns associations between images and labels provided by ImageNet, as well as parts-owner relationships in labels provided by WordNet. This approach appears clever, but highlights a general issue in artificial intelligence: we often settle for a compromise due to the lack of a reliable artificial vision system.

This paper may not be an easy read as it uses algorithms from other works with little explanation. There are some careless errors in the references and notations. Nevertheless, it is entertaining.

Reviewer:  T. Kubota Review #: CR142470 (1410-0887)
Bookmark and Share
  Featured Reviewer  
 
Learning (I.2.6 )
 
 
Object Recognition (I.4.8 ... )
 
 
Image Representation (I.4.10 )
 
 
Information Search And Retrieval (H.3.3 )
 
Would you recommend this review?
yes
no
Other reviews under "Learning": Date
Learning in parallel networks: simulating learning in a probabilistic system
Hinton G. (ed) BYTE 10(4): 265-273, 1985. Type: Article
Nov 1 1985
Macro-operators: a weak method for learning
Korf R. Artificial Intelligence 26(1): 35-77, 1985. Type: Article
Feb 1 1986
Inferring (mal) rules from pupils’ protocols
Sleeman D.  Progress in artificial intelligence (, Orsay, France,391985. Type: Proceedings
Dec 1 1985
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy