Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
A two-stage framework for 3D face reconstruction from RGBD images
Wang K., Wang X., Pan Z., Liu K. IEEE Transactions on Pattern Analysis and Machine Intelligence36 (8):1493-1504,2014.Type:Article
Date Reviewed: Apr 8 2015

The application of MS Kinect is popular since it provides the user with color images with depth in real time. Related research in computer graphics has been focused on getting structured information out of the raw data. Although this is the first application of face reconstruction using Kinect, the ideas behind it are not as new: divide and conquer, linear shape model, and global smoothing. As long as it is done right, this application definitely boosts the value of Kinect.

As pointed out by the authors, the main challenges of recovering a smooth, high-vertex-count 3D mesh with facial structures are noise, corruption, low resolution, and large deformations. Instead of facing all of these difficulties at one time, it is wise to divide the input data into patches and carry out the best protocol locally before they are put together again. This is where active appearance model (AAM) face tracking comes into play. You may notice that the difficulty of modeling when there are large deformations is alleviated due to the low degree of deformation freedom that each local patch has. Local sparse coding introduces a linear model to reconstruct each patch, and because of this model, occlusion and noise are handled properly. Another interesting part of this work is that it not only recovers the facial geometry, but also builds the vertex correspondences between the result mesh and each set of training data. This makes final global smoothing possible by deforming a training mesh onto the stitched patches.

The experiments show some improvement over existing methods, especially when input data is challenging. They also generated some data using traditional structured lights for quantitative analysis, which gives readers an idea of how far Kinect can go.

Although the authors want to make further improvements to recover subtle facial details, this seems to be the end of the story until new hardware becomes available. Considering the processing time is over one minute for this method, improving its efficiency without sacrificing its good properties might be more useful.

Reviewer:  Chang Liu Review #: CR143320 (1507-0630)
Bookmark and Share
  Featured Reviewer  
 
Reconstruction (I.4.5 )
 
 
Feature Evaluation And Selection (I.5.2 ... )
 
 
Image Databases (H.2.8 ... )
 
 
Three-Dimensional Displays (I.3.1 ... )
 
 
Feature Measurement (I.4.7 )
 
Would you recommend this review?
yes
no
Other reviews under "Reconstruction": Date
Linear quadtrees: a blocking technique for contour filling
Gargantini I., Atkinson H. Pattern Recognition 17(3): 285-293, 1984. Type: Article
Jun 1 1985
Estimating the viewing parameters of random, noisy projections of asymmetric objects for tomographic reconstruction
Lauren P., Nandhakumar N. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5): 417-430, 1997. Type: Article
Apr 1 1998
Principles of computerized tomographic imaging
Kak A., Slaney M., Society for Industrial and Applied Mathematics, Philadelphia, PA, 2001.  327, Type: Book (9780898714944)
Apr 19 2002
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy