Computing Reviews, the leading online review service for computing literature.

Search

A two-stage framework for 3D face reconstruction from RGBD images
Wang K., Wang X., Pan Z., Liu K. IEEE Transactions on Pattern Analysis and Machine Intelligence36 (8):1493-1504,2014.Type:Article

Date Reviewed: Apr 8 2015

The application of MS Kinect is popular since it provides the user with color images with depth in real time. Related research in computer graphics has been focused on getting structured information out of the raw data. Although this is the first application of face reconstruction using Kinect, the ideas behind it are not as new: divide and conquer, linear shape model, and global smoothing. As long as it is done right, this application definitely boosts the value of Kinect. As pointed out by the authors, the main challenges of recovering a smooth, high-vertex-count 3D mesh with facial structures are noise, corruption, low resolution, and large deformations. Instead of facing all of these difficulties at one time, it is wise to divide the input data into patches and carry out the best protocol locally before they are put together again. This is where active appearance model (AAM) face tracking comes into play. You may notice that the difficulty of modeling when there are large deformations is alleviated due to the low degree of deformation freedom that each local patch has. Local sparse coding introduces a linear model to reconstruct each patch, and because of this model, occlusion and noise are handled properly. Another interesting part of this work is that it not only recovers the facial geometry, but also builds the vertex correspondences between the result mesh and each set of training data. This makes final global smoothing possible by deforming a training mesh onto the stitched patches. The experiments show some improvement over existing methods, especially when input data is challenging. They also generated some data using traditional structured lights for quantitative analysis, which gives readers an idea of how far Kinect can go. Although the authors want to make further improvements to recover subtle facial details, this seems to be the end of the story until new hardware becomes available. Considering the processing time is over one minute for this method, improving its efficiency without sacrificing its good properties might be more useful.

Reviewer: Chang Liu	Review #: CR143320 (1507-0630)

Reconstruction (I.4.5 )

Feature Evaluation And Selection (I.5.2 ... )

Image Databases (H.2.8 ... )

Three-Dimensional Displays (I.3.1 ... )

Feature Measurement (I.4.7 )

Would you recommend this review?

yes

Other reviews under "Reconstruction":	Date

Linear quadtrees: a blocking technique for contour filling Gargantini I., Atkinson H. Pattern Recognition 17(3): 285-293, 1984. Type: Article	Jun 1 1985

Estimating the viewing parameters of random, noisy projections of asymmetric objects for tomographic reconstruction Lauren P., Nandhakumar N. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(5): 417-430, 1997. Type: Article	Apr 1 1998

Principles of computerized tomographic imaging Kak A., Slaney M., Society for Industrial and Applied Mathematics, Philadelphia, PA, 2001. 327, Type: Book (9780898714944)	Apr 19 2002

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy