Logo EPFL
I&C
 Ecole Polytechnique Fédérale de Lausanne
     Research
 English only       EPFL > I&C > CVLAB > Research > Body > Faces
 RESEARCH
 Research Areas
Ph.D. Theses
 CVLAB CONTENTS
 People
Research
Publications
Teaching
Student projects
Software
Data
Jobs
Intranet
 QUICK LINKS
 EPFL Infoscience
I&C Doctoral School

 

3-D FACIAL RECONSTRUCTION FROM UNCALIBRATED IMAGE SEQUENCES

In recent years, the movie industry has produced such realistic 3--D face models from images that we have come to take them for granted. However, a quick look at the credits  at the end of  a movie such as  ``The Matrix Reloaded''  and at the budgets that are  involved, should alert the careful scientist  to the fact that this   is   a   misperception.    The extraordinary quality of the models shown in that movie required the use of a studio with five calibrated high resolution cameras, carefully  controlled  lighting,  and  an   untold  number  of  hours  of  work. Furthermore the  3--D shapes are often obtained  not directly from the  images but by laser-scanning a plaster cast of the actors' faces.

We therefore address the structure-from-motion problem in the context of head modeling from video sequences for which calibration data is not available. This task is made challenging by the fact that correspondences are difficult to establish due to lack of texture and that a quasi-euclidean representation is required for realism. We have developed two distinct approaches based on bundle-adjustment:

  • Regularized bundle-adjustment. It takes advantage of our rough knowledge of the head's shape, in the form of a generic face model. It allows us to recover relative head-motion and epipolar geometry accurately and consistently enough to exploit a previously-developed stereo-based approach to head modeling. In this way, complete and realistic head models can be acquired with a cheap and entirely passive sensor, such as an ordinary video camera with minimal manual intervention.

  • Using PCA face models.  A  few years  ago, Blanz  &  Vetter  proposed an  impressive  appearance-based   approach  that  addresses  this   issue  using  a sophisticated statistical head model.   It includes shape and texture components that  have  been learned  from  a  large database  of  human  heads.  It  allows reconstruction  from a single  image and  uses the  Phong illumination  model to handle illumination effects, but the shape and texture recovery may be perturbed by  large  cast  shadows  or specularities.   In  our  own, work we  proposed a technique that  reduces the sensitivity  to illumination by replacing the  texture component  of the model  by information provided  by 2--D point correspondences  in all  pairs of consecutive  images. This  helps because such correspondences  tend to be  affected comparatively little  by illumination changes given proper normalization. Furthermore, this approach has the potential for increased automation  by eliminating the need for  3--D feature points whose projections are known.

Results

Face reconstruction example
Top row: 6 images of a short sequence. Bottom row: Corresponding reconstruction



Publications

M. Dimitrijevic, S. Ilic, and P. Fua. Accurate Face Models from Uncalibrated and Ill-Lit Video Sequences. In Conference on Computer Vision and Pattern Recognition, Washington, DC, June 2004.

P. Fua. Regularized Bundle-Adjustment to Model Heads from Image Sequences without Calibration Data , International Journal of Computer Vision, 38(2):153--171, 2000.


 



Comments/Feedback to webmaster.cvlab { at } epfl.ch
Last update : 15 September 2008 11:32:46