In this section, we present a fully automatic method for high resolution, non-rigid dense 3D point tracking [9]. Harmonic maps were used in [11] to do surface matching, albeit focusing on rigid transformations. Given the source manifold and the target manifold , only the boundary condition was used to constrain and uniquely determine the harmonic map . For applications like high resolution facial tracking though, we need to account for non-rigid deformations, with a high level of accuracy. To this end, we introduce additional feature correspondence constraints, in addition to the boundary constraint in our implementation of harmonic maps. We select a set of motion-representative feature corners (for example, for facial expression tracking, we select corners of eyes, lips, eye brows etc.) and establish inter-frame correspondences using commonly used techniques (for example, hierarchical matching used in [10]). We can then integrate these correspondence constraints with the boundary condition to calculate harmonic maps, which not only account for global rigid motion, but also subtle non-rigid deformations and hence achieve high accuracy registration and tracking.
The algorithm is illustrated in Figure 1 by considering the example of a synthetic surface undergoing non-rigid deformation.
(a)
(b)
(c)
(d)
(e)
|
High quality dense point clouds of facial geometry moving at video speeds are acquired using a phase-shifting based structured light ranging technique [6]. To use such data for temporal study of the subtle dynamics in expressions, an efficient non-rigid 3D motion tracking algorithm is needed to establish inter-frame correspondences. Because our dynamic range sequences are acquired at a high frame rate (30 Hz), we can assume that the local deformation between two adjacent frames is small. To register two frames, we align their respective harmonic maps as closely as possible by imposing the suitable boundary and feature constraints. The motivation to do so is to establish a common parametric domain for the two surfaces, which, coupled with the above mentioned property, allows to recover 3D registration between the two frames.In our case, the harmonic maps are diffeomorphisms, that is one to one and on-to, and hence lend themselves as a natural choice for surface parameterization in tracking applications.
The outline of the non-rigid tracking algorithm is given as follows:
The accuracy of the proposed tracking algorithm is demonstrated through experiments on real data. We performed tracking on four subjects performing various expressions for a total of twelve sequences of 250-300 frames each (at 30Hz). Each frame contains approximately 80K 3D points, whereas the generic face mesh contains 8K nodes. The tracking results are available as video clips at http://www.cs.sunysb.edu/~ial/expressionModeling.html, including opening and closing of the mouth (female subject) or strongly asymmetric smile (male subject). Our technique tracks very accurately even in the case of topology change and severe `folding' of the data. (See Figure 2)
(a)
(b)
(c)
(d)
(e)
|