Next: Conclusions
Up: Experiments on Wavelet Networks
Previous: Face Recognition independent of
In this section we present the results of two face-pose estimation
experiments that were carried out to verify the ``optimal filter
bank'' principle of the WNs.
For both experiments we connected a doll's head to a robot arm and let
the robot move the doll's head in front of a fixed camera. With this
the correct pose was always known.
In the first experiment we tracked the doll's head with a color blob
tracker and distributed sets of 4
complex Gabor filters with the different orientations of 0,
,
and
over the tracked
inner face region. The resulting complex projections of these
filters were then fed into an artificial neural LLM network (ANN)
[Bruske and Sommer, 1995,Ritter et al.,
1991].
This was done so for training as well for testing.
A precise description of this experiment can be found in
[Bruske et al.,
1998].
The mean pan/tilt error that we reached was
,
computed as
.
It is reasonable to assume that the choice of better Gabor
filters would result an even lower mean pan/tilt error. In our
second experiment,
we therefore optimized a template WN for the doll's
face with wavelets
(see fig. 10). As mother wavelet, the odd Gabor
function was used, fig. 10 shows the optimized WN.
As in the first experiment, the doll's head was
connected to a robot arm, so that the pan/tilt ground truth was
known. During the training of the ANN and testing, the doll's head was first
tracked using our face tracking method of Section 3.1
and then the optimal wavelet coefficient vectors were
computed. Fig. 11 shows example images of the
tracked doll's head.
Figure 11:
The images show different orientations of the doll's
head. The head is connected to a robot arm so that the ground
truth is known. The white square indicates the detected position,
scale and orientation of the WN.
|
The optimal coeff. vectors were then fed into the ANN. The
employed ANN was of the same type in both experiments.
In this second experiment the dimensionality
of the feature vectors was smaller: instead of the 128 complex values
coefficients of the first experiment, we used in the second experiment only
52 real valued coefficients.
We used 400 training images in both experiments.
With this, we reached a mean pan/tilt error of
with a processing speed of
fps on a 450 MHz Linux
Pentium. The experiments have been repeated several times, and the
variations of the estimated mean pan/tilt error over several
experiments were small (
for the WN experiment).
A more detailed description of our experimental results can be found
in [Krüger et al.,
2000].
Next: Conclusions
Up: Experiments on Wavelet Networks
Previous: Face Recognition independent of
Volker Krueger
2001-05-31