The most commonly accepted hypothesis states that a 3D-point M is
projected with a perspective projection onto an image plane on a 2D-point
.
Choosing a reference frame attached to the camera, the
projection equation is:
This model can be refined, by taking optical distortions into account [18,4,7]. In this paper, we will consider that the needed corrections have been done as a preprocessing.
Two approximations have been proposed in the literature :
The perspective projection model may be approximated
[2,15,12,13,14] to
its first order with respect to the 3D coordinates. This is equivalent to
approximate the perspective projection in two steps: (i) a projection parallel to
the gaze direction onto an auxiliary plane
which is parallel to the image plane
and passes through the scene center
followed
by (ii) a perspective projection onto the image plane. This so called
para-perspective model yields linear equations (2).
Equation 2 corresponds to the most general case of para-perspective projection although more simple expressions have been proposed [16].
The zero-order development with respect to the 3D depth consists in a rougher
approximation. It is also equivalent to another two steps approximation: (i)
an orthogonal projection onto the auxiliary plane
followed by
(ii) a perspective projection onto the image plane. This approximation, called
the orthographic model (4), is well adapted to foveal attention
and is characterized by linear equations without any new parameter. It is an
approximation of the para-perspective model when the observed objects are in
the fovea, i.e. close to the optical axis :
projection case |
![]() |
![]() |
perspective projection | 1 | 1 |
orthographic projection | 0 | 0 |
para-perspective projection | 1 | 0 |
Let I1 and I2 denote two images. In the general case, there exists a
fundamental relation between points
in I2 and points
in I1 :
where
is called the fundamental matrix [9].
However, this relation is not defined in some singular cases. For example, it is well
known that, in the perspective projection case, if the displacement is a pure
rotation or, if the scene is planar, the relation between points is
homographic :
where
is called the
homographic matrix.
Thus
.
Remember that
and
Let
and
.
Previous equations lead to :
This relation is homographic if and only if
or if there exists a
(3
3) matrix
such as
.
The first condition induces a displacement constraint. It leads to the simple
equation
meaning that the rotation axis is parallel
to the gaze direction. The second condition induces a geometric relation on the 3D
point : Z1 is an affine function of X1 and Y1, meaning that the
3D points must belong to a plane
,
which cannot contain the
optical axis and the gaze direction (see [13] for a demonstration).
The orthographic case is a particular case of para-perspective projection for which the gaze direction is the optical axis. Following a demonstration similar to the para-perspective case, we also obtain two constraints; the displacement constraint states that the rotation axis must be parallel to the optical axis, and the geometric constraint states that the 3D-points must belong to the same plane which does not contain the optical axis. All constraints on displacement and scene geometry for homographic relations are summarized in the following table :
projection | displacement constraint | geometric constraint |
perspective |
![]() |
plane |
para-perspective |
![]() |
plane Z=f(X,Y) |
orthographic |
![]() |
plane Z=f(X,Y) |