This section recalls briefly the mathematical background on perspective projections necessary for our purposes. For more details see [4,15].
A pinhole camera is modeled by its optical center C and
its retinal plane (or image plane) .
A 3-D point
W is projected into an image point M given by the
intersection of
with the line containing C and W.
The line containing C and orthogonal to
is called the
optical axis and its intersection with
is the
principal point. The distance between C and
is
the focal length.
Let
be the coordinates of W in the
world reference frame (fixed arbitrarily) and
the coordinates of M in the image plane (pixels). The mapping
from 3-D coordinates to 2-D coordinates is the perspective
projection, which is represented by a linear transformation in
homogeneous coordinates. Let
and
be the homogeneous
coordinates of M and W respectively; then the perspective
transformation is given by the matrix
:
![]() |
(3) |
The camera position and orientation (extrinsic parameters), are
encoded by the
rotation matrix
and the
translation vector
,
representing the rigid transformation
that brings the camera reference frame onto the world reference frame.
Let us write the PPM as
![]() |
(4) |
In Cartesian coordinates, the projection (1) writes
The focal plane is the plane parallel to the retinal plane that
contains the optical center C. The coordinates
of
C are given by
The optical ray associated to an image point M is the line
M C, i.e. the set of 3-D points
.
Its parametric equation in
Cartesian coordinates writes:
Let us consider a stereo rig composed by two pinhole cameras
(Fig. 1). Let
and
be the optical
centers of the left and right cameras respectively. A 3-D point
is projected onto both image planes, to points
and
,
which constitute a conjugate pair. Given a point
in the left image plane, its conjugate point in the right image
is constrained to lie on a line called the epipolar line (of
). Since
may be the projection of an arbitrary
point on its optical ray, the epipolar line is the projection through
of the optical ray of
.
All the epipolar lines
in one image plane pass through a common point (
and
respectively) called the epipole, which is the
projection of the optical center of the other camera.
When
is in the focal plane of the right camera, the right
epipole is at infinity, and the epipolar lines form a bundle of
parallel lines in the right image. A very special case is when both
epipoles are at infinity, that happens when the line
(the baseline) is contained in both focal planes, i.e., the
retinal planes are parallel to the baseline. Epipolar lines, then,
form a bundle of parallel lines in both images. Any pair of images
can be transformed so that epipolar lines are parallel and horizontal
in each image. This procedure is called rectification.