Next: Summary of Outstanding Problems
Up: From Surface To Objects:
Previous: Discussion
This book has described an Artificial Intelligence approach
to the problem of three dimensional object recognition, based on
methods that lead to general purpose vision systems
rather than to limited single application systems.
While efficiency is ultimately important, competence must come first.
Only a few researchers have used
sketch-like
surface data, and the work described here has attempted to explore
the whole path from surfaces to objects.
The structure of the approach mirrors classical edge-based
recognition processes, but the use of surface data required new definitions
of the processes and their interconnections.
Some of the interesting aspects of the individual recognition processes were:
- object modeling
- a surface modeling method based on distinct curvature
class patches.
- criteria for how to group model SURFACEs into ASSEMBLYs.
- surface data
- proposed criteria for segmentation of surface image data into
surface patches useful for object recognition.
- surface hypothesizing
- analysis of surface occlusion cases to show what cases occur, how to
detect them and how to hypothetically reconstruct the missing data.
Because the research used three dimensional surface image data, the reconstruction
is more robust than that based on only two dimensional image data.
- surface cluster formation
- the use of the surface cluster as an intermediate representation
between the surface image and the object hypotheses.
- rules for aggregating the surface patches into the surface clusters
corresponding to distinct objects.
- description
- a collection of data description modules that exploited
the three dimensional character of the raw data.
- model invocation
- a network formulation that incorporated both image property evidence
and relationship evidence from class and structural associations.
The formulation was incremental, used operations that were based on
general reasoning rather than strictly visual requirements and
supported a low-level, object independent generic vocabulary.
- hypothesis construction
- new methods for estimating the three dimensional placement of objects from
data associated with surface patches and the intersurface
relationships specified by the object model.
- methods for predicting and verifying the visibility of SURFACEs,
including back-facing, tangential and partially or
fully self-obscured front-facing structure.
- rules for explaining missing structure as instances of occlusion
by external, unrelated structure.
- methods for joining non-rigidly connected structures and simultaneously
estimating the connection degrees-of-freedom.
- methods for completely instantiating hypotheses for both
solid and laminar structures.
- verification
- criteria for verifying the physical existence of a hypothesis.
- criteria for verifying the identity of an object based on
surface evidence.
The research also emphasized the strong distinction between, but equally
strong dependence on, the suggestive "seeing" of model invocation
and the model-directed hypothesis construction and verification.
Finally, the effect of occlusion was considered throughout the visual
process, and methods were developed that helped overcome data loss
at each stage.
When applying the recognition processes described in this book to the
(hand) segmented range image shown in Figure 3.10,
the IMAGINE I system correctly recognized all modeled objects in the
test scene (the robot and trash can, and their subcomponents).
No misidentifications of solids occurred, though several individual surfaces
were misidentified.
Since the model base included 16 ASSEMBLYs, all of about the same size, including
several objects not in this scene (a chair and its subcomponents), this was
a good result.
The original geometric reasoning module gave decent, but not ideal position
and joint angle estimation (seen in Figure 9.16)
and a new network-based geometric reasoning method
improved on this (see Figure 1.11).
Three test scenes were analyzed.
The one used in this book was the most interesting, because it contained articulated objects with some curved surfaces,
laminar surfaced objects, partially constrained degrees-of-freedom (joints)
and considerable self-occlusion.
The success of the recognition process was largely because of the richness
of the
sketch data, but the use of the surface-based hierarchical models,
shape segmented surface patches and hierarchical surface cluster data
representation made the results easier to obtain.
This recognition process was clearly successful on the test image.
However, much research is still needed, and the following section
tries to make explicit some of the problems that remain.
I welcome others to add to the list and to solve them all.
Subsections
Next: Summary of Outstanding Problems
Up: From Surface To Objects:
Previous: Discussion
Bob Fisher
2004-02-26