Cognitive Vision Model Syllabus
Introduction
This is a syllabus resource for Cognitive Computer Vision, such as might be taught in a comprehensive course on the subject.
Recognising that what is actually taught may be a subset of this material, we have structured this as a resource: the given topics are recommended, but the choice of topics for any particular course is up to the lecturer. This resource is distinct from the Cognitive Computer Vision Ontology, which tries to lay out a view of the structure of Cognitive Computer Vision.
Many technologies could have been included, but we propose those that we thought had the greatest value for Cognitive Vision systems and that are likely to form the foundation for the summer school course and textbook. This is not a hierarchy, nor are the topics mutually exclusive.
We have tried to identify the central topics here, aiming at a typical full-year course with 54 lecture hours. We think that, at a minimum, coverage of each of the five Cognitive Computer Vision subject areas should include an overview, one or more techniques and an example application.
We have tried to be mildly prescriptive about the order of topics, starting with the most important (in our estimation), but we do not specify the method of presentation or the depth, both of which will depend on the presenter's preferences and the amount of available time.
Some good general references are:
- Forsyth and Ponce. Computer Vision: A Modern Approach. Prentice Hall, 2002.
- Duda, Hart and Stork. Pattern Classification (2nd Edition). Wiley-Interscience, 2000.
With ECVision funding, we are still working on: (1) identifying a key citation and (2) collecting online resources for each topic.
Basic prerequisite background knowledge:
- pixels and image structure
- image capture process
- basic color and texture
- basic imaging and optical projection
- basic feature detection: points, edges, lines, regions
- basic image processing: histograms, thresholding, mathematical morphology
- basic geometric shapes, their properties and their fitting/parameter estimation from image data
- basic probability and statistics, including estimation and hypothesis testing
Intermediate prerequisite background knowledge:
- retina and V1 in human visual system, saccades
- eigenvectors and linear algebra
- clustering and grouping
- multiple image sets for stereo and sequence analysis
- feature tracking
The Syllabus
There are five components here, and we assume that some material will be taught from each. Each of the five components is given a minimal time and a full time, in lecture hours.
- Knowledge Representation (3-12 hours)
- Overview/Issues (1-2 hours)
- Style
- Image/Appearance-based
- Relational/Graphical
- Probabilistic
- Ontological
- Geometric/Object
- Logical/Rule-based/Syntactic
- Procedural/Embodied
- Issues
- Indexing
- Certainty
- Scale
- Multiple Representations
- Storage
- Knowledge Representation Technologies (2-5 hours)
- Receptive Fields/Gaussian Derivatives
- Graph Representations
- Bayesian Network Models
- Hidden Markov Models
- Eigenspace / Principal Component Representations
- Active/Deformable/Parametric Shape Models
- Frames/Rules/Demons
- Applications/Case Studies (1-5 hours)
- Activity/Behavior/Processes/Dynamics
- Classification/Category
- Context/Scene/Situations
- Function
- Objects/Parts
- Ontologies
- Parameters
- Task Control
- General Resources
- Recognition, Categorization and Estimation (3-14 hours)
- Overview/Issues (1-2 hours)
- What
- Category
- Parameters
- Position
- State
- Issues
- Accuracy
- Genericity
- Labeling/Detection/Localization
- Recognition Technologies (1-7 hours)
- Bayesian Classification
- Model Based Indexing, Invocation
- Decision Trees, Sequential Classifiers
- k-Nearest Neighbor
- Neural Network/Perceptron Methods
- KMAX
- Applications/Case Studies (1-5 hours)
- Activity/Behavior/Processes/Dynamics
- Classification/Category
- Context/Scene/Situations
- Function
- Objects/Parts
- Parameters
- General Resources
- Reasoning about Structures and Events (4-11 hours)
- Overview/Issues (1-2 hours)
- Content
- Objects & spatial structures and their organisation
- Appearance/Visibility
- Events & temporal structures and their organisation
- Tasks/Goals
- Issues
- Performance
- Prediction
- Planning
- Decision making
- Information fusion
- Self-analysis
- Uncertainty
- Reasoning Technologies (overview) (3-9 hours)
- Bayesian Inference
- Change and Moving Object Detection
- Temporal Event Analysis
- Perceptual Organization, Grouping / Figure-Ground Separation
- Performance Analysis for Vision
- Correspondence Matching
- Optimization
- Planning for sensing and other processes
- Occlusion Understanding and Recovery
- Decision Making
- Probabilistic
- Rule Based
- Soft Control
- Applications/Case Studies (1-5 hours)
- Activity/Behavior/Processes/Dynamics
- Classification/Category
- Context/Scene/Situations
- Function
- Objects/Parts
- General Resources
- Model Learning (2-12 hours)
- Overview/Issues (1-2 hours)
- Types of Learning
- Supervised
- Case-based
- Process identification: ARMA, ANOVA, HMM
- Unsupervised
- Issues
- Feature Selection
- Validation
- Learning Control (Robustness, Speed, Presentations, ...)
- Learning Technologies (1-5 hours)
- Bayesian / Probabilistic Model Learning
- Process Identification
- Graphical Models
- EM
- k-Means
- Principal Component Approaches
- Support Vector Machines
- Structure/Rule Learning
- Applications/Case Studies (1-5 hours)
- Activity/Behaviors/Processes/Dynamics
- Classification/Category
- Context/Scenes/Situations
- Function
- Objects/Parts
- Parameters
- Task Control
- General Resources
- Visual Process Control (1-5 hours)
- Overview/Issues (1-2 hours)
- Issues
- Quality/Accuracy
- Goal Specification
- Multiple/Single Sensor
- Distribution of Control
- Speed of Response
- What is controlled
- Sensing
- Attention/Focus of processing
- Processing Resources
- Reasoning Directions
- Classes of Control for Vision Systems
- Continuous Process Systems
- Single Image Processes
- Video-rate Systems
- Process Control Technologies (1-3 hours)
- "Expert-System" Control, Knowledge-Based Systems
- Behavior-Based/Reactive Control
- Hierarchical Control
- Heterarchical/Mixed Control
- General Resources
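For a lecturer planning a course from this syllabus, the hour ranges attached to the five components can be totalled as a quick check on the time budget. The following is only a minimal sketch: the hour values are copied from the component headings above, while the names COMPONENT_HOURS and total_hours are hypothetical and introduced purely for illustration.

```python
# Hour ranges (minimum, full) copied from the five component headings
# in the syllabus. The dictionary and function names are illustrative
# assumptions, not part of the syllabus itself.
COMPONENT_HOURS = {
    "Knowledge Representation": (3, 12),
    "Recognition, Categorization and Estimation": (3, 14),
    "Reasoning about Structures and Events": (4, 11),
    "Model Learning": (2, 12),
    "Visual Process Control": (1, 5),
}


def total_hours(hours):
    """Return (minimum total, full total) summed over all components."""
    minimum = sum(low for low, _ in hours.values())
    full = sum(high for _, high in hours.values())
    return minimum, full


if __name__ == "__main__":
    minimum, full = total_hours(COMPONENT_HOURS)
    print(f"minimum coverage: {minimum} hours")
    print(f"full coverage: {full} hours")
```

The full total agrees with the 54 lecture hours assumed in the introduction; a shorter course could adjust the per-component values and re-run the tally.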
Good example areas and Case Studies (See also VAP book)
- Static Image Understanding
- Aerial Image Understanding
- Scene Understanding
- Image Sequence Understanding
- Behavior Analysis
- Movement Analysis
- Walker Identification
- Gesture Analysis
- Abnormal behavior detection
- Expression Understanding