Computer Vision
CS 554
Spring 2004
Department of Computer Engineering, Bilkent University
- Instructor: Pinar Duygulu
- Office : EA 433
- e-mail : duygulu[at]cs.bilkent.edu.tr
- Phone : (312) 290 31 43
- Office hours: Tuesdays 16:40-17:30 and by
appointment..
- Textbook: Computer Vision - A
modern Aproach
by David A. Forsyth & Jean Ponce, Prentice Hall, Ed. 1,
2002.
- Related Material : http://www.cs.bilkent.edu.tr/~duygulu/CVlinks.html
- Course web page: http://www.cs.bilkent.edu.tr/~duygulu/Courses/CS554/Spring2004
- Time & Location: Tuesdays 13:40-16:30 and
Fridays 9:40-12:30, EA 502
- Course Description:
Basic concepts in computational vision. Relation to human visual
perception. The analysis and understanding of image and video data.
Mathematical foundations, image formation and representation,
segmentation, feature extraction, contour and region analysis, camera
geometry and calibration, stereo, motion, 3-D reconstruction, object
and scene recognition, object and people tracking, human activity
recognition and inference.
- Prerequisites:Knowledge
of linear algebra and calculus, probability and statistics
- Grading:
- Homeworks (Implementation or paper review) - %35
- Paper presentations - %15
- Term project (Demonstration + Report + Presentation) : %50
- Assignments
- Due: April 9
- Write a summary of Canny's edge detection paper which will
be distributed in the class on Tuesday (April 6)
- Due: April 13
- Write a matlab program that produces a Gaussian pyramid
from an image (image1.jpg)
- Write a summary of David Lowe's paper on distinctive image
features
- Due : April 30
- Programming Assignment
- Find salient points in the image and compute features aroun
them to detect objects in cluttered backgroud or with occlusion and for
matching images (images)
- Write a summary of either Tomasi and Kanade or Lucas Kanade
paper
- Due : May 7
- Write a summary of Normalized Cuts paper
- shorter version of the paper is available now
- Due : May 20
- Programming
assignment
- Retrieve images from a database for a
query image and give rankings
- Compare images using
- global color histogram extracted from the
whole image, and
- RGB mean and standard deviation values
from the regions.
- Due : May 28, 13:00
- Project poster presentation
- Syllabus
- [April 2] Introduction
- Readings:
- Matlab primer (.pdf) by
Kermit Sigmon, University of Florida
- Basic math (.pdf),
figures (.pdf) by Martial
Hebert, CMU
- Links
- [April 6] Linear filters
- Topics
- Linear filters, convolution, smoothing,
derivatives, Fourier transform, sampling and aliazing, gaussian
pyramids (slides)
- Readings
- Computer
vision for interactive computer graphics,W. T. Freeman, D.
Anderson,
P. Beardsley, C. Dodge,
H. Kage, K. Kyuma,
Y. Miyake,
M. Roth,
K. Tanaka,
C. Weissman,
W. Yerazunis, in IEEE Computer Graphics and
Applications, volume 18, number 3, May--June, pp. 42-53, 1998.
- Links
- [April 9] Edge detection
- Derivatives,
Edge detection, Hough Transform (slides)
- A
Computational Approach to Edge
Detection, J. Canny, IEEE Transactions on
Pattern Analysis and Machine
Intelligence, Vol 8, No. 6, Nov 1986.
- [April 13] Texture
- Topics
- Texture analysis and
synthesis (slides)
- Presentation - by Ozcan Oksuz
- [April 16] Interest Points
- Topics
- Harris Detector, Local invariant points,
SIFT (slides)
- Readings
- A
combined corner and edge detector, Chris Harris and Mike Stephans,
Proceedings of The Fourth Alvey Vision Conference, Manchester, pp
147-151. 1988
- Local
Greyvalue Invariants for Image Retrieval,
C. Schmid and R. Mohr. In Pattern Analysis and Machine Intelligence,
1997.
- Indexing
based on scale invariant interest points. K.Mikolajczyk and
C.Schmid. In International
Conference on Computer Vision, 525-531, 2001
- Distinctive
Image Features from
Scale-Invariant Keypoints, David Lowe, International Journal of
Computer Vision, 2004.
- [April 20] Color
- Readings
- Chapters 4 and 6
- Colour in nature, book chapter
- Presentation
- Guest Lecture
- [April 23] no class
- [April 27] Cameras
- Topics
- Image formation, Camera geometry, Camera
calibration (slides)
- Readings
- Presentation
- by Tayfun Kucukyilmaz Face
recognition using eigenfaces, M. Turk and A. Pentland, Proc. IEEE
Conference on Computer Vision and Pattern Recognition, Maui, Hawaii,
1991. (slides)
- by Nazli Ikizler Name-It:
Naming and Detecting Faces in News Videos, Satoh, S., Nakamura,
Y., Kanade, T., IEEE Multimedia, 6(1), January-March, 1999 (slides)
- [April 30] Multi-view geometry &
Stereo
- Topics
- Stereopsis, Epipolar geometry, Matching,
Reconstruction (slides)
- [May 4] Motion
- Topics
- Optical flow, structure from motion,
Tracking (slides)
- [May 7] Segmentation
- Topics
- Segmentation and Grouping, Line fitting (slides)
- Readings
- Normalized
cuts and image
segmentation, Jianbo Shi and Jitendra Malik, IEEE Transactions on
Pattern Analysis and Machine Intelligence(PAMI), 2000
- a shorter
version published in IEEE Conf. Computer Vision and Pattern
Recognition(CVPR), June 1997, Puerto Rico
- Laws
of Organization in Perceptual Forms, Max Wertheimer, first
published as
Untersuchungen zur Lehre von der Gestalt II, in Psycologische
Forschung, 4, 301-350. Translation published in Ellis, W. (1938). A
source book of Gestalt psychology (pp. 71-88). London: Routledge &
Kegan Paul.
- Learning
to Detect Natural Image Boundaries Using Local Brightness, Color, and
Texture Cues, D. Martin, C. Fowlkes, and J. Malik. to
appear in PAMI
- Presentation
- by Behcet Ugur Toreyin
Mean Shift: A Robust Approach Toward Feature Space Analysis,
Dorin Comaniciu and Peter Meer, IEEE Transactions on Pattern Analysis
and Machine Intelligence, Vol. 24, no.5 , May 2002. (slides)
- by Ayisigi Basak Sevdik, Body
Plans, Forsyth, D.A. and Fleck, M.M., Proc. in Proceedings of
Computer Vision and Pattern Recognition (CVPR), 1997. (slides)
- [May 11] Recognition
- Topics
- Model based and template matching based
methods for recognition (slides)
- Presentation
- by Rabia Nuray Finding
and tracking people from the bottom up, Deva Ramanan and David
Forsyth, in Proceedings of Computer
Vision and Pattern Recognition (CVPR), Madison, Wisconsin, June
2003. (slides)
- by Tayfun Aytac Object
Class Recognition by Unsupervised Scale-Invariant Learning, Rob
Fergus, Pietro Perona, and Andrew Zisserman, Conference on
Computer Vision and Pattern Recognition, (2003). (slides)
- [May 14] Applications - Image and Video databases
- Topics
- Retrieval, browsing and other novel
applications on large datasests (slides)
- Links
- Presentation
- by Erhan Okuyan, Faces
and Names in the News Tamara Miller, Alexander C. Berg, Jaety
Edwards, Michael Maire,
Ryan White, Yee Whye Teh, Eric Learned-Miller, David A. Forsyth, To
Appear CVPR 2004 (slides)
- [May 18]