Personnel
Instructor: | Selim Aksoy |
---|---|
Office: | EA 423 |
Email: | |
Office Hours: | Thu 11:40-12:30 |
Course Information
Schedule: | Tue 8:40-10:30, Thu 10:40-11:30 (EB 204) |
---|---|
Mailing List: | http://retina.cs.bilkent.edu.tr/mailman/listinfo/cs551-spring2005 |
Prerequisites: | Probability theory, statistics, linear algebra |
Texts: |
|
Lecture Schedule
Chapters | Contents |
---|---|
Introduction to Pattern Recognition [ Slides ] (Feb 3) |
Topics:
Readings:
References:
|
Bayesian Decision Theory [ Slides ] (Feb 8, 10, 15, 17) |
Topics:
Readings:
|
Parametric Models [ Slides: Part 1 | Part 2 | Part 3 | Part 4 ] (Feb 22, 24, Mar 1, 3, 8, 10) |
Topics:
Readings:
References:
|
Non-parametric Methods [ Slides ] (Mar 15, 17) |
Topics:
Readings:
|
Feature Reduction and Selection [ Slides ] (Mar 22, 24) |
Topics:
Readings:
References:
|
Non-Bayesian Classifiers [ Slides: Part 1 | Part 2 | Part 3 ] (Mar 29, 31, Apr 5, 7) |
Topics:
Readings:
References:
|
Spring Break (Apr 11-15) |
No class |
Unsupervised Learning and Clustering [ Slides ] (Apr 19, 21, 26) |
Topics:
Readings:
References:
|
Algorithm-Independent Learning Issues [ Slides ] (Apr 28, May 3, 5, 10) |
Topics:
Readings:
References:
|
Structural and Syntactic Pattern Recognition [ Slides ] (May 12, 17) |
Topics:
Readings:
References:
|
Assignments
- Homework assignment 1 (Due: March 3 as hard copy in class)
- Homework assignment 2 (Due: March 28 as online submission)
- Homework assignment 3 (Due: April 18 as online submission)
- Homework assignment 4 (Due: May 13 as online submission)
Late submission policy: Unless you make prior arrangements with me before the due date, your grade will be multiplied by "1 - 0.33(t - due)", where due is the assignment due date and t is the date you submit it (t = due, due+1, due+2). In other words, each day of delay costs 33% of the grade.
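As a rough illustration of the late submission formula, the sketch below computes the multiplier for a given due date and submission date. The function name `late_multiplier` is hypothetical, and the handling of submissions later than due+2 (keep applying the formula and floor the result at zero) is an assumption, since the policy only lists t = due, due+1, due+2.

```python
from datetime import date, timedelta


def late_multiplier(due: date, submitted: date) -> float:
    """Return the grade multiplier 1 - 0.33 * (t - due), with t - due in whole days."""
    days_late = (submitted - due).days
    if days_late <= 0:
        return 1.0  # submitted on or before the due date: no penalty
    # Assumption: the policy does not cover dates after due+2, so this sketch
    # keeps applying the formula and never lets the multiplier go below zero.
    return max(0.0, 1.0 - 0.33 * days_late)


# Example: Homework assignment 2, due March 28.
due = date(2005, 3, 28)
for offset in range(3):
    t = due + timedelta(days=offset)
    print(t.isoformat(), round(late_multiplier(due, t), 2))
```

Under these assumptions, a submission on the due date keeps the full grade, one day late keeps about 67% of it, and two days late keeps about 34%.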
Project
The purpose of the project is to enable the students to get some hands-on experience in the design, implementation and evaluation of pattern recognition algorithms by applying them to real-world problems. The objective is to try multiple algorithms for different steps of the design cycle such as feature extraction and selection, model learning and estimation, classification and evaluation, to get as high an accuracy as possible on the selected datasets.
You can use your own data from your thesis research, select datasets from the list of data resources below, or contact the instructor for data from ongoing research on multimedia analysis (including video, audio and text data) or image classification. In any case, you should get prior approval before starting your project.
You are free to use any programming language, but Matlab is strongly recommended because it is very convenient for prototyping and has many tools available for pattern recognition. You can write the code yourself or use any code that is available in the public domain. If you use somebody else's code, you are required to know the details of the algorithms that the code implements.
You are required to work in groups of three, submit a project proposal, an interim progress report, and a final report written in conference paper format, and make a poster presentation during finals week. The tentative schedule of the project is as follows:
- Project proposal (due April 8): Submit a 1-2 page proposal that describes the problem you would like to tackle, the objective of the study, the proposed algorithms, the hardware/software tools and data that you plan to use, and your planned evaluation strategies. Also provide a short list of related references.
- Interim progress report (due May 4): Submit a report that describes your progress with the project and your plans for the rest of the semester.
- Final report (due May 24): Submit a readable and well-organized report that provides proper motivation for the task, proper citation and discussion of related literature, proper explanation of the details of the approach and implementation strategies, proper performance evaluation, and detailed discussion of the results. Highlight your contributions and conclusions. Also submit well-documented software with your report.
- Poster presentation (due May 25): Present your work as a poster that fits on a board of approximately 1 m by 1 m. Each team member should also provide a written description of her/his own contributions to the project.
All reports and software can be submitted using the online form.
Final report guidelines:
- Follow the IEEE two-column format as shown in the example and the format definition table and glossary.
- The page limit is 6 pages.
- The report should not have any page numbers, headers or footers.
- You can use IEEE's LaTeX template or Word template. (LaTeX users: Be sure to use the template's conference mode.)
- PDF submission is recommended.
The final reports of all projects are available as a PDF file (Bilkent-only access).
# | Projects | Members |
---|---|---|
1 | Multi-scale region-based remote sensing image classification using a Hidden Markov Model | Gokhan Akcay, Bayram Boyraz, Derya Ozkan |
2 | Segmentation algorithm for news videos based on speech / speech+music classification | Caglar Ari, Osman Tapkan |
3 | A probabilistic content model learning application | Ibrahim Demir, Gonenc Ercan, E. Kartal Tabak |
4 | Region based object recognition using boosting algorithm | Demir Gokalp |
5 | Offline handwritten word recognition | Ayca Ozcelikkale, Deniz Ustebay, Funda Durupinar |
6 | Comparison of unsupervised learning algorithms | Hande Dogan, Sami Ezercan |
7 | Comparison of feature sets for text categorization | Yigithan Dedeoglu, I. Emre Sahin, H. Ozgur Tan, Yasin Uzun |
8 | Frontal face detection and recognition | A. Polat Ay, Kivanc Kose, Ayse Kucukyilmaz, Onur Onder |
9 | Comparison of classification algorithms using semi-supervised learning on middle-scale datasets | M. Erol Aran, Faysal Basci, H. Dogu Taskiran |
10 | A performance analysis of various pattern recognition algorithms on hazelnut impact sounds | Berkan Dulek, Ibrahim Onaran, Mehmet Turkan |
11 | Discrimination between upstream and downstream genes | Ozgun Babur |
12 | A comparative study of different techniques to target differentiation and localization with infrared sensors | Cagri Yuzbasioglu, Ilker Bayram, Serkan Onart, Tayfun Aytac |
Grading Policy
Homework and quiz: | 55% |
Term project: | 40% |
Class participation: | 5% |
Grades for all quizzes, homework assignments and the project are available (Bilkent-only access), and also on SAPS.
Related Links
- Pattern Classification Book (Duda, Hart, Stork)
- Book's website
- Make sure you check the errata for the particular printing you have.
- Software Resources
- PRTools by the Delft Pattern Recognition Group (in Matlab) (local copy)
- Netlab Neural Network Software (in Matlab) (local copy of software and its documentation)
- Weka Data Mining Software (in Java)
- Bayes Net Toolbox (in Matlab)
- Hidden Markov Model Toolbox (in Matlab)
- SVMlight - SVM training package (in C)
- Sequential Minimal Optimization algorithm for SVM training
- LIBSVM - A Library for SVM (in C++ and Java, with interfaces for additional languages)
- Numerical Recipes (in C)
- Software resources from Pattern Recognition Information web site
- Software resources from Kevin Murphy's web site
- Software resources from Kernel Machines web site
- Software resources from Kernel Methods web site
- Software resources from American Association for Artificial Intelligence web site
- StatLib
- Mathtools.net Technical Computing (in Matlab, C/C++, Java)
- Matlab Tutorials
- Data Resources
- Pattern recognition related archives
- Computer vision test images
- UCI Machine Learning Repository
- Image database from the University of Washington
- Texture database from the University of Oulu
- Document database from the University of Oulu
- Other databases from the University of Oulu
- Image databases from CMU Vision and Autonomous Systems Center
- Various other datasets from the University of Washington
- Face databases from CMU
- Face databases from MIT
- Another page for face databases
- MNIST Database of handwritten digits
- Shape database from Brown University
- Reuters-21578 Text Categorization Collection
- NIST Scientific and Technical Databases
- RISC: Repository of Information on Semi-supervised Clustering
- Others
- Pattern Recognition Information
- International Association for Pattern Recognition (IAPR)
- IEEE Computer Society Technical Committee on Pattern Analysis and Machine Intelligence (PAMI)
- IAPR Technical Committee 1 on Statistical Techniques in Pattern Recognition
- IAPR Technical Committee 2 on Structural and Syntactical Pattern Recognition
- International Society for Bayesian Analysis
- Statistical Learning/Pattern Recognition Glossary
- Statistical Data Mining Tutorials
- Kernel Machines
- Learning with Kernels
- Engineering Statistics Handbook
- Introductory Statistics: Concepts, Models, and Applications
- The Probability Web