6.869 Course Materials

	MIT CSAIL 6.819/6.869: Advances in Computer Vision
Fall 2015
[Home \| Schedule \| Course Materials \| Final Project \| Piazza \| Stellar ]

Computer vision:

[Sz] Szeliski, Computer Vision: Algorithms and Applications, Springer, 2010 (online draft)
[HZ] Hartley and Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2004
[FP] Forsyth and Ponce, Computer Vision: A Modern Approach, Prentice Hall, 2002
[Pa] Palmer, Vision Science, MIT Press, 1999

Learning:

[Mi] Mitchel, Machine Learning, McGraw-Hill, 1997
[DHS] Duda, Hart and Stork, Pattern Classification (2nd Edition), Wiley-Interscience, 2000

Graphical models:

[KF] Koller and Friedman, Probabilistic Graphical Models: Principles and Techniques, MIT Press, 2009

Image datasets:

Labelme: an online annotation tool to build image databases for computer vision research
OpenSurfaces: a large database of annotated surfaces created from real-world consumer photographs.
ImageNet: a large-scale image dataset for visual recognition organized by WordNet hierarchy
SUN Database: a benchmark for scene recognition and object detection with annotated scene categories and segmented objects
Places Database: a scene-centric database with 205 scene categories and 2.5 millions of labelled images
NYU Depth Dataset v2: a RGB-D dataset of segmented indoor scenes
Microsoft COCO: a new benchmark for image recognition, segmentation and captioning
Flickr100M: 100 million creative commons Flickr images
Labeled Faces in the Wild: a dataset of 13,000 labeled face photographs
Human Pose Dataset: a benchmark for articulated human pose estimation
YouTube Faces DB: a face video dataset for unconstrained face recognition in videos
UCF101: an action recognition data set of realistic action videos with 101 action categories
HMDB-51: a large human motion dataset of 51 action classes

Top computer vision conferences and papers:

Related courses:

Other resources:

MATLAB:

MATLAB at MIT: