|
MIT CSAIL
6.819/6.869: Advances in Computer Vision |
|
Fall 2015 |
|
Books
Computer vision:
- [Sz] Szeliski, Computer Vision: Algorithms and Applications, Springer, 2010 (online draft)
- [HZ] Hartley and Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, 2004
- [FP] Forsyth and Ponce, Computer Vision: A Modern Approach, Prentice Hall, 2002
- [Pa] Palmer, Vision Science, MIT Press, 1999
Learning:
Graphical models:
Resources
Image datasets:
- Labelme: an online annotation tool to build image databases for computer vision research
- OpenSurfaces: a large database of annotated surfaces created from real-world consumer photographs.
- ImageNet: a large-scale image dataset for visual recognition organized by WordNet hierarchy
- SUN Database: a benchmark for scene recognition and object detection with annotated scene categories and segmented objects
- Places Database: a scene-centric database with 205 scene categories and 2.5 millions of labelled images
- NYU Depth Dataset v2: a RGB-D dataset of segmented indoor scenes
- Microsoft COCO: a new benchmark for image recognition, segmentation and captioning
- Flickr100M: 100 million creative commons Flickr images
- Labeled Faces in the Wild: a dataset of 13,000 labeled face photographs
- Human Pose Dataset: a benchmark for articulated human pose estimation
- YouTube Faces DB: a face video dataset for unconstrained face recognition in videos
- UCF101: an action recognition data set of realistic action videos with 101 action categories
- HMDB-51: a large human motion dataset of 51 action classes
Top computer vision conferences and papers:
- CVPR: IEEE Conference on Computer Vision and Pattern Recognition
- ICCV: International Conference on Computer Vision
- ECCV: European Conference on Computer Vision
- NIPS: Neural Information Processing Systems
Related courses:
Other resources:
MATLAB:
MATLAB at MIT: